COMPOSITIONS, KITS, AND METHODS FOR
IDENTIFICATION, ASSESSMENT, PREVENTION, AND THERAPY OF 

OVARIAN CANCER 

RELATED APPLICATIONS

The present application claims priority to U.S. provisional patent application
serial no. 60/191,031, filed on March 21, 2000, U.S. provisional patent application serial
no. 60/207,124, filed on May 25, 2000, U.S. provisional patent application serial no.
60/211,940, filed on June 15, 2000, U.S. provisional patent application serial no.
60/216,820, filed on July 7, 2000, U.S. provisional patent application serial no.
60/220,661, filed on July 25, 2000, and U.S. provisional patent application serial no.
60/257,672, filed on December 21, 2000, all of which are expressly incorporated by
reference.

FIELD OF THE INVENTION

The field of the invention is ovarian cancer, including diagnosis, 
characterization, management, and therapy of ovarian cancer. 

BACKGROUND OF THE INVENTION 
Ovarian cancer is responsible for significant morbidity and mortality in
populations around the world. Ovarian cancer is classified, on the basis of clinical and
pathological features, in three groups, namely epithelial ovarian cancer (EOC; >90% of
ovarian cancer in Western countries), germ cell tumors (circa 2-3% of ovarian cancer),
and stromal ovarian cancer (circa 5% of ovarian cancer; Ozols et al., 1997, Cancer
Principles and Practice of Oncology, 5th ed., DeVita et al., Eds., pp. 1502). Relative to
EOC, germ cell tumors and stromal ovarian cancers are more easily detected and treated
at an early stage, translating into higher survival rates for patients afflicted with
these two types of ovarian cancer.

There are numerous types of ovarian tumors, some of which are benign, and
others of which are malignant. Treatment (including non-treatment) options and
predictions of patient outcome depend on accurate classification of the ovarian cancer.
Ovarian cancers are named according to the type of cells from which the cancer is
derived and whether the ovarian cancer is benign or malignant. Recognized histological
tumor types include, for example, serous, mucinous, endometrioid, and clear cell
tumors. In addition, ovarian cancers are classified according to recognized grade and
stage scales.

In grade I, the tumor tissue is well differentiated from normal ovarian tissue. In
grade II, tumor tissue is moderately well differentiated. In grade III, the tumor tissue is
poorly differentiated from normal tissue, and this grade correlates with a less favorable
prognosis than grades I and II. Stage I is generally confined within the capsule
surrounding one (stage IA) or both (stage IB) ovaries, although in some stage I (i.e.
stage IC) cancers, malignant cells may be detected in ascites, in peritoneal rinse fluid, or
on the surface of the ovaries. Stage II involves extension or metastasis of the tumor
from one or both ovaries to other pelvic structures. In stage IIA, the tumor extends or
has metastasized to the uterus, the fallopian tubes, or both. Stage IIB involves extension
of the tumor to the pelvis. Stage IIC is stage IIA or IIB in which malignant cells may be
detected in ascites, in peritoneal rinse fluid, or on the surface of the ovaries. In stage III,
the tumor comprises at least one malignant extension to the small bowel or the
omentum, has formed extrapelvic peritoneal implants of microscopic (stage IIIA) or
macroscopic (< 2 centimeter diameter, stage IIIB; > 2 centimeter diameter, stage IIIC)
size, or has metastasized to a retroperitoneal or inguinal lymph node (an alternate
indicator of stage IIIC). In stage IV, distant (i.e. non-peritoneal) metastases of the tumor
can be detected.

The durations of the various stages of ovarian cancer are not presently known,
but are believed to be at least about a year each (Richart et al., 1969, Am. J. Obstet.
Gynecol. 105:386). Prognosis declines with increasing stage designation. For example,
5-year survival rates for patients diagnosed with stage I, II, III, and IV ovarian cancer
are 80%, 57%, 25%, and 8%, respectively.

Despite being the third most prevalent gynecological cancer, ovarian cancer is
the leading cause of death among those afflicted with gynecological cancers. The
disproportionate mortality of ovarian cancer is attributable to a substantial absence of
symptoms among those afflicted with early-stage ovarian cancer and to difficulty
diagnosing ovarian cancer at an early stage. Patients afflicted with ovarian cancer most
often present with non-specific complaints, such as abnormal vaginal bleeding,
gastrointestinal symptoms, urinary tract symptoms, lower abdominal pain, and
generalized abdominal distension. These patients rarely present with paraneoplastic
symptoms or with symptoms which clearly indicate their affliction. Presently, less than
about 40% of patients afflicted with ovarian cancer present with stage I or stage II.
Management of ovarian cancer would be significantly enhanced if the disease could be
detected at an earlier stage, when treatments are generally much more efficacious.

Ovarian cancer may be diagnosed, in part, by collecting a routine medical history
from a patient and by performing physical examination, x-ray examination, and
chemical and hematological studies on the patient. Hematological tests which may be
indicative of ovarian cancer in a patient include analyses of serum levels of proteins
designated CA125 and DF3 and plasma levels of lysophosphatidic acid (LPA).
Palpation of the ovaries and ultrasound techniques (particularly including endovaginal
ultrasound and color Doppler flow ultrasound techniques) can aid detection of ovarian
tumors and differentiation of ovarian cancer from benign ovarian cysts. However, a
definitive diagnosis of ovarian cancer typically requires performing exploratory
laparotomy of the patient.

Potential tests for the detection of ovarian cancer (e.g., screening, reflex or
monitoring) may be characterized by a number of factors. The "sensitivity" of an assay
refers to the probability that the test will yield a positive result in an individual afflicted
with ovarian cancer. The "specificity" of an assay refers to the probability that the test
will yield a negative result in an individual not afflicted with ovarian cancer. The
"positive predictive value" (PPV) of an assay is the ratio of true positive results (i.e.
positive assay results for patients afflicted with ovarian cancer) to all positive results
(i.e. positive assay results for patients afflicted with ovarian cancer + positive assay
results for patients not afflicted with ovarian cancer). It has been estimated that in order
for an assay to be an appropriate population-wide screening tool for ovarian cancer the
assay must have a PPV of at least about 10% (Rosenthal et al., 1998, Sem. Oncol.
25:315-325). It would thus be desirable for a screening assay for detecting ovarian
cancer in patients to have a high sensitivity and a high PPV. Monitoring and reflex tests
would also require appropriate specifications.

Owing to the cost, limited sensitivity, and limited specificity of known methods
of detecting ovarian cancer, screening is not presently performed for the general
population. In addition, the need to perform laparotomy in order to diagnose ovarian
cancer in patients who screen positive for indications of ovarian cancer limits the
desirability of population-wide screening, such that a PPV even greater than 10% would
be desirable.
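
The definitions of sensitivity, specificity, and PPV given above can be illustrated with a minimal Python sketch. The class name, the function names, and the counts are hypothetical and are chosen only for illustration; they are not data from any study.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AssayCounts:
    """Hypothetical tally of assay results against true disease status."""
    true_pos: int   # positive result, patient afflicted with ovarian cancer
    false_neg: int  # negative result, patient afflicted with ovarian cancer
    true_neg: int   # negative result, patient not afflicted
    false_pos: int  # positive result, patient not afflicted


def sensitivity(c: AssayCounts) -> float:
    """Probability of a positive result in an afflicted individual."""
    return c.true_pos / (c.true_pos + c.false_neg)


def specificity(c: AssayCounts) -> float:
    """Probability of a negative result in a non-afflicted individual."""
    return c.true_neg / (c.true_neg + c.false_pos)


def positive_predictive_value(c: AssayCounts) -> float:
    """Ratio of true positive results to all positive results."""
    return c.true_pos / (c.true_pos + c.false_pos)


# Invented counts, for illustration only.
counts = AssayCounts(true_pos=8, false_neg=2, true_neg=9900, false_pos=90)
print(sensitivity(counts), specificity(counts), positive_predictive_value(counts))
```

With these invented counts the assay is highly specific, yet its PPV remains below the 10% threshold discussed above, because non-afflicted individuals greatly outnumber afflicted ones.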

Prior use of serum CA125 level as a diagnostic marker for ovarian cancer
indicated that this method exhibited insufficient specificity for use as a general
screening method. Use of a refined algorithm for interpreting CA125 levels in serial
retrospective samples obtained from patients improved the specificity of the method
without shifting detection of ovarian cancer to an earlier stage (Skates, 1995, Cancer
76:2004). Screening for LPA to detect gynecological cancers including ovarian cancer
exhibited a sensitivity of about 96% and a specificity of about 89%. However,
CA125-based screening methods and LPA-based screening methods are hampered by the
presence of CA125 and LPA, respectively, in the serum of patients afflicted with
conditions other than ovarian cancer. For example, serum CA125 levels are known to
be associated with menstruation, pregnancy, gastrointestinal and hepatic conditions such
as colitis and cirrhosis, pericarditis, renal disease, and various non-ovarian malignancies.
Serum LPA is known, for example, to be affected by the presence of non-ovarian
gynecological malignancies. A screening method having a greater specificity for
ovarian cancer than the current screening methods for CA125 and LPA could provide a
population-wide screening for early stage ovarian cancer.
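
The effect of specificity on the usefulness of a screening method can be sketched with Bayes' rule. The Python fragment below is an illustration only: the function names are hypothetical, and the prevalence value is an assumption introduced here for the arithmetic, not a figure taken from this document. The sensitivity of about 96% and specificity of about 89% are the LPA figures quoted above.

```python
def ppv_from_rates(sens: float, spec: float, prevalence: float) -> float:
    """PPV of an assay from sensitivity, specificity, and disease prevalence."""
    true_pos = sens * prevalence                    # afflicted and test-positive
    false_pos = (1.0 - spec) * (1.0 - prevalence)   # not afflicted, test-positive
    return true_pos / (true_pos + false_pos)


def min_specificity_for_ppv(sens: float, prevalence: float, target_ppv: float) -> float:
    """Smallest specificity giving at least target_ppv (solves the PPV formula)."""
    max_false_pos_rate = (
        sens * prevalence * (1.0 - target_ppv) / (target_ppv * (1.0 - prevalence))
    )
    return 1.0 - max_false_pos_rate


# ASSUMPTION: illustrative prevalence only; not a value stated in this document.
ASSUMED_PREVALENCE = 0.0004

lpa_ppv = ppv_from_rates(sens=0.96, spec=0.89, prevalence=ASSUMED_PREVALENCE)
needed_spec = min_specificity_for_ppv(0.96, ASSUMED_PREVALENCE, target_ppv=0.10)
print(f"PPV at assumed prevalence: {lpa_ppv:.4f}")
print(f"Specificity needed for a 10% PPV: {needed_spec:.4f}")
```

Under the assumed prevalence, even a small false positive rate swamps the true positives, which is consistent with the emphasis above on greater specificity.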

Presently, greater than about 60% of ovarian cancers diagnosed in patients are
stage III or stage IV cancers. Treatment at these stages is largely limited to
cytoreductive surgery (when feasible) and chemotherapy, both of which aim to slow the
spread and development of metastasized tumor. Substantially all late stage ovarian
cancer patients currently undergo combination chemotherapy as primary treatment,
usually a combination of a platinum compound and a taxane. Median survival for
responding patients is about one year. Combination chemotherapy involving agents
such as doxorubicin, cyclophosphamide, cisplatin, hexamethylmelamine, paclitaxel, and
methotrexate may improve survival rates in these groups, relative to single-agent
therapies. Various recently-developed chemotherapeutic agents and treatment regimens
have also demonstrated usefulness for treatment of advanced ovarian cancer. For
example, use of the topoisomerase I inhibitor topotecan, use of amifostine to minimize
chemotherapeutic side effects, and use of intraperitoneal chemotherapy for patients
having peritoneally implanted tumors have demonstrated at least limited utility.
Presently, however, the 5-year survival rate for patients afflicted with stage III ovarian
cancer is 25%, and the survival rate for patients afflicted with stage IV ovarian cancer is
8%.

In summary, the earlier ovarian cancer is detected, the more the aggressiveness of
therapeutic intervention and the side effects associated with therapeutic intervention are
minimized. More importantly, the earlier the cancer is detected, the more the survival rate
and quality of life of ovarian cancer patients are enhanced. Thus, a pressing need exists for
methods of detecting ovarian cancer as early as possible. There also exists a need for
methods of detecting recurrence of ovarian cancer as well as methods for predicting and
monitoring the efficacy of treatment. The present invention satisfies these needs.

SUMMARY OF THE INVENTION 
The invention relates to novel genes associated with ovarian cancer as well as
methods of assessing whether a patient is afflicted with ovarian cancer. This method
comprises the step of comparing the level of expression of a marker in a patient sample,
wherein the marker is listed in Tables 1-2, and the normal level of expression of the
marker in a control, e.g., a sample from a patient without ovarian cancer. A significant
difference between the level of expression of the marker in the patient sample and the
normal level is an indication that the patient is afflicted with ovarian cancer. Preferably,
a protein corresponding to the marker is a secreted protein. Alternatively, the marker
can correspond to a protein having an extracellular portion, to one which is normally
expressed in ovarian tissue at a detectable level, or both.

In one method, the marker(s) are preferably selected such that the positive
predictive value of the method is at least about 10%. Also preferred are embodiments of
the method wherein the marker is over- or under-expressed by at least two-fold in at
least about 20% of stage I ovarian cancer patients, stage II ovarian cancer patients, stage
III ovarian cancer patients, stage IV ovarian cancer patients, grade I ovarian cancer
patients, grade II ovarian cancer patients, grade III ovarian cancer patients, epithelial
ovarian cancer patients, stromal ovarian cancer patients, germ cell ovarian cancer
patients, malignant ovarian cancer patients, benign ovarian patients, serous neoplasm
ovarian cancer patients, mucinous neoplasm ovarian cancer patients, endometrioid
neoplasm ovarian cancer patients and/or clear cell neoplasm ovarian cancer patients.
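
The preference stated above (at least two-fold over- or under-expression in at least about 20% of a patient group) can be checked with a short Python sketch. The function names and the expression levels are hypothetical; the levels are invented numbers in arbitrary units.

```python
def fraction_two_fold(patient_levels, normal_level, min_fold=2.0):
    """Fraction of patients whose marker level is at least min_fold above
    or at least min_fold below the normal level."""
    if not patient_levels:
        raise ValueError("at least one patient level is required")
    hits = sum(
        1
        for level in patient_levels
        if level >= min_fold * normal_level or level <= normal_level / min_fold
    )
    return hits / len(patient_levels)


def meets_preference(patient_levels, normal_level, min_fold=2.0, min_fraction=0.20):
    """True if the marker is altered min_fold in at least min_fraction of patients."""
    return fraction_two_fold(patient_levels, normal_level, min_fold) >= min_fraction


# Invented expression levels for one patient group (arbitrary units).
stage_one_levels = [1.0, 1.2, 2.5, 0.4, 0.9]
print(fraction_two_fold(stage_one_levels, normal_level=1.0))
print(meets_preference(stage_one_levels, normal_level=1.0))
```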

In one embodiment of the methods of the present invention, the patient sample is
an ovary-associated body fluid. Such fluids include, for example, blood fluids, lymph,
ascitic fluids, gynecological fluids, cystic fluids, urine, and fluids collected by peritoneal
rinsing. In another embodiment, the sample comprises cells obtained from the patient.
In this embodiment, the cells may be found in a fluid selected from the group consisting
of a fluid collected by peritoneal rinsing, a fluid collected by uterine rinsing, a uterine
fluid, a uterine exudate, a pleural fluid, and an ovarian exudate. In another embodiment,
the patient sample is in vivo.

In accordance with the methods of the present invention, the level of expression
of the marker in a sample can be assessed, for example, by detecting the presence in the
sample of:

• a protein corresponding to the marker or fragment of the protein (e.g.
using a reagent, such as an antibody, an antibody derivative, or an antibody
fragment, which binds specifically with the protein);

• a transcribed polynucleotide (e.g. an mRNA or a cDNA), or fragment
thereof, having at least a portion with which the marker is substantially
homologous (e.g. by contacting a mixture of transcribed polynucleotides
obtained from the sample with a substrate having one or more of the markers
listed in Tables 1-2 fixed thereto at selected positions);

• a transcribed polynucleotide or fragment thereof, wherein the
polynucleotide anneals with the marker under stringent hybridization
conditions; or

• a metabolite which is produced directly (i.e., catalyzed) or indirectly by a
protein corresponding to the marker.

The methods of the present invention are particularly useful for patients with an
identified pelvic mass or symptoms associated with ovarian cancer. The methods of the
present invention can also be of particular use with patients having an enhanced risk of
developing ovarian cancer (e.g., patients having a familial history of ovarian cancer,
patients identified as having a mutant oncogene, and patients at least about 50 years of
age). The methods of the present invention may further be of particular use in
monitoring the efficacy of treatment of an ovarian cancer patient (e.g. the efficacy of
chemotherapy).

The methods of the present invention may be performed using a plurality (e.g. 2,
3, 5, or 10 or more) of markers. According to a method involving a plurality of markers,
the level of expression in the sample of each of a plurality of markers independently
selected from the markers listed in Tables 1-2 is compared with the normal level of
expression of each of the plurality of markers in samples of the same type obtained from
control humans not afflicted with ovarian cancer. The markers of Tables 1-2 may also
be used in combination with known ovarian cancer markers in the methods of the
present invention.
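
A comparison involving a plurality of markers can be sketched in Python as follows. This is only an illustration: the marker names are placeholders rather than entries of Tables 1-2, the levels are invented, and a two-fold threshold is assumed as the criterion for a difference.

```python
def differing_markers(sample_levels, normal_levels, min_fold=2.0):
    """Return the markers whose level in the patient sample is at least
    min_fold above or below the corresponding normal level."""
    flagged = []
    for marker, normal in normal_levels.items():
        level = sample_levels[marker]
        if level >= min_fold * normal or level <= normal / min_fold:
            flagged.append(marker)
    return flagged


# Placeholder marker names and invented levels (arbitrary units).
normal_levels = {"marker_A": 10.0, "marker_B": 5.0, "marker_C": 8.0}
sample_levels = {"marker_A": 25.0, "marker_B": 5.5, "marker_C": 3.0}

print(differing_markers(sample_levels, normal_levels))
```

Each marker is compared against its own normal level, so markers with very different baseline expression can be combined in one panel.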

In a preferred method of assessing whether a patient is afflicted with ovarian
cancer (e.g., new detection ("screening"), detection of recurrence, reflex testing), the
method comprises comparing:

a) the level of expression of a marker in a patient sample, wherein at least
one marker is selected from the markers of Tables 1-2, and

b) the normal level of expression of the marker in a control non-ovarian
cancer sample.

A significant difference between the level of expression of the marker in the patient
sample and the normal level is an indication that the patient is afflicted with ovarian
cancer.

The methods of the present invention further include a method of assessing the
efficacy of a test compound for inhibiting ovarian cancer in a patient. This method
comprises comparing:

a) expression of a marker in a first sample obtained from the patient and
maintained in the presence of the test compound, wherein the marker is
selected from the group consisting of the markers listed in Tables 1-2,
and

b) expression of the marker in a second sample obtained from the patient
and maintained in the absence of the test compound.

A significant difference between the level of expression of the marker in the first
sample, relative to the second sample, is an indication that the test compound is
efficacious for inhibiting ovarian cancer in the patient. For example, the first and
second samples can be portions of a single sample obtained from the patient or portions
of pooled samples obtained from the patient.

The invention further relates to a method of assessing the efficacy of a therapy
for inhibiting ovarian cancer in a patient. This method comprises comparing:

a) expression of a marker in a first sample obtained from the patient prior
to providing at least a portion of the therapy to the patient, wherein the
marker is selected from the group consisting of the markers listed in
Tables 1-2, and

b) expression of the marker in a second sample obtained from the patient
following provision of the portion of the therapy.

A significant difference between the level of expression of the marker in the second
sample, relative to the first sample, is an indication that the therapy is efficacious for
inhibiting ovarian cancer in the patient.

It will be appreciated that in these methods the "therapy" may be any traditional
therapy for treating ovarian cancer including, but not limited to, chemotherapy, radiation
therapy and surgical removal of tissue, e.g., an ovarian tumor. Thus, the methods of the
invention may be used to evaluate a patient before, during and after therapy, for
example, to evaluate the reduction in tumor burden.

The present invention therefore further comprises a method for monitoring the
progression of ovarian cancer in a patient, the method comprising:

a) detecting in a patient sample at a first time point, the expression of a marker,
wherein the marker is selected from the group consisting of the markers listed in Tables
1-2;

b) repeating step a) at a subsequent point in time; and

c) comparing the level of expression detected in steps a) and b), and therefrom
monitoring the progression of ovarian cancer in the patient.
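
Steps a) through c) above can be sketched in Python as a comparison of marker levels at successive time points. The function name, the time labels, and the levels are hypothetical and serve only to illustrate the comparison in step c).

```python
def expression_changes(series):
    """Given (time_label, level) pairs ordered by time, return a list of
    (earlier_label, later_label, ratio) for each pair of consecutive time
    points, where ratio = later level / earlier level."""
    return [
        (t0, t1, level1 / level0)
        for (t0, level0), (t1, level1) in zip(series, series[1:])
    ]


# Invented measurements of one marker in serial samples from one patient.
measurements = [("month 0", 4.0), ("month 3", 6.0), ("month 6", 12.0)]

for earlier, later, ratio in expression_changes(measurements):
    print(f"{earlier} -> {later}: {ratio:.2f}-fold")
```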

The invention also includes a method of selecting a composition for inhibiting
ovarian cancer in a patient. This method comprises the steps of:

a) obtaining a sample comprising cancer cells from the patient;

b) separately maintaining aliquots of the sample in the presence of a
plurality of test compositions;

c) comparing expression of a marker listed in Tables 1-2 in each of the
aliquots; and

d) selecting one of the test compositions which alters the level of
expression of the marker in the aliquot containing that test composition,
relative to other test compositions.
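
Steps c) and d) above can be sketched in Python. The sketch assumes, purely for illustration, that the marker is one that is over-expressed in ovarian cancer, so that the composition giving the lowest expression level among the aliquots is the one selected; the composition names and levels are invented.

```python
def select_composition(aliquot_levels):
    """Return the test composition whose aliquot shows the lowest marker
    expression, relative to the other test compositions.

    ASSUMPTION: the marker is over-expressed in ovarian cancer, so a lower
    level is taken to be the desired alteration.
    """
    if not aliquot_levels:
        raise ValueError("at least one aliquot is required")
    return min(aliquot_levels, key=aliquot_levels.get)


# Invented marker levels measured in aliquots of one patient sample,
# each maintained in the presence of a different test composition.
aliquot_levels = {
    "composition_1": 9.5,
    "composition_2": 3.1,
    "composition_3": 8.8,
}

print(select_composition(aliquot_levels))
```

For a marker that is under-expressed in ovarian cancer, the same comparison would instead select the composition giving the highest level.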

In addition, the invention includes a method of inhibiting ovarian cancer in a
patient. This method comprises the steps of:

a) obtaining a sample comprising cancer cells from the patient;

b) separately maintaining aliquots of the sample in the presence of a
plurality of test compositions;

c) comparing expression of a marker listed in Tables 1-2 in each of the
aliquots; and

d) administering to the patient at least one of the test compositions which
alters the level of expression of the marker in the aliquot containing that
test composition, relative to other test compositions.

The invention also includes a kit for assessing whether a patient is afflicted with
ovarian cancer. This kit comprises reagents for assessing expression of a marker listed
in Tables 1-2.

In another aspect, the invention relates to a kit for assessing the suitability of 
each of a plurality of compounds for inhibiting an ovarian cancer in a patient. The kit 
comprises a reagent for assessing expression of a marker listed in Tables 1-2, and may 
also comprise a plurality of compounds. 

In another aspect, the invention relates to a kit for assessing the presence of
ovarian cancer cells. This kit comprises an antibody, wherein the antibody binds
specifically with a protein corresponding to a marker listed in Tables 1-2. The kit may
also comprise a plurality of antibodies, wherein each antibody of the plurality binds
specifically with a protein corresponding to a different marker listed in Tables 1-2.

The invention also includes a kit for assessing the presence of ovarian cancer
cells, wherein the kit comprises a nucleic acid probe. The probe binds specifically with
a transcribed polynucleotide corresponding to a marker listed in Tables 1-2. The kit
may also comprise a plurality of probes, wherein each of the probes binds specifically
with a transcribed polynucleotide corresponding to a different marker listed in Tables
1-2.

The invention further relates to a method of making an isolated hybridoma which
produces an antibody useful for assessing whether a patient is afflicted with ovarian
cancer. The method comprises isolating a protein corresponding to a marker listed in
Tables 1-2, immunizing a mammal using the isolated protein, isolating splenocytes from
the immunized mammal, fusing the isolated splenocytes with an immortalized cell line
to form hybridomas, and screening individual hybridomas for production of an antibody
which specifically binds with the protein to isolate the hybridoma. The invention also
includes an antibody produced by this method.

The invention further includes a method of assessing the ovarian carcinogenic
potential of a test compound. This method comprises the steps of:

a) maintaining separate aliquots of ovarian cells in the presence and
absence of the test compound; and

b) comparing expression of a marker in each of the aliquots.

The marker is selected from those listed in Tables 1-2. A significantly altered
level of expression of the marker in the aliquot maintained in the presence of (or
exposed to) the test compound, relative to the aliquot maintained in the absence of the
test compound, is an indication that the test compound possesses ovarian carcinogenic
potential.

Additionally, the invention includes a kit for assessing the ovarian carcinogenic
potential of a test compound. The kit comprises ovarian cells and a reagent for
assessing expression of a marker in each of the aliquots. The marker is selected from
those listed in Tables 1-2.

The invention further relates to a method of treating a patient afflicted with
ovarian cancer or at risk of developing ovarian cancer. This method comprises
enhancing expression of a marker listed in Tables 1-2 or providing to cells of the patient
a protein corresponding to a marker listed in Tables 1-2, wherein the marker is
underexpressed in patients afflicted with ovarian cancer. The protein can be provided to
the cells, for example, by providing a vector comprising a polynucleotide encoding the
protein to the cells.

The invention includes another method of treating a patient afflicted with ovarian
cancer or at risk of developing ovarian cancer. This method comprises inhibiting
expression or overexpression of a marker listed in Tables 1-2 by, e.g., providing to cells
of the patient an antisense oligonucleotide complementary to a polynucleotide
corresponding to a marker listed in Tables 1-2, wherein the marker is overexpressed in
patients afflicted with ovarian cancer.

It will be appreciated that the methods and kits of the present invention may also
include known cancer markers including known ovarian cancer markers. It will further
be appreciated that the methods and kits may be used to identify cancers other than
ovarian cancer.

DETAILED DESCRIPTION OF THE INVENTION 
The invention relates to newly discovered genes associated with the cancerous
state of ovarian cells. It has been discovered that the level of expression of individual
genes, also referred to as markers, and combinations of these genes, correlates with the
presence of ovarian cancer in a patient. Methods are provided for detecting the presence
of ovarian cancer in a sample, the absence of ovarian cancer in a sample, the stage of an
ovarian cancer, and other characteristics of ovarian cancer that are relevant to
prevention, diagnosis, characterization, and therapy of ovarian cancer in a patient.

Definitions 

As used herein, each of the following terms has the meaning associated with it in 
this section. 

The articles "a" and "an" are used herein to refer to one or to more than one (i.e.
to at least one) of the grammatical object of the article. By way of example, "an
element" means one element or more than one element.

A "marker" is a naturally-occurring polymer corresponding to at least one of the
novel nucleic acids listed in Tables 1-2. For example, markers include, without
limitation, sense and anti-sense strands of genomic DNA (i.e. including any introns
occurring therein), RNA generated by transcription of genomic DNA (i.e. prior to
splicing), RNA generated by splicing of RNA transcribed from genomic DNA, and
proteins generated by translation of spliced RNA (i.e. including proteins both before and
after cleavage of normally cleaved regions such as transmembrane signal sequences).
As used herein, "marker" may also include a cDNA made by reverse transcription of an
RNA generated by transcription of genomic DNA (including spliced RNA).

WO 01/070979    PCT/US01/09126

The term "probe" refers to any molecule which is capable of selectively binding
to a specifically intended target molecule, for example a marker of the invention.
Probes can be either synthesized by one skilled in the art, or derived from appropriate
biological preparations. For purposes of detection of the target molecule, probes may be
specifically designed to be labeled, as described herein. Examples of molecules that can
be utilized as probes include, but are not limited to, RNA, DNA, proteins, antibodies,
and organic monomers.

An "ovary-associated" body fluid is a fluid which, when in the body of a patient,
contacts or passes through ovarian cells or into which cells or proteins shed from
ovarian cells, e.g. ovarian epithelium, are capable of passing. Exemplary
ovary-associated body fluids include blood fluids, lymph, ascites, gynecological fluids,
cystic fluid, urine, and fluids collected by peritoneal rinsing.

The "normal" level of expression of a marker is the level of expression of the
marker in ovarian cells of a patient, e.g. a human, not afflicted with ovarian cancer.

"Over-expression" and "under-expression" of a marker refer to expression of the
marker of a patient at a greater or lesser level, respectively, than the normal level of
expression of the marker (e.g. at least two-fold greater or lesser level).

As used herein, the term "promoter/regulatory sequence" means a nucleic acid
sequence which is required for expression of a gene product operably linked to the
promoter/regulatory sequence. In some instances, this sequence may be the core
promoter sequence and in other instances, this sequence may also include an enhancer
sequence and other regulatory elements which are required for expression of the gene
product. The promoter/regulatory sequence may, for example, be one which expresses
the gene product in a tissue-specific manner.

A "constitutive" promoter is a nucleotide sequence which, when operably linked
with a polynucleotide which encodes or specifies a gene product, causes the gene
product to be produced in a living human cell under most or all physiological conditions
of the cell.

An "inducible" promoter is a nucleotide sequence which, when operably linked
with a polynucleotide which encodes or specifies a gene product, causes the gene
product to be produced in a living human cell substantially only when an inducer which
corresponds to the promoter is present in the cell.

A "tissue-specific" promoter is a nucleotide sequence which, when operably
linked with a polynucleotide which encodes or specifies a gene product, causes the gene
product to be produced in a living human cell substantially only if the cell is a cell of the
tissue type corresponding to the promoter.

A "transcribed polynucleotide" is a polynucleotide (e.g. an RNA, a cDNA, or an
analog of one of an RNA or cDNA) which is complementary to or homologous with all
or a portion of a mature RNA made by transcription of a genomic DNA corresponding
to a marker of the invention and normal post-transcriptional processing (e.g. splicing), if
any, of the transcript.

"Complementary" refers to the broad concept of sequence complementarity
between regions of two nucleic acid strands or between two regions of the same nucleic
acid strand. It is known that an adenine residue of a first nucleic acid region is capable
of forming specific hydrogen bonds ("base pairing") with a residue of a second nucleic
acid region which is antiparallel to the first region if the residue is thymine or uracil.
Similarly, it is known that a cytosine residue of a first nucleic acid strand is capable of
base pairing with a residue of a second nucleic acid strand which is antiparallel to the
first strand if the residue is guanine. A first region of a nucleic acid is complementary to
a second region of the same or a different nucleic acid if, when the two regions are
arranged in an antiparallel fashion, at least one nucleotide residue of the first region is
capable of base pairing with a residue of the second region. Preferably, the first region
comprises a first portion and the second region comprises a second portion, whereby,
when the first and second portions are arranged in an antiparallel fashion, at least about
50%, and preferably at least about 75%, at least about 90%, or at least about 95% of the
nucleotide residues of the first portion are capable of base pairing with nucleotide
residues in the second portion. More preferably, all nucleotide residues of the first
portion are capable of base pairing with nucleotide residues in the second portion.
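
The percentage of base-pairing residues described above can be computed with a short Python sketch. The function name and the example sequences are hypothetical; the sketch assumes two portions of equal length, each written 5' to 3', and counts only the adenine-thymine, adenine-uracil, and cytosine-guanine pairs named in the definition.

```python
# Base pairs named in the definition (both orientations).
BASE_PAIRS = {
    ("A", "T"), ("T", "A"),
    ("A", "U"), ("U", "A"),
    ("C", "G"), ("G", "C"),
}


def percent_complementary(first: str, second: str) -> float:
    """Percentage of residues of `first` able to base pair with `second`
    when the two equal-length portions are arranged antiparallel.

    Both sequences are written 5'->3', so residue i of `first` faces
    residue (n - 1 - i) of `second`.
    """
    if len(first) != len(second) or not first:
        raise ValueError("portions must be non-empty and of equal length")
    paired = sum(
        (a, b) in BASE_PAIRS
        for a, b in zip(first.upper(), reversed(second.upper()))
    )
    return 100.0 * paired / len(first)


print(percent_complementary("ATTGCC", "GGCAAT"))  # fully complementary portions
print(percent_complementary("ATTGCC", "GGCTAT"))  # one mismatched position
```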

"Homologous" as used herein, refers to nucleotide sequence similarity between
two regions of the same nucleic acid strand or between regions of two different nucleic 
acid strands. When a nucleotide residue position in both regions is occupied by- the 
same nucleotide residue, then the regions are homologous at that position. A first region 
5 is homologous to a second region if at least one nucleotide residue position of each 
region is occupied by the same residue. Homology between two regions is expressed in 
terms of the proportion of nucleotide residue positions of the two regions that are 
occupied by the same nucleotide residue. By way of example, a region having the 
nucleotide sequence 5-ATTGCC-3' and a region having the nucleotide sequence 5- 

10 TATGGC-3' share 50% homology. Preferably, the first region comprises a first portion 
and the second region comprises a second portion, whereby, at least about 50%, and 
preferably at least about 75%, at least about 90%, or at least about 95% of the nucleotide 
residue positions of each of the portions are occupied by the same nucleotide residue. 
More preferably, all nucleotide residue positions of each of the portions are occupied by 

15 the same nucleotide residue. 
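
The 50% figure in the example above can be checked position by position. The following is a minimal sketch; the function name is hypothetical, and it is assumed that the two regions are of equal length and are compared in the same 5' to 3' orientation without gaps.

```python
def percent_homology(region_a: str, region_b: str) -> float:
    """Percentage of positions occupied by the same residue in both regions.

    Assumption for this sketch: both regions are written 5'->3', have equal
    length, and are compared position by position without gaps.
    """
    a = region_a.upper()
    b = region_b.upper()
    if not a or len(a) != len(b):
        raise ValueError("regions must be non-empty and of equal length")
    identical = sum(x == y for x, y in zip(a, b))
    return 100.0 * identical / len(a)


# Worked example from the text: positions 3, 4, and 6 match (T, G, C),
# so 3 of 6 positions are identical.
example = percent_homology("ATTGCC", "TATGGC")
```

Here `example` evaluates to 50.0, in agreement with the stated homology of the two example regions.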

A marker is "fixed" to a substrate if it is covalently or non-covalently associated
with the substrate such that the substrate can be rinsed with a fluid (e.g. standard saline
citrate, pH 7.4) without a substantial fraction of the marker dissociating from the
substrate.

As used herein, a "naturally-occurring" nucleic acid molecule refers to an RNA
or DNA molecule having a nucleotide sequence that occurs in nature (e.g. encodes a
natural protein).

Expression of a marker in a patient is "significantly" higher or lower than the
normal level of expression of a marker if the level of expression of the marker is greater
or less, respectively, than the normal level by an amount greater than the standard error
of the assay employed to assess expression, and preferably at least twice, and more
preferably three, four, five or ten times that amount. Alternately, expression of the
marker in the patient can be considered "significantly" higher or lower than the normal
level of expression if the level of expression is at least about two, and preferably at least
about three, four, or five times, higher or lower, respectively, than the normal level of
expression of the marker.
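
The two alternative criteria for a "significant" difference can be sketched as follows. The function and parameter names are hypothetical, and the default thresholds (one standard error; two-fold) are simply the least stringent values recited above. Expression levels are assumed to be positive numbers in the same arbitrary units.

```python
def significant_by_standard_error(patient_level: float, normal_level: float,
                                  assay_standard_error: float,
                                  se_multiple: float = 1.0) -> bool:
    """True if the levels differ by more than se_multiple standard errors.

    se_multiple=1.0 is the minimum criterion; 2, 3, 4, 5, or 10 correspond
    to the preferred, more stringent criteria.
    """
    return abs(patient_level - normal_level) > se_multiple * assay_standard_error


def significant_by_fold(patient_level: float, normal_level: float,
                        min_fold: float = 2.0) -> bool:
    """True if the patient level is at least min_fold times higher or lower."""
    if patient_level <= 0 or normal_level <= 0:
        raise ValueError("expression levels must be positive")
    fold = max(patient_level / normal_level, normal_level / patient_level)
    return fold >= min_fold
```

For instance, with a normal level of 10.0 and an assay standard error of 1.5, a patient level of 12.0 satisfies the minimum standard-error criterion but not the two-fold criterion, whereas a patient level of 30.0 satisfies both.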

Ovarian cancer is "inhibited" if at least one symptom of the cancer is alleviated,
terminated, slowed, or prevented. As used herein, ovarian cancer is also "inhibited" if
recurrence or metastasis of the cancer is reduced, slowed, delayed, or prevented.

A kit is any manufacture (e.g. a package or container) comprising at least one 
reagent, e.g. a probe, for specifically detecting a marker of the invention, the 
manufacture being promoted, distributed, or sold as a unit for performing the methods of 
the present invention. 

Description 

The present invention is based, in part, on identification of novel markers which 
are over-expressed in ovarian cancer cells as compared to their expression in normal (i.e. 
non-cancerous) ovarian cells. The markers of the invention correspond to DNA, RNA,
and polypeptide molecules which can be detected in one or both of normal and 
cancerous ovarian cells. The enhanced expression of one or more of these markers in 
ovarian cells is herein correlated with the cancerous state of the tissue. The invention 
thus includes compositions, kits, and methods for assessing the cancerous state of 
ovarian cells (e.g. cells obtained from a human, cultured human cells, archived or 
preserved human cells and in vivo cells). 

The compositions, kits, and methods of the invention have the following uses, 
among others: 

1) assessing whether a patient is afflicted with ovarian cancer; 

2) assessing the stage of ovarian cancer in a human patient; 

3) assessing the grade of ovarian cancer in a patient; 

4) assessing the benign or malignant nature of ovarian cancer in a patient; 

5) assessing the histological type of neoplasm (e.g. serous, mucinous,
endometrioid, or clear cell neoplasm) associated with ovarian
cancer in a patient;

6) making an isolated hybridoma which produces an antibody useful for
assessing whether a patient is afflicted with ovarian cancer;

7) assessing the presence of ovarian cancer cells;

8) assessing the efficacy of one or more test compounds for inhibiting
ovarian cancer in a patient;



WO 01/070979 PCT/US01/09126

9) assessing the efficacy of a therapy for inhibiting ovarian cancer in a
patient;

10) monitoring the progression of ovarian cancer in a patient;

11) selecting a composition or therapy for inhibiting ovarian cancer in a
patient;

12) treating a patient afflicted with ovarian cancer;

13) inhibiting ovarian cancer in a patient;

14) assessing the ovarian carcinogenic potential of a test compound;
and

15) inhibiting an ovarian cancer in a patient at risk for developing
ovarian cancer.



The invention thus includes a method of assessing whether a patient is afflicted
with ovarian cancer. This method comprises comparing the level of expression of a
marker in a patient sample and the normal level of expression of the marker in a control,
e.g., a non-ovarian cancer sample. A significant difference between the level of
expression of the marker in the patient sample and the normal level is an indication that
the patient is afflicted with ovarian cancer. The marker is selected from the group
consisting of the markers listed in Tables 1-2.

The polynucleotides set forth in Tables 1-2 represent previously unidentified
nucleotide sequences. These nucleotide sequences were identified through subtracted
library experiments described herein. Also provided by this invention are
polynucleotides that correspond to the polynucleotides of Tables 1-2. In one
embodiment, these polynucleotides are obtained by identification of a larger fragment or
full-length coding sequence of these polynucleotides. Gene delivery vehicles, host cells,
compositions and databases (all described herein) containing these polynucleotides are
also provided by this invention.

Any marker or combination of markers listed in Tables 1-2, as well as any
known markers in combination with the markers set forth in Tables 1-2, may be used in
the compositions, kits, and methods of the present invention. In general, it is preferable
to use markers for which the difference between the level of expression of the marker in
ovarian cancer cells and the level of expression of the same marker in normal ovarian
cells is as great as possible. Although this difference can be as small as the limit of
detection of the method for assessing expression of the marker, it is preferred that the
difference be at least greater than the standard error of the assessment method, and
preferably a difference of at least 2-, 3-, 4-, 5-, 6-, 7-, 8-, 9-, 10-, 15-, 20-, 25-, 100-,
500-, 1000-fold or greater.

It is recognized that certain markers correspond to proteins which are secreted
from ovarian cells (i.e. one or both of normal and cancerous cells) to the extracellular
space surrounding the cells. These markers are preferably used in certain embodiments
of the compositions, kits, and methods of the invention, owing to the fact that the protein
corresponding to each of these markers can be detected in an ovary-associated body
fluid sample, which may be more easily collected from a human patient than a tissue
biopsy sample. In addition, preferred in vivo techniques for detection of a protein
corresponding to a marker of the invention include introducing into a subject a labeled
antibody directed against the protein. For example, the antibody can be labeled with a
radioactive marker whose presence and location in a subject can be detected by standard
imaging techniques.

It is a simple matter for the skilled artisan to determine whether any particular
marker corresponds to a secreted protein. In order to make this determination, the
protein corresponding to a marker is expressed in a test cell (e.g. a cell of an ovarian cell
line), extracellular fluid is collected, and the presence or absence of the protein in the
extracellular fluid is assessed (e.g. using a labeled antibody which binds specifically
with the protein).

The following is an example of a method which can be used to detect secretion
of a protein corresponding to a marker of the invention. About 8 x 10^5 293T cells are
incubated at 37°C in wells containing growth medium (Dulbecco's modified Eagle's
medium {DMEM} supplemented with 10% fetal bovine serum) under a 5% (v/v) CO2,
95% air atmosphere to about 60-70% confluence. The cells are then transfected using a
standard transfection mixture comprising 2 micrograms of DNA comprising an
expression vector encoding the protein and 10 microliters of LipofectAMINE™
(GIBCO/BRL Catalog no. 18342-012) per well. The transfection mixture is maintained
for about 5 hours, and then replaced with fresh growth medium and maintained in an air
atmosphere. Each well is gently rinsed twice with DMEM which does not contain
methionine or cysteine (DMEM-MC; ICN Catalog no. 16-424-54). About 1 milliliter
of DMEM-MC and about 50 microcuries of Trans-35S™ reagent (ICN Catalog no.
51006) are added to each well. The wells are maintained under the 5% CO2 atmosphere
described above and incubated at 37°C for a selected period. Following incubation, 150
microliters of conditioned medium is removed and centrifuged to remove floating cells
and debris. The presence of the protein in the supernatant is an indication that the
protein is secreted.

Examples of ovary-associated body fluids include blood fluids (e.g. whole blood,
blood serum, blood having platelets removed therefrom, etc.), lymph, ascitic fluids,
gynecological fluids (e.g. ovarian, fallopian, and uterine secretions, menses, vaginal
douching fluids, fluids used to rinse cervical cell samples, etc.), cystic fluid, urine, and
fluids collected by peritoneal rinsing (e.g. fluids applied and collected during
laparoscopy or fluids instilled into and withdrawn from the peritoneal cavity of a human
patient). In these embodiments, the level of expression of the marker can be assessed by
assessing the amount (e.g. absolute amount or concentration) of the marker in an
ovary-associated body fluid obtained from a patient. The fluid can, of course, be subjected to a
variety of well-known post-collection preparative and storage techniques (e.g. storage,
freezing, ultrafiltration, concentration, evaporation, centrifugation, etc.) prior to
assessing the amount of the marker in the fluid.

Many ovary-associated body fluids (i.e. usually excluding urine) can have
ovarian cells, e.g. ovarian epithelium, therein, particularly when the ovarian cells are
cancerous, and, more particularly, when the ovarian cancer is metastasizing.
Cell-containing fluids which can contain ovarian cancer cells include, but are not limited to,
peritoneal ascites, fluids collected by peritoneal rinsing, fluids collected by uterine
rinsing, uterine fluids such as uterine exudate and menses, pleural fluid, and ovarian
exudates. Thus, the compositions, kits, and methods of the invention can be used to
detect expression of markers corresponding to proteins having at least one portion which
is displayed on the surface of cells which express it. Examples of such proteins are
indicated in the Tables herein. Although not every protein having at least one
cell-surface portion is indicated in the Tables, it is a simple matter for the skilled artisan to
determine whether the protein corresponding to any particular marker comprises a
cell-surface protein. For example, immunological methods may be used to detect such
proteins on whole cells, or well known computer-based sequence analysis methods (e.g.
the SIGNALP program; Nielsen et al., 1997, Protein Engineering 10:1-6) may be used
to predict the presence of at least one extracellular domain (i.e. including both secreted
proteins and proteins having at least one cell-surface domain). Expression of a marker
corresponding to a protein having at least one portion which is displayed on the surface
of a cell which expresses it may be detected without necessarily lysing the cell (e.g.
using a labeled antibody which binds specifically with a cell-surface domain of the
protein).

Expression of a marker of the invention may be assessed by any of a wide
variety of well known methods for detecting expression of a transcribed molecule or
protein. Non-limiting examples of such methods include immunological methods for
detection of secreted, cell-surface, cytoplasmic, or nuclear proteins, protein purification
methods, protein function or activity assays, nucleic acid hybridization methods, nucleic
acid reverse transcription methods, and nucleic acid amplification methods.

In a preferred embodiment, expression of a marker is assessed using an antibody
(e.g. a radio-labeled, chromophore-labeled, fluorophore-labeled, or enzyme-labeled
antibody), an antibody derivative (e.g. an antibody conjugated with a substrate or with
the protein or ligand of a protein-ligand pair {e.g. biotin-streptavidin}), or an antibody
fragment (e.g. a single-chain antibody, an isolated antibody hypervariable domain, etc.)
which binds specifically with a protein corresponding to the marker, such as the protein
encoded by the open reading frame corresponding to the marker or such a protein which
has undergone all or a portion of its normal post-translational modification.

In another preferred embodiment, expression of a marker is assessed by
preparing mRNA/cDNA (i.e. a transcribed polynucleotide) from cells in a patient
sample, and by hybridizing the mRNA/cDNA with a reference polynucleotide which is a
complement of a polynucleotide comprising the marker, and fragments thereof. cDNA
can, optionally, be amplified using any of a variety of polymerase chain reaction
methods prior to hybridization with the reference polynucleotide; preferably, it is not
amplified. Expression of one or more markers can likewise be detected using
quantitative PCR to assess the level of expression of the marker(s). Alternatively, any
of the many known methods of detecting mutations or variants (e.g. single nucleotide
polymorphisms, deletions, etc.) of a marker of the invention may be used to detect
occurrence of a marker in a patient.

In a related embodiment, a mixture of transcribed polynucleotides obtained from
the sample is contacted with a substrate having fixed thereto a polynucleotide
complementary to or homologous with at least a portion (e.g. at least 7, 10, 15, 20, 25,
30, 40, 50, 100, 500, or more nucleotide residues) of a marker of the invention. If
polynucleotides complementary to or homologous with a plurality of markers are
differentially detectable on the substrate (e.g. detectable using different chromophores
or fluorophores, or fixed to different selected positions), then the levels of expression of
a plurality of markers can be assessed simultaneously using a single substrate (e.g. a
"gene chip" microarray of polynucleotides fixed at selected positions). When a method
of assessing marker expression is used which involves hybridization of one nucleic acid
with another, it is preferred that the hybridization be performed under stringent
hybridization conditions.

Because the compositions, kits, and methods of the invention rely on detection of
a difference in expression levels of one or more markers of the invention, it is preferable
that the level of expression of the marker is significantly greater than the minimum
detection limit of the method used to assess expression in at least one of normal ovarian
cells and cancerous ovarian cells.

It is understood that by routine screening of additional patient samples using one
or more of the markers of the invention, it will be realized that certain of the markers are
over- or under-expressed in cancers of various types, including specific ovarian cancers,
as well as other cancers such as breast cancer, cervical cancer, etc. For example, it will
be confirmed that some of the markers of the invention are over- or under-expressed in
most (i.e. 50% or more) or substantially all (i.e. 80% or more) of ovarian cancers.
Furthermore, it will be confirmed that certain of the markers of the invention are
associated with ovarian cancer of various stages (i.e. stage I, II, III, and IV ovarian
cancers, as well as subclassifications IA, IB, IC, IIA, IIB, IIC, IIIA, IIIB, and IIIC, using
the FIGO Stage Grouping system for primary carcinoma of the ovary; 1987, Am. J.
Obstet. Gynecol. 156:236), of various histologic subtypes (e.g. serous, mucinous,
endometrioid, and clear cell subtypes, as well as subclassifications and alternate
classifications adenocarcinoma, papillary adenocarcinoma, papillary
cystadenocarcinoma, surface papillary carcinoma, malignant adenofibroma,
cystadenofibroma, adenocarcinoma, cystadenocarcinoma, adenoacanthoma,
endometrioid stromal sarcoma, mesodermal (Mullerian) mixed tumor, mesonephroid
tumor, malignant carcinoma, Brenner tumor, mixed epithelial tumor, and
undifferentiated carcinoma, using the WHO/FIGO system for classification of malignant
ovarian tumors; Scully, Atlas of Tumor Pathology, 3d series, Washington DC), and
various grades (i.e. grade I {well differentiated}, grade II {moderately well
differentiated}, and grade III {poorly differentiated from surrounding normal tissue}).
In addition, as a greater number of patient samples are assessed for expression of the
markers of the invention and the outcomes of the individual patients from whom the
samples were obtained are correlated, it will also be confirmed that altered expression of
certain of the markers of the invention is strongly correlated with malignant cancers
and that altered expression of other markers of the invention is strongly correlated with
benign tumors. The compositions, kits, and methods of the invention are thus useful for
characterizing one or more of the stage, grade, histological type, and benign/malignant
nature of ovarian cancer in patients. In addition, these compositions, kits, and methods
can be used to detect and differentiate epithelial, stromal, and germ cell ovarian cancers.

When the compositions, kits, and methods of the invention are used for
characterizing one or more of the stage, grade, histological type, and benign/malignant
nature of ovarian cancer in a patient, it is preferred that the marker or panel of markers
of the invention is selected such that a positive result is obtained in at least about 20%,
and preferably at least about 40%, 60%, or 80%, and more preferably in substantially all
patients afflicted with an ovarian cancer of the corresponding stage, grade, histological
type, or benign/malignant nature. Preferably, the marker or panel of markers of the
invention is selected such that a PPV of greater than about 10% is obtained for the
general population (more preferably coupled with an assay specificity greater than
99.5%).
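
The relationship between PPV (positive predictive value), assay sensitivity, assay specificity, and disease prevalence follows from Bayes' rule, and a minimal sketch is given below. The function name is hypothetical, and the prevalence value used in the example (0.0004, i.e. 40 cases per 100,000) is an illustrative assumption rather than a figure taken from this specification.

```python
def positive_predictive_value(sensitivity: float, specificity: float,
                              prevalence: float) -> float:
    """PPV = true positives / (true positives + false positives)."""
    true_positive_rate = sensitivity * prevalence
    false_positive_rate = (1.0 - specificity) * (1.0 - prevalence)
    total_positive_rate = true_positive_rate + false_positive_rate
    if total_positive_rate == 0:
        raise ValueError("no positive results are possible with these inputs")
    return true_positive_rate / total_positive_rate


# Illustrative assumption: prevalence of 40 per 100,000 in the screened population.
ASSUMED_PREVALENCE = 0.0004

# 80% sensitivity at exactly 99.5% specificity falls short of a 10% PPV ...
ppv_at_995 = positive_predictive_value(0.80, 0.995, ASSUMED_PREVALENCE)
# ... whereas a higher specificity of 99.8% exceeds it.
ppv_at_998 = positive_predictive_value(0.80, 0.998, ASSUMED_PREVALENCE)
```

Under the assumed prevalence, `ppv_at_995` is roughly 6% and `ppv_at_998` is roughly 14%, which illustrates why a high assay specificity is needed to reach the stated PPV target in a low-prevalence general population.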

When a plurality of markers of the invention are used in the compositions, kits,
and methods of the invention, the level of expression of each marker in a patient sample
can be compared with the normal level of expression of each of the plurality of markers
in non-cancerous samples of the same type, either in a single reaction mixture (i.e. using
reagents, such as different fluorescent probes, for each marker) or in individual reaction
mixtures corresponding to one or more of the markers. In one embodiment, a
significantly enhanced level of expression of more than one of the plurality of markers
in the sample, relative to the corresponding normal levels, is an indication that the
patient is afflicted with ovarian cancer. In another embodiment, a significantly lower
level of expression in the sample of each of the plurality of markers, relative to the
corresponding normal levels, is an indication that the patient is afflicted with ovarian
cancer. In yet another embodiment, a significantly enhanced level of expression of one
or more markers and a significantly lower level of expression of one or more markers in a
sample relative to the corresponding normal levels, is an indication that the patient is
afflicted with ovarian cancer. When a plurality of markers is used, it is preferred that 2,
3, 4, 5, 8, 10, 12, 15, 20, 30, or 50 or more individual markers be used, wherein fewer
markers are preferred.
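
The three embodiments described above for a panel of markers can be sketched as a simple decision helper. The function name, the marker names used in the example, and the choice of a two-fold threshold as the test of significance are hypothetical assumptions made for illustration; any other criterion of significance could be substituted.

```python
def evaluate_panel(patient_levels: dict, normal_levels: dict,
                   min_fold: float = 2.0) -> dict:
    """Report which of the three panel embodiments is satisfied.

    Assumptions for this sketch: both dicts map the same marker names to
    positive expression levels, and "significant" means at least min_fold.
    """
    enhanced = [m for m, level in patient_levels.items()
                if level >= min_fold * normal_levels[m]]
    lowered = [m for m, level in patient_levels.items()
               if level * min_fold <= normal_levels[m]]
    return {
        # Embodiment 1: more than one marker significantly enhanced.
        "multiple_enhanced": len(enhanced) > 1,
        # Embodiment 2: every marker of the panel significantly lower.
        "all_lower": bool(patient_levels) and len(lowered) == len(patient_levels),
        # Embodiment 3: at least one enhanced and at least one lower.
        "mixed": len(enhanced) >= 1 and len(lowered) >= 1,
    }


# Hypothetical three-marker panel with a normal level of 10.0 for each marker.
result = evaluate_panel(
    {"marker_a": 30.0, "marker_b": 25.0, "marker_c": 10.0},
    {"marker_a": 10.0, "marker_b": 10.0, "marker_c": 10.0},
)
```

In this example two of the three markers are enhanced at least two-fold, so only the first embodiment is satisfied.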

In order to maximize the sensitivity of the compositions, kits, and methods of the
invention (i.e. by reducing interference attributable to cells of non-ovarian origin in a
patient sample), it is preferable that the marker of the invention used therein be a marker
which has a restricted tissue distribution, e.g., normally not expressed in a non-epithelial
tissue, and more preferably a marker which is normally not expressed in a non-ovarian tissue.

Only a small number of markers are known to be associated with ovarian cancers
(e.g. AKT2, Ki-RAS, ERBB2, c-MYC, RB1, and TP53; Lynch, supra). These markers are
not, of course, included among the markers of the invention, although they may be used
together with one or more markers of the invention in a panel of markers, for example.
It is well known that certain types of genes, such as oncogenes, tumor suppressor genes,
growth factor-like genes, protease-like genes, and protein kinase-like genes are often
involved with development of cancers of various types. Thus, among the markers of the
invention, use of those which correspond to proteins which resemble known proteins
encoded by known oncogenes and tumor suppressor genes, and those which correspond
to proteins which resemble growth factors, proteases, and protein kinases are preferred.

Known oncogenes and tumor suppressor genes include, for example, abl, abr,
akt2, apc, bcl2α, bcl2β, bcl3, bcr, brca1, brca2, cbl, ccnd1, cdc42, cdk4, crk-II,
csf1r/fms, dbl, dcc, dpc4/smad4, e-cad, e2f1/rbap, egfr/erbb-1, elk1, elk3, eph, erg, ets1,
ets2, fer, fgr/src2, fli1/ergb2, fos, fps/fes, fra1, fra2, fyn, hck, hek, her2/erbb-2/neu,
her3/erbb-3, her4/erbb-4, hras1, hst2, hstf1, igfbp2, ink4a, ink4b, int2/fgf3, jun, junb,
jund, kip2, kit, kras2a, kras2b, lck, lyn, mas, max, mcc, mdm2, met, mlh1, mmp10, mos,
msh2, msh3, msh6, myb, myba, mybb, myc, mycl1, mycn, nf1, nf2, nme2, nras, p53,
pdgfb, phb, pim1, pms1, pms2, ptc, pten, raf1, rap1a, rb1, rel, ret, ros1, ski, src1, tal1,
tgfbr2, tgfb3, tgfbr3, thra1, thrb, tiam1, timp3, tjp1, tp53, trk, vav, vhl, vil2, waf1, wnt1,
wnt2, wt1, and yes1 (Hesketh, 1997, In: The Oncogene and Tumour Suppressor Gene
Facts Book, 2nd Ed., Academic Press; Fishel et al., 1994, Science 266:1403-1405).

Known growth factors include platelet-derived growth factor alpha,
platelet-derived growth factor beta (simian sarcoma viral {v-sis} oncogene homolog),
thrombopoietin (myeloproliferative leukemia virus oncogene ligand, megakaryocyte
growth and development factor), erythropoietin, B cell growth factor, macrophage
stimulating factor 1 (hepatocyte growth factor-like protein), hepatocyte growth factor
(hepapoietin A), insulin-like growth factor 1 (somatomedin C), hepatoma-derived
growth factor, amphiregulin (schwannoma-derived growth factor), bone morphogenetic
proteins 1, 2, 3, 3 beta, and 4, bone morphogenetic protein 7 (osteogenic protein 1), bone
morphogenetic protein 8 (osteogenic protein 2), connective tissue growth factor,
connective tissue activation peptide 3, epidermal growth factor (EGF),
teratocarcinoma-derived growth factor 1, endothelin, endothelin 2, endothelin 3, stromal cell-derived
factor 1, vascular endothelial growth factor (VEGF), VEGF-B, VEGF-C, placental
growth factor (vascular endothelial growth factor-related protein), transforming growth
factor alpha, transforming growth factor beta 1 and its precursors, transforming growth
factor beta 2 and its precursors, fibroblast growth factor 1 (acidic), fibroblast growth
factor 2 (basic), fibroblast growth factor 5 and its precursors, fibroblast growth factor 6
and its precursors, fibroblast growth factor 7 (keratinocyte growth factor), fibroblast
growth factor 8 (androgen-induced), fibroblast growth factor 9 (glia-activating factor),
pleiotrophin (heparin binding growth factor 8, neurite growth-promoting factor 1),
brain-derived neurotrophic factor, and recombinant glial growth factor 2.

Known proteases include interleukin-1 beta convertase and its precursors, Mch6
and its precursors, Mch2 isoform alpha, Mch4, Cpp32 isoform alpha, Lice2 gamma
cysteine protease, Ich-1S, Ich-1L, Ich-2 and its precursors, TY protease, matrix
metalloproteinase 1 (interstitial collagenase), matrix metalloproteinase 2 (gelatinase A,
72kD gelatinase, 72kD type IV collagenase), matrix metalloproteinase 7 (matrilysin),
matrix metalloproteinase 8 (neutrophil collagenase), matrix metalloproteinase 12
(macrophage elastase), matrix metalloproteinase 13 (collagenase 3), metallopeptidase 1,
cysteine-rich metalloprotease (disintegrin) and its precursors, subtilisin-like protease Pc8
and its precursors, chymotrypsin, snake venom-like protease, cathepsin 1, cathepsin D
(lysosomal aspartyl protease), stromelysin, aminopeptidase N, plasminogen, tissue
plasminogen activator, plasminogen activator inhibitor type II, and urokinase-type
plasminogen activator.

Known protein kinases include DAP kinase, serine/threonine protein kinases
NIK, PK428, Krs-2, SAK, and EMK, interferon-inducible double stranded RNA
dependent protein kinase, FAST kinase, AIM1, IPL1-like midbody-associated protein
kinase-1, NIMA-like protein kinase 1 (NLK1), the cyclin-dependent kinases (cdk1-10),
checkpoint kinase Chk1, Nek3 protein kinase, BMK1 beta kinase, Clk1, Clk2, Clk3,
extracellular signal-regulated kinases 1, 3, and 6, cdc28 protein kinase 1, cdc28 protein
kinase 2, pLK, Myt1, c-Jun N-terminal kinase 2, Cam kinase 1, the MAP kinases,
insulin-stimulated protein kinase 1, beta-adrenergic receptor kinase 2, ribosomal protein
S6 kinase, kinase suppressor of ras-1 (KSR1), putative serine/threonine protein kinase
Prk, PkB kinase, cAMP-dependent protein kinase, cGMP-dependent protein kinase, type
II cGMP-dependent protein kinase, protein kinases Dyrk2, Dyrk3, and Dyrk4,
Rho-associated coiled-coil containing protein kinase p160ROCK, protein tyrosine kinase
t-Ror1, Ste20-related kinases, cell adhesion kinase beta, protein kinase 3, stress-activated
protein kinase 4, protein kinase Zpk, serine kinase hPAK65, dual specificity
mitogen-activated protein kinases 1 and 2, casein kinase I gamma 2, p21-activated protein kinase
Pak1, lipid-activated protein kinase PRO, focal adhesion kinase, dual-specificity
tyrosine-phosphorylation regulated kinase, myosin light chain kinase, serine kinases
SRPK2, TESK1, and VRK2, B lymphocyte serine/threonine protein kinase,
stress-activated protein kinases JNK1 and JNK2, phosphorylase kinase, protein tyrosine kinase
Tec, Jak2 kinase, protein kinase Ndr, MEK kinase 3, SHB adaptor protein (a Src
homology 2 protein), agammaglobulinemia protein-tyrosine kinase (Atk), protein
kinase ATR, guanylate kinase 1, thrombopoietin receptor and its precursors, DAG
kinase epsilon, and kinases encoded by oncogenes or viral oncogenes such as v-fgr
(Gardner-Rasheed), v-abl (Abelson murine leukemia viral oncogene homolog 1), v-arg
(Abelson murine leukemia viral oncogene homolog, Abelson-related gene), v-fes and
v-fps (feline sarcoma viral oncogene and Fujinami avian sarcoma viral oncogene
homologs), proto-oncogene c-cot, oncogene pim-1, and oncogene mas1.

It is recognized that the compositions, kits, and methods of the invention will be
of particular utility to patients having an enhanced risk of developing ovarian cancer and
their medical advisors. Patients recognized as having an enhanced risk of developing
ovarian cancer include, for example, patients having a familial history of ovarian cancer,
patients identified as having a mutant oncogene (i.e. at least one allele), and patients of
advancing age (i.e. women older than about 50 or 60 years).

The level of expression of a marker in normal (i.e. non-cancerous) human
ovarian tissue can be assessed in a variety of ways. In one embodiment, this normal
level of expression is assessed by assessing the level of expression of the marker in a
portion of ovarian cells which appears to be non-cancerous and by comparing this
normal level of expression with the level of expression in a portion of the ovarian cells
which is suspected of being cancerous. For example, when laparoscopy or other
medical procedure reveals the presence of a lump on one portion of a patient's ovary,
but not on another portion of the same ovary or on the other ovary, the normal level of
expression of a marker may be assessed using one or both of the non-affected ovary and
a non-affected portion of the affected ovary, and this normal level of expression may be
compared with the level of expression of the same marker in an affected portion (i.e. the
lump) of the affected ovary. Alternately, and particularly as further information
becomes available as a result of routine performance of the methods described herein,
population-average values for normal expression of the markers of the invention may be
used. In other embodiments, the 'normal' level of expression of a marker may be
determined by assessing expression of the marker in a patient sample obtained from a
non-cancer-afflicted patient, from a patient sample obtained from a patient before the
suspected onset of ovarian cancer in the patient, from archived patient samples, and the
like.

The invention includes compositions, kits, and methods for assessing the
presence of ovarian cancer cells in a sample (e.g. an archived tissue sample or a sample
obtained from a patient). These compositions, kits, and methods are substantially the
same as those described above, except that, where necessary, the compositions, kits, and
methods are adapted for use with samples other than patient samples. For example,
when the sample to be used is a paraffinized, archived human tissue sample, it can be
necessary to adjust the ratio of compounds in the compositions of the invention, in the
kits of the invention, or the methods used to assess levels of marker expression in the
sample. Such methods are well known in the art and within the skill of the ordinary
artisan.

The invention includes a kit for assessing the presence of ovarian cancer cells
(e.g. in a sample such as a patient sample). The kit comprises a plurality of reagents,
each of which is capable of binding specifically with a nucleic acid or polypeptide
corresponding to a marker of the invention. Suitable reagents for binding with a
polypeptide corresponding to a marker of the invention include antibodies, antibody
derivatives, antibody fragments, and the like. Suitable reagents for binding with a
nucleic acid (e.g. a genomic DNA, an mRNA, a spliced mRNA, a cDNA, or the like)
include complementary nucleic acids. For example, the nucleic acid reagents may
include oligonucleotides (labeled or non-labeled) fixed to a substrate, labeled
oligonucleotides not bound with a substrate, pairs of PCR primers, molecular beacon
probes, and the like.

The kit of the invention may optionally comprise additional components useful 
for performing the methods of the invention. By way of example, the kit may comprise 
fluids (e.g. SSC buffer) suitable for annealing complementary nucleic acids or for
binding an antibody with a protein with which it specifically binds, one or more sample
compartments, an instructional material which describes performance of a method of the
invention, a sample of normal ovarian cells, a sample of ovarian cancer cells, and the
like.

The invention also includes a method of making an isolated hybridoma which 
produces an antibody useful for assessing whether a patient is afflicted with an ovarian
cancer. In this method, a protein corresponding to a marker of the invention is isolated
(e.g. by purification from a cell in which it is expressed or by transcription and
translation of a nucleic acid encoding the protein in vivo or in vitro using known 
methods). A vertebrate, preferably a mammal such as a mouse, rat, rabbit, or sheep, is 
immunized using the isolated protein. The vertebrate may optionally (and preferably) be 
immunized at least one additional time with the isolated protein, so that the vertebrate 
exhibits a robust immune response to the protein. Splenocytes are isolated from the 
immunized vertebrate and fused with an immortalized cell line to form hybridomas, 
using any of a variety of methods well known in the art. Hybridomas formed in this 
manner are then screened using standard methods to identify one or more hybridomas
which produce an antibody which specifically binds with the protein. The invention 
also includes hybridomas made by this method and antibodies made using such 
hybridomas. 

The invention also includes a method of assessing the efficacy of a test
compound for inhibiting ovarian cancer cells. As described above, differences in the
level of expression of the markers of the invention correlate with the cancerous state of 
ovarian cells. Although it is recognized that changes in the levels of expression of 
certain of the markers of the invention likely result from the cancerous state of ovarian
cells, it is likewise recognized that changes in the levels of expression of other of the
markers of the invention induce, maintain, and promote the cancerous state of those
cells. Thus, compounds which inhibit an ovarian cancer in a patient will cause the level
of expression of one or more of the markers of the invention to change to a level nearer
the normal level of expression for that marker (i.e. the level of expression for the marker
in non-cancerous ovarian cells).

This method thus comprises comparing expression of a marker in a first ovarian
cell sample maintained in the presence of the test compound and expression of the
marker in a second ovarian cell sample maintained in the absence of the test
compound. A significant alteration in the level of expression of a marker listed in
Tables 1-2 is an indication that the test compound inhibits ovarian cancer. The ovarian
cell samples may, for example, be aliquots of a single sample of normal ovarian cells
obtained from a patient, pooled samples of normal ovarian cells obtained from a patient,
cells of a normal ovarian cell line, aliquots of a single sample of ovarian cancer cells
obtained from a patient, pooled samples of ovarian cancer cells obtained from a patient,
cells of an ovarian cancer cell line, or the like. In one embodiment, the samples are
ovarian cancer cells obtained from a patient and a plurality of compounds known to be 
effective for inhibiting various ovarian cancers are tested in order to identify the 
compound which is likely to best inhibit the ovarian cancer in the patient. 
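
By way of illustration only, the comparison described above can be sketched in Python. The text does not define what counts as a "significant alteration"; the two-fold cutoff (an absolute log2 fold change of 1.0), the function names, and the expression values below are all hypothetical assumptions chosen for the example.

```python
import math

def log2_fold_change(treated, untreated):
    """Log2 ratio of marker expression with vs. without the test compound."""
    return math.log2(treated / untreated)

def is_significant(treated, untreated, min_abs_log2=1.0):
    """Flag an alteration using an assumed two-fold cutoff (not from the source)."""
    return abs(log2_fold_change(treated, untreated)) >= min_abs_log2

# Hypothetical expression levels: (sample with compound, sample without compound).
expression = {"marker_A": (120.0, 480.0), "marker_B": (95.0, 100.0)}

for name, (treated, untreated) in expression.items():
    change = log2_fold_change(treated, untreated)
    print(f"{name}: log2 fold change {change:+.2f}, "
          f"significant={is_significant(treated, untreated)}")
```

In practice the cutoff would be set from replicate measurements and an appropriate statistical test rather than a fixed ratio.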

This method may likewise be used to assess the efficacy of a therapy for
inhibiting ovarian cancer in a patient. In this method, the level of expression of one or
more markers of the invention in a pair of samples (one subjected to the therapy, the
other not subjected to the therapy) is assessed. As with the method of assessing the
efficacy of test compounds, if the therapy induces a significant alteration in the level of
expression of a marker listed in Tables 1-2 then the therapy is efficacious for inhibiting
ovarian cancer. As above, if samples from a selected patient are used in this method,
then alternative therapies can be assessed in vitro in order to select a therapy most likely
to be efficacious for inhibiting ovarian cancer in the patient.

As described herein, ovarian cancer in patients is associated with an alteration
in the level of expression of one or more markers listed in Tables 1-2. While, as
discussed above, some of these changes in expression level result from occurrence of the
ovarian cancer, others of these changes induce, maintain, and promote the cancerous
state of ovarian cancer cells. Thus, ovarian cancer characterized by an increase in the
level of expression of one or more markers listed in either or both of Tables 1-2 can be 
inhibited by inhibiting expression of those markers. 

Expression of a marker listed in Tables 1-2 can be inhibited in a number of ways 
generally known in the art. For example, an antisense oligonucleotide can be provided
to the ovarian cancer cells in order to inhibit transcription, translation, or both, of the
marker(s). Alternatively, a polynucleotide encoding an antibody, an antibody derivative,
or an antibody fragment, and operably linked with an appropriate promoter/regulator
region, can be provided to the cell in order to generate intracellular antibodies which
will inhibit the function or activity of the protein corresponding to the marker(s). Using
the methods described herein, a variety of molecules, particularly including molecules
sufficiently small that they are able to cross the cell membrane, can be screened in order 
to identify molecules which inhibit expression of the marker(s). The compound so 
identified can be provided to the patient in order to inhibit expression of the marker(s) in 
the ovarian cancer cells of the patient. 

Expression of a marker listed in Tables 1-2 can be enhanced in a number of ways
generally known in the art. For example, a polynucleotide encoding the marker and
operably linked with an appropriate promoter/regulator region can be provided to
ovarian cancer cells of the patient in order to induce enhanced expression of the protein
(and mRNA) corresponding to the marker therein. Alternatively, if the protein is
capable of crossing the cell membrane, inserting itself in the cell membrane, or is
normally a secreted protein, then expression of the protein can be enhanced by providing
the protein (e.g. directly or by way of the bloodstream or another ovary-associated fluid)
to ovarian cancer cells in the patient. 

As described above, the cancerous state of human ovarian cells is correlated with 
changes in the levels of expression of the markers of the invention. The invention 
includes a method for assessing the human ovarian cell carcinogenic potential of a test 
compound. This method comprises maintaining separate aliquots of human ovarian 
cells in the presence and absence of the test compound. Expression of a marker of the 
invention in each of the aliquots is compared. A significant alteration in the level of 
expression of a marker listed in Tables 1-2 in the aliquot maintained in the presence of 
the test compound (relative to the aliquot maintained in the absence of the test 
compound) is an indication that the test compound possesses human ovarian cell 
carcinogenic potential. The relative carcinogenic potentials of various test compounds 
can be assessed by comparing the degree of enhancement or inhibition of the level of 
expression of the relevant markers, by comparing the number of markers for which the 
level of expression is enhanced or inhibited, or by comparing both. 
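
As a non-limiting sketch of the ranking just described, the following Python example scores each test compound by both the number of markers altered and the summed magnitude of the alterations. The two-fold cutoff, the marker and compound names, and the expression values are hypothetical and are not taken from the source.

```python
import math

def carcinogenic_score(with_compound, without_compound, min_abs_log2=1.0):
    """Return (number of altered markers, summed |log2 fold change| of those markers).

    The two-fold cutoff is an assumption made for this illustration.
    """
    changes = [
        abs(math.log2(with_compound[m] / without_compound[m]))
        for m in without_compound
    ]
    altered = [c for c in changes if c >= min_abs_log2]
    return len(altered), sum(altered)

# Hypothetical expression levels in the aliquot maintained without any compound.
baseline = {"m1": 100.0, "m2": 100.0, "m3": 100.0}

# Hypothetical expression levels in aliquots maintained with each test compound.
compounds = {
    "compound_X": {"m1": 400.0, "m2": 25.0, "m3": 100.0},
    "compound_Y": {"m1": 200.0, "m2": 100.0, "m3": 100.0},
}

# Rank by marker count first, then by total degree of alteration.
ranking = sorted(
    compounds,
    key=lambda name: carcinogenic_score(compounds[name], baseline),
    reverse=True,
)
print(ranking)
```

Sorting on the tuple compares the number of altered markers first and uses the degree of alteration only to break ties, which is one of several reasonable ways to combine the two criteria.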

Various aspects of the invention are described in further detail in the following 
subsections. 

I. Isolated Nucleic Acid Molecules 

One aspect of the invention pertains to novel isolated nucleic acid molecules that 
correspond to a marker of the invention, including nucleic acids which encode a 
polypeptide corresponding to a marker of the invention or a portion of such a 
polypeptide. Isolated nucleic acids of the invention also include nucleic acid molecules 
sufficient for use as hybridization probes to identify nucleic acid molecules that 
correspond to a marker of the invention, including nucleic acids which encode a 
polypeptide corresponding to a marker of the invention, and fragments of such nucleic 
acid molecules, e.g., those suitable for use as PCR primers for the amplification or 
mutation of nucleic acid molecules. As used herein, the term "nucleic acid molecule" is 
intended to include DNA molecules (e.g., cDNA or genomic DNA) and RNA molecules 
(e.g., mRNA) and analogs of the DNA or RNA generated using nucleotide analogs. The
nucleic acid molecule can be single-stranded or double-stranded, but preferably is 
double-stranded DNA. 
An "isolated" nucleic acid molecule is one which is separated from other nucleic
acid molecules which are present in the natural source of the nucleic acid molecule. 
Preferably, an "isolated" nucleic acid molecule is free of sequences (preferably protein- 
encoding sequences) which naturally flank the nucleic acid (i.e., sequences located at the 
5' and 3' ends of the nucleic acid) in the genomic DNA of the organism from which the
nucleic acid is derived. For example, in various embodiments, the isolated nucleic acid 
molecule can contain less than about 5 kB, 4 kB, 3 kB, 2 kB, 1 kB, 0.5 kB or 0.1 kB of 
nucleotide sequences which naturally flank the nucleic acid molecule in genomic DNA 
of the cell from which the nucleic acid is derived. Moreover, an "isolated" nucleic acid
molecule, such as a cDNA molecule, can be substantially free of other cellular material,
or culture medium when produced by recombinant techniques, or substantially free of 
chemical precursors or other chemicals when chemically synthesized. 

A nucleic acid molecule of the present invention can be isolated using standard 
molecular biology techniques and the sequence information in the database records
described herein. Using all or a portion of such nucleic acid sequences, nucleic acid
molecules of the invention can be isolated using standard hybridization and cloning
techniques (e.g., as described in Sambrook et al., ed., Molecular Cloning: A Laboratory
Manual, 2nd ed, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 
1989). 

A nucleic acid molecule of the invention can be amplified using cDNA, mRNA,
or genomic DNA as a template and appropriate oligonucleotide primers according to
standard PCR amplification techniques. The nucleic acid so amplified can be cloned
into an appropriate vector and characterized by DNA sequence analysis. Furthermore,
oligonucleotides corresponding to all or a portion of a nucleic acid molecule of the
invention can be prepared by standard synthetic techniques, e.g., using an automated
DNA synthesizer. 

In another preferred embodiment, an isolated nucleic acid molecule of the 
invention comprises a nucleic acid molecule which has a nucleotide sequence 
complementary to the nucleotide sequence of a nucleic acid corresponding to a marker 
of the invention or to the nucleotide sequence of a nucleic acid encoding a protein which
corresponds to a marker of the invention. A nucleic acid molecule which is
complementary to a given nucleotide sequence is one which is sufficiently
complementary to the given nucleotide sequence that it can hybridize to the given
nucleotide sequence thereby forming a stable duplex. 
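
For illustration, Watson-Crick complementarity between two DNA strands can be sketched in Python as follows. The function names and sequences are hypothetical, and the check shown is for perfect complementarity only; the text requires only complementarity sufficient to form a stable duplex, which this simple sketch does not model.

```python
# Translation table mapping each DNA base to its Watson-Crick partner.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(_COMPLEMENT)[::-1]

def is_fully_complementary(a, b):
    """True when strand b pairs with strand a at every position (antiparallel)."""
    return reverse_complement(a) == b.upper()

sense = "ATGCGT"  # hypothetical sense-strand fragment
print(reverse_complement(sense))
print(is_fully_complementary(sense, "ACGCAT"))
```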

Moreover, a nucleic acid molecule of the invention can comprise only a portion 
of a nucleic acid sequence, wherein the full length nucleic acid sequence comprises a 
marker of the invention or which encodes a polypeptide corresponding to a marker of
the invention. Such nucleic acids can be used, for example, as a probe or primer. The 
probe/primer typically is used as one or more substantially purified oligonucleotides. 
The oligonucleotide typically comprises a region of nucleotide sequence that hybridizes 
under stringent conditions to at least about 7, preferably about 15, more preferably about
25, 50, 75, 100, 125, 150, 175, 200, 250, 300, 350, or 400 or more consecutive
nucleotides of a nucleic acid of the invention. 
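
A rough way to screen a candidate probe or primer against the consecutive-nucleotide thresholds listed above is sketched below in Python. The sketch treats hybridization as exact base pairing over an uninterrupted run, which is a simplifying assumption and not a model of stringent hybridization; the function names and sequences are hypothetical.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(_COMPLEMENT)[::-1]

def longest_complementary_run(probe, target):
    """Length of the longest stretch of target that the probe pairs with exactly.

    Computed as the longest common substring of the probe's reverse
    complement and the target, using row-by-row dynamic programming.
    """
    rc = reverse_complement(probe)
    target = target.upper()
    best = 0
    prev = [0] * (len(target) + 1)
    for i in range(1, len(rc) + 1):
        curr = [0] * (len(target) + 1)
        for j in range(1, len(target) + 1):
            if rc[i - 1] == target[j - 1]:
                curr[j] = prev[j - 1] + 1
                best = max(best, curr[j])
        prev = curr
    return best

def meets_minimum(probe, target, min_run=15):
    """True when the probe pairs with at least min_run consecutive nucleotides."""
    return longest_complementary_run(probe, target) >= min_run

target = "GGATCCATGGCTAAAGTTCCGA"          # hypothetical marker fragment
probe = reverse_complement(target[3:18])    # 15-mer designed against the target
print(longest_complementary_run(probe, target), meets_minimum(probe, target))
```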

Probes based on the sequence of a nucleic acid molecule of the invention can be 
used to detect transcripts or genomic sequences corresponding to one or more markers 
of the invention. The probe comprises a label group attached thereto, e.g., a
radioisotope, a fluorescent compound, an enzyme, or an enzyme co-factor. Such probes
can be used as part of a diagnostic test kit for identifying cells or tissues which mis- 
express the protein, such as by measuring levels of a nucleic acid molecule encoding the 
protein in a sample of cells from a subject, e.g., detecting mRNA levels or determining 
whether a gene encoding the protein has been mutated or deleted. 

The invention further encompasses nucleic acid molecules that differ, due to
degeneracy of the genetic code, from the nucleotide sequence of nucleic acids encoding
a protein which corresponds to a marker of the invention, and thus encode the same 
protein. 
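
The effect of degeneracy can be illustrated with a short Python sketch in which two different nucleotide sequences translate to the same protein under the standard genetic code. The two example sequences are hypothetical and do not correspond to any marker of the invention.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate a coding DNA sequence codon by codon ('*' marks a stop codon)."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

seq_a = "ATGGCTAAA"  # hypothetical coding sequence
seq_b = "ATGGCCAAG"  # differs at two third-codon positions
print(translate(seq_a), translate(seq_b), translate(seq_a) == translate(seq_b))
```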

It will be appreciated by those skilled in the art that DNA sequence 
polymorphisms that lead to changes in the amino acid sequence can exist within a
population (e.g., the human population). Such genetic polymorphisms can exist among
individuals within a population due to natural allelic variation. An allele is one of a
group of genes which occur alternatively at a given genetic locus. In addition, it will be
appreciated that DNA polymorphisms that affect RNA expression levels can also exist
that may affect the overall expression level of that gene (e.g., by affecting regulation or
degradation). 




WO 01/070979 PCT/US01/09126

As used herein, the phrase "allelic variant" refers to a nucleotide sequence which 
occurs at a given locus or to a polypeptide encoded by the nucleotide sequence. 

As used herein, the terms "gene" and "recombinant gene" refer to nucleic acid 
molecules comprising an open reading frame encoding a polypeptide corresponding to a 
marker of the invention. Such natural allelic variations can typically result in 1-5%
variance in the nucleotide sequence of a given gene. Alternative alleles can be identified 
by sequencing the gene of interest in a number of different individuals. This can be 
readily carried out by using hybridization probes to identify the same genetic locus in a 
variety of individuals. Any and all such nucleotide variations and resulting amino acid
polymorphisms or variations that are the result of natural allelic variation and that do not
alter the functional activity are intended to be within the scope of the invention. 

In another embodiment, an isolated nucleic acid molecule of the invention is at 
least 7, 15, 20, 25, 30, 40, 60, 80, 100, 150, 200, 250, 300, 350, 400, 450, 550, 650, 700, 
800, 900, 1000, 1200, 1400, 1600, 1800, 2000, 2200, 2400, 2600, 2800, 3000, 3500,
4000, 4500, or more nucleotides in length and hybridizes under stringent conditions to a
nucleic acid corresponding to a marker of the invention or to a nucleic acid encoding a 
protein corresponding to a marker of the invention. As used herein, the term "hybridizes 
under stringent conditions" is intended to describe conditions for hybridization and 
washing under which nucleotide sequences at least 75% (80%, 85%, preferably 90%)
identical to each other typically remain hybridized to each other. Such stringent
conditions are known to those skilled in the art and can be found in sections 6.3.1-6.3.6
of Current Protocols in Molecular Biology, John Wiley & Sons, N.Y. (1989). A 
preferred, non-limiting example of stringent hybridization conditions for annealing two
single-stranded DNAs, each of which is at least about 100 bases in length, and/or for
annealing a single-stranded DNA and a single-stranded RNA, each of which is at least
about 100 bases in length, is hybridization in 6X sodium chloride/sodium citrate (SSC)
at about 45°C, followed by one or more washes in 0.2X SSC, 0.1% SDS at 50-65°C.
Further preferred hybridization conditions are taught in Lockhart, et al., Nature
Biotechnology, Volume 14, 1996 August: 1675-1680; Breslauer, et al., Proc. Natl. Acad.
Sci. USA, Volume 83, 1986 June: 3746-3750; Van Ness, et al., Nucleic Acids Research,
Volume 19, No. 19, 1991 September: 5143-5151; McGraw, et al., BioTechniques,
Volume 8, No. 6 1990: 674-678; and Milner, et al., Nature Biotechnology, Volume 15,
1997 June: 537-541, all expressly incorporated by reference. 
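
For orientation, the salt concentrations implied by the 6X and 0.2X SSC steps recited above can be worked out with the short Python sketch below. It assumes the conventional formulation in which 1X SSC is 150 mM sodium chloride and 15 mM sodium citrate; that formulation is general laboratory practice and is not stated in this document, and the names used are hypothetical.

```python
# Assumed conventional 1X SSC formulation, in millimolar (not stated in the source).
SSC_1X_MILLIMOLAR = {"sodium_chloride": 150.0, "sodium_citrate": 15.0}

def ssc_composition(strength):
    """Return component concentrations (mM) for SSC at the given X strength."""
    return {
        component: round(conc * strength, 3)
        for component, conc in SSC_1X_MILLIMOLAR.items()
    }

# Hybridization buffer and wash buffer strengths recited in the text.
for label, strength in (("hybridization, 6X", 6), ("wash, 0.2X", 0.2)):
    print(label, ssc_composition(strength))
```

The thirty-fold drop in salt between hybridization and washing is what makes the wash the more stringent step.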

In addition to naturally-occurring allelic variants of a nucleic acid molecule of 
the invention that can exist in the population, the skilled artisan will further appreciate 
that sequence changes can be introduced by mutation thereby leading to changes in the 
amino acid sequence of the encoded protein, without altering the biological activity of 
the protein encoded thereby. For example, one can make nucleotide substitutions 
leading to amino acid substitutions at "non-essential" amino acid residues. A "non- 
essential" amino acid residue is a residue that can be altered from the wild-type 
sequence without altering the biological activity, whereas an "essential" amino acid 
residue is required for biological activity. For example, amino acid residues that are not 
conserved or only semi-conserved among homologs of various species may be non- 
essential for activity and thus would be likely targets for alteration. Alternatively, amino 
acid residues that are conserved among the homologs of various species (e.g., murine 
and human) may be essential for activity and thus would not be likely targets for 
alteration. 

Accordingly, another aspect of the invention pertains to nucleic acid molecules 
encoding a polypeptide of the invention that contain changes in amino acid residues that 
are not essential for activity. Such polypeptides differ in amino acid sequence from the 
naturally-occurring proteins which correspond to the markers of the invention, yet retain 
biological activity. In one embodiment, such a protein has an amino acid sequence that 
is at least about 40% identical, 50%, 60%, 70%, 80%, 90%, 95%, or 98% identical to the 
amino acid sequence of one of the proteins which correspond to the markers of the 
invention. 
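
A minimal Python sketch of a percent identity calculation follows. It assumes the two amino acid sequences have already been aligned to equal length and counts identical positions over the alignment length; this passage does not specify how identity is computed, so that definition, the function name, and the example sequences are assumptions made for illustration.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent of aligned positions carrying the same residue in both sequences.

    Assumes pre-aligned, equal-length inputs; gap characters ('-') never
    count as matches.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    if not aligned_a:
        raise ValueError("sequences must not be empty")
    matches = sum(
        1 for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if x == y and x != "-"
    )
    return 100.0 * matches / len(aligned_a)

native = "MKTAYIAKQR"   # hypothetical native protein fragment
variant = "MKTAYLAKQR"  # hypothetical variant with one substitution
print(percent_identity(native, variant))
```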

An isolated nucleic acid molecule encoding a variant protein can be created by 
introducing one or more nucleotide substitutions, additions or deletions into the 
nucleotide sequence of nucleic acids of the invention, such that one or more amino acid 
residue substitutions, additions, or deletions are introduced into the encoded protein. 
Mutations can be introduced by standard techniques, such as site-directed mutagenesis 
and PCR-mediated mutagenesis. Preferably, conservative amino acid substitutions are 
made at one or more predicted non-essential amino acid residues. A "conservative 
amino acid substitution" is one in which the amino acid residue is replaced with an 
amino acid residue having a similar side chain. Families of amino acid residues having
similar side chains have been defined in the art. These families include amino acids 
with basic side chains (e.g., lysine, arginine, histidine), acidic side chains (e.g., aspartic 
acid, glutamic acid), uncharged polar side chains (e.g., glycine, asparagine, glutamine, 
serine, threonine, tyrosine, cysteine), non-polar side chains (e.g., alanine, valine, leucine,
isoleucine, proline, phenylalanine, methionine, tryptophan), beta-branched side chains 
(e.g., threonine, valine, isoleucine) and aromatic side chains (e.g., tyrosine, 
phenylalanine, tryptophan, histidine). Alternatively, mutations can be introduced 
randomly along all or part of the coding sequence, such as by saturation mutagenesis,
and the resultant mutants can be screened for biological activity to identify mutants that
retain activity. Following mutagenesis, the encoded protein can be expressed 
recombinantly and the activity of the protein can be determined. 
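
The side chain families listed above can be encoded directly, as in the following Python sketch, which tests whether a proposed substitution stays within at least one family. The family memberships are taken from the text; the one-letter residue codes, the dictionary keys, and the function names are conventions introduced here for illustration.

```python
# Side chain families as listed in the text, using one-letter residue codes.
SIDE_CHAIN_FAMILIES = {
    "basic": set("KRH"),              # lysine, arginine, histidine
    "acidic": set("DE"),              # aspartic acid, glutamic acid
    "uncharged_polar": set("GNQSTYC"),
    "non_polar": set("AVLIPFMW"),
    "beta_branched": set("TVI"),      # threonine, valine, isoleucine
    "aromatic": set("YFWH"),          # tyrosine, phenylalanine, tryptophan, histidine
}

def shared_families(original, replacement):
    """Names of the families that contain both residues, in sorted order."""
    return sorted(
        name for name, members in SIDE_CHAIN_FAMILIES.items()
        if original in members and replacement in members
    )

def is_conservative(original, replacement):
    """True when two different residues share at least one side chain family."""
    return original != replacement and bool(shared_families(original, replacement))

print(is_conservative("K", "R"), shared_families("K", "R"))
print(is_conservative("D", "K"), shared_families("D", "K"))
```

Because a residue can belong to more than one family (threonine is both uncharged polar and beta-branched), membership is checked across all families rather than by a single lookup.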

The present invention encompasses antisense nucleic acid molecules, i.e., 
molecules which are complementary to a sense nucleic acid of the invention, e.g.,
complementary to the coding strand of a double-stranded cDNA molecule
corresponding to a marker of the invention or complementary to an mRNA sequence
corresponding to a marker of the invention. Accordingly, an antisense nucleic acid of 
the invention can hydrogen bond to (i.e. anneal with) a sense nucleic acid of the 
invention. The antisense nucleic acid can be complementary to an entire coding strand,
or to only a portion thereof, e.g., all or part of the protein coding region (or open reading
frame). An antisense nucleic acid molecule can also be antisense to all or part of a non-
coding region of the coding strand of a nucleotide sequence encoding a polypeptide of
the invention. The non-coding regions ("5' and 3' untranslated regions") are the 5' and 3'
sequences which flank the coding region and are not translated into amino acids.

An antisense oligonucleotide can be, for example, about 5, 10, 15, 20, 25, 30, 35,
40, 45, or 50 or more nucleotides in length. An antisense nucleic acid of the invention
can be constructed using chemical synthesis and enzymatic ligation reactions using 
procedures known in the art. For example, an antisense nucleic acid (e.g., an antisense 
oligonucleotide) can be chemically synthesized using naturally occurring nucleotides or
variously modified nucleotides designed to increase the biological stability of the
molecules or to increase the physical stability of the duplex formed between the
antisense and sense nucleic acids, e.g., phosphorothioate derivatives and acridine
substituted nucleotides can be used. Examples of modified nucleotides which can be
used to generate the antisense nucleic acid include 5-fluorouracil, 5-bromouracil, 5-
chlorouracil, 5-iodouracil, hypoxanthine, xanthine, 4-acetylcytosine, 5-
(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl-2-thiouridine, 5-
carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, inosine,
N6-isopentenyladenine, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 2-
methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 7-
methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, beta-
D-mannosylqueosine, 5'-methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio-
N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil,
queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-
methyluracil, uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl-
2-thiouracil, 3-(3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine.
Alternatively, the antisense nucleic acid can be produced biologically using an
expression vector into which a nucleic acid has been sub-cloned in an antisense
orientation (i.e., RNA transcribed from the inserted nucleic acid will be of an antisense
orientation to a target nucleic acid of interest, described further in the following 
subsection). 

The antisense nucleic acid molecules of the invention are typically administered 
to a subject or generated in situ such that they hybridize with or bind to cellular mRNA
and/or genomic DNA encoding a polypeptide corresponding to a selected marker of the
invention to thereby inhibit expression of the marker, e.g., by inhibiting transcription
and/or translation. The hybridization can be by conventional nucleotide
complementarity to form a stable duplex, or, for example, in the case of an antisense
nucleic acid molecule which binds to DNA duplexes, through specific interactions in the
major groove of the double helix. Examples of routes of administration of antisense
nucleic acid molecules of the invention include direct injection at a tissue site or
infusion of the antisense nucleic acid into an ovary-associated body fluid. Alternatively,
antisense nucleic acid molecules can be modified to target selected cells and then
administered systemically. For example, for systemic administration, antisense
molecules can be modified such that they specifically bind to receptors or antigens
expressed on a selected cell surface, e.g., by linking the antisense nucleic acid molecules
to peptides or antibodies which bind to cell surface receptors or antigens. The antisense
nucleic acid molecules can also be delivered to cells using the vectors described herein. 
To achieve sufficient intracellular concentrations of the antisense molecules, vector 
constructs in which the antisense nucleic acid molecule is placed under the control of a
strong pol II or pol III promoter are preferred.

An antisense nucleic acid molecule of the invention can be an α-anomeric
nucleic acid molecule. An α-anomeric nucleic acid molecule forms specific double-
stranded hybrids with complementary RNA in which, contrary to the usual β-units, the
strands run parallel to each other (Gaultier et al., 1987, Nucleic Acids Res. 15:6625-
6641). The antisense nucleic acid molecule can also comprise a 2'-O-
methylribonucleotide (Inoue et al., 1987, Nucleic Acids Res. 15:6131-6148) or a
chimeric RNA-DNA analogue (Inoue et al., 1987, FEBS Lett. 215:327-330).

The invention also encompasses ribozymes. Ribozymes are catalytic RNA 
molecules with ribonuclease activity which are capable of cleaving a single-stranded
nucleic acid, such as an mRNA, to which they have a complementary region. Thus,
ribozymes (e.g., hammerhead ribozymes as described in Haseloff and Gerlach, 1988,
Nature 334:585-591) can be used to catalytically cleave mRNA transcripts to thereby
inhibit translation of the protein encoded by the mRNA. A ribozyme having specificity
for a nucleic acid molecule encoding a polypeptide corresponding to a marker of the
invention can be designed based upon the nucleotide sequence of a cDNA
corresponding to the marker. For example, a derivative of a Tetrahymena L-19 IVS
RNA can be constructed in which the nucleotide sequence of the active site is
complementary to the nucleotide sequence to be cleaved (see Cech et al., U.S. Patent No.
4,987,071; and Cech et al., U.S. Patent No. 5,116,742). Alternatively, an mRNA
encoding a polypeptide of the invention can be used to select a catalytic RNA having a
specific ribonuclease activity from a pool of RNA molecules (see, e.g., Bartel and
Szostak, 1993, Science 261:1411-1418).

The invention also encompasses nucleic acid molecules which form triple helical 
structures. For example, expression of a polypeptide of the invention can be inhibited
by targeting nucleotide sequences complementary to the regulatory region of the gene
encoding the polypeptide (e.g., the promoter and/or enhancer) to form triple helical
structures that prevent transcription of the gene in target cells. See generally Helene
(1991) Anticancer Drug Des. 6(6):569-84; Helene (1992) Ann. N.Y. Acad. Sci. 660:27-
36; and Maher (1992) Bioessays 14(12):807-15.

In various embodiments, the nucleic acid molecules of the invention can be 
modified at the base moiety, sugar moiety or phosphate backbone to improve, e.g., the 
stability, hybridization, or solubility of the molecule. For example, the deoxyribose
phosphate backbone of the nucleic acids can be modified to generate peptide nucleic
acids (see Hyrup et al., 1996, Bioorganic & Medicinal Chemistry 4(1): 5-23). As used
herein, the terms "peptide nucleic acids" or "PNAs" refer to nucleic acid mimics, e.g.,
DNA mimics, in which the deoxyribose phosphate backbone is replaced by a
pseudopeptide backbone and only the four natural nucleobases are retained. The neutral
backbone of PNAs has been shown to allow for specific hybridization to DNA and RNA
under conditions of low ionic strength. The synthesis of PNA oligomers can be
performed using standard solid phase peptide synthesis protocols as described in Hyrup
et al. (1996), supra; Perry-O'Keefe et al. (1996) Proc. Natl. Acad. Sci. USA 93:14670-
675.

PNAs can be used in therapeutic and diagnostic applications. For example, 
PNAs can be used as antisense or antigene agents for sequence-specific modulation of 
gene expression by, e.g., inducing transcription or translation arrest or inhibiting 
replication. PNAs can also be used, e.g., in the analysis of single base pair mutations in
a gene by, e.g., PNA directed PCR clamping; as artificial restriction enzymes when used
in combination with other enzymes, e.g., S1 nucleases (Hyrup (1996), supra); or as
probes or primers for DNA sequence and hybridization (Hyrup, 1996, supra; Perry-
O'Keefe et al., 1996, Proc. Natl. Acad. Sci. USA 93:14670-675).

In another embodiment, PNAs can be modified, e.g., to enhance their stability or
cellular uptake, by attaching lipophilic or other helper groups to PNA, by the formation
of PNA-DNA chimeras, or by the use of liposomes or other techniques of drug delivery
known in the art. For example, PNA-DNA chimeras can be generated which can
combine the advantageous properties of PNA and DNA. Such chimeras allow DNA
recognition enzymes, e.g., RNase H and DNA polymerases, to interact with the DNA
portion while the PNA portion would provide high binding affinity and specificity.
PNA-DNA chimeras can be linked using linkers of appropriate lengths selected in terms
of base stacking, number of bonds between the nucleobases, and orientation (Hyrup,
1996, supra). The synthesis of PNA-DNA chimeras can be performed as described in
Hyrup (1996), supra, and Finn et al. (1996) Nucleic Acids Res. 24(17):3357-63. For
example, a DNA chain can be synthesized on a solid support using standard
phosphoramidite coupling chemistry and modified nucleoside analogs. Compounds
such as 5'-(4-methoxytrityl)amino-5'-deoxy-thymidine phosphoramidite can be used as a
link between the PNA and the 5' end of DNA (Mag et al., 1989, Nucleic Acids Res.
17:5973-88). PNA monomers are then coupled in a step-wise manner to produce a
chimeric molecule with a 5' PNA segment and a 3' DNA segment (Finn et al., 1996,
Nucleic Acids Res. 24(17):3357-63). Alternatively, chimeric molecules can be
synthesized with a 5' DNA segment and a 3' PNA segment (Peterser et al., 1975,
Bioorganic Med. Chem. Lett. 5:1119-11124).

In other embodiments, the oligonucleotide can include other appended groups 
such as peptides (e.g., for targeting host cell receptors in vivo), or agents facilitating
transport across the cell membrane (see, e.g., Letsinger et al., 1989, Proc. Natl. Acad.
Sci. USA 86:6553-6556; Lemaitre et al., 1987, Proc. Natl. Acad. Sci. USA 84:648-652;
PCT Publication No. WO 88/09810) or the blood-brain barrier (see, e.g., PCT
Publication No. WO 89/10134). In addition, oligonucleotides can be modified with
hybridization-triggered cleavage agents (see, e.g., Krol et al., 1988, Bio/Techniques
6:958-976) or intercalating agents (see, e.g., Zon, 1988, Pharm. Res. 5:539-549). To
this end, the oligonucleotide can be conjugated to another molecule, e.g., a peptide,
hybridization triggered cross-linking agent, transport agent, hybridization-triggered 
cleavage agent, etc. 

The invention also includes molecular beacon nucleic acids having at least one 
region which is complementary to a nucleic acid of the invention, such that the
molecular beacon is useful for quantitating the presence of the nucleic acid of the
invention in a sample. A "molecular beacon" nucleic acid is a nucleic acid comprising a
pair of complementary regions and having a fluorophore and a fluorescent quencher
associated therewith. The fluorophore and quencher are associated with different
portions of the nucleic acid in such an orientation that when the complementary regions
are annealed with one another, fluorescence of the fluorophore is quenched by the
quencher. When the complementary regions of the nucleic acid are not annealed with
one another, fluorescence of the fluorophore is quenched to a lesser degree. Molecular
beacon nucleic acids are described, for example, in U.S. Patent 5,876,930. 



II. Isolated Proteins and Antibodies 
One aspect of the invention pertains to isolated proteins which correspond to
individual markers of the invention, and biologically active portions thereof, as well as
polypeptide fragments suitable for use as immunogens to raise antibodies directed
against a polypeptide corresponding to a marker of the invention. In one embodiment,
the native polypeptide corresponding to a marker can be isolated from cells or tissue
sources by an appropriate purification scheme using standard protein purification
techniques. In another embodiment, polypeptides corresponding to a marker of the
invention are produced by recombinant DNA techniques. As an alternative to recombinant
expression, a polypeptide corresponding to a marker of the invention can be synthesized
chemically using standard peptide synthesis techniques.

An "isolated" or "purified" protein or biologically active portion thereof is
substantially free of cellular material or other contaminating proteins from the cell or
tissue source from which the protein is derived, or substantially free of chemical
precursors or other chemicals when chemically synthesized. The language
"substantially free of cellular material" includes preparations of protein in which the
protein is separated from cellular components of the cells from which it is isolated or
recombinantly produced. Thus, protein that is substantially free of cellular material
includes preparations of protein having less than about 30%, 20%, 10%, or 5% (by dry
weight) of heterologous protein (also referred to herein as a "contaminating protein").
When the protein or biologically active portion thereof is recombinantly produced, it is
also preferably substantially free of culture medium, i.e., culture medium represents less
than about 20%, 10%, or 5% of the volume of the protein preparation. When the protein
is produced by chemical synthesis, it is preferably substantially free of chemical
precursors or other chemicals, i.e., it is separated from chemical precursors or other
chemicals which are involved in the synthesis of the protein. Accordingly, such
preparations of the protein have less than about 30%, 20%, 10%, or 5% (by dry weight) of
chemical precursors or compounds other than the polypeptide of interest.




Biologically active portions of a polypeptide corresponding to a marker of the
invention include polypeptides comprising amino acid sequences sufficiently identical to 
or derived from the amino acid sequence of the protein corresponding to the marker, 
which include fewer amino acids than the full length protein, and exhibit at least one 
activity of the corresponding full-length protein. Typically, biologically active portions 
comprise a domain or motif with at least one activity of the corresponding protein. A 
biologically active portion of a protein of the invention can be a polypeptide which is, 
for example, 10, 25, 50, 100 or more amino acids in length. Moreover, other 
biologically active portions, in which other regions of the protein are deleted, can be 
prepared by recombinant techniques and evaluated for one or more of the functional 
activities of the native form of a polypeptide of the invention. 

Preferred polypeptides are encoded by the nucleotide sequences of Tables 1-2. 
Other useful proteins are substantially identical (e.g., at least about 40%, preferably
50%, 60%, 70%, 80%, 90%, 95%, or 99%) to one of these sequences and retain the
functional activity of the corresponding naturally-occurring protein yet
differ in amino acid sequence due to natural allelic variation or mutagenesis.

To determine the percent identity of two amino acid sequences or of two nucleic 
acids, the sequences are aligned for optimal comparison purposes (e.g., gaps can be 
introduced in the sequence of a first amino acid or nucleic acid sequence for optimal 
alignment with a second amino acid or nucleic acid sequence). The amino acid residues or
nucleotides at corresponding amino acid positions or nucleotide positions are then
compared. When a position in the first sequence is occupied by the same amino acid
residue or nucleotide as the corresponding position in the second sequence, then the
molecules are identical at that position. The percent identity between the two sequences
is a function of the number of identical positions shared by the sequences (i.e., %
identity = (# of identical positions / total # of positions (e.g., overlapping positions)) x 100).
In one embodiment the two sequences are the same length. 
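
By way of illustration only, the percent identity calculation described above can be
sketched as follows. The sketch assumes that the two sequences have already been
aligned to equal length, that gaps are written as "-", and that the "total # of
positions" is taken to be the overlapping (non-gap) positions; the function name
percent_identity and the gap symbol are hypothetical choices made for this example and
are not part of any program named herein.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity of two pre-aligned sequences (illustrative sketch).

    Assumption: positions where either sequence has a gap are not counted
    as overlapping positions, so they are excluded from the denominator.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")

    compared = 0   # overlapping positions (no gap in either sequence)
    identical = 0  # positions occupied by the same residue or nucleotide
    for res_a, res_b in zip(aligned_a.upper(), aligned_b.upper()):
        if res_a == gap or res_b == gap:
            continue
        compared += 1
        if res_a == res_b:
            identical += 1

    if compared == 0:
        return 0.0
    return 100.0 * identical / compared


if __name__ == "__main__":
    # Five of six aligned positions match.
    print(round(percent_identity("ACDEFG", "ACDQFG"), 2))
```

Only exact matches are counted in this sketch; a different choice of denominator
(for example, the length of the shorter sequence) would give a different value for
gapped alignments.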

The determination of percent identity between two sequences can be 
accomplished using a mathematical algorithm. A preferred, non-limiting example of a 
mathematical algorithm utilized for the comparison of two sequences is the algorithm of
Karlin and Altschul (1990) Proc. Natl. Acad. Sci. USA 87:2264-2268, modified as in
Karlin and Altschul (1993) Proc. Natl. Acad. Sci. USA 90:5873-5877. Such an
algorithm is incorporated into the NBLAST and XBLAST programs of Altschul et al.
(1990) J. Mol. Biol. 215:403-410. BLAST nucleotide searches can be performed with
the NBLAST program, score = 100, wordlength = 12 to obtain nucleotide sequences
homologous to a nucleic acid molecule of the invention. BLAST protein searches can
be performed with the XBLAST program, score = 50, wordlength = 3 to obtain amino
acid sequences homologous to a protein molecule of the invention. To obtain gapped
alignments for comparison purposes, Gapped BLAST can be utilized as described in
Altschul et al. (1997) Nucleic Acids Res. 25:3389-3402. Alternatively, PSI-Blast can be
used to perform an iterated search which detects distant relationships between
molecules. When utilizing BLAST, Gapped BLAST, and PSI-Blast programs, the
default parameters of the respective programs (e.g., XBLAST and NBLAST) can be
used. See http://www.ncbi.nlm.nih.gov. Another preferred, non-limiting example of a
mathematical algorithm utilized for the comparison of sequences is the algorithm of
Myers and Miller (1988) CABIOS 4:11-17. Such an algorithm is incorporated into the
ALIGN program (version 2.0) which is part of the GCG sequence alignment software
package. When utilizing the ALIGN program for comparing amino acid sequences, a
PAM120 weight residue table, a gap length penalty of 12, and a gap penalty of 4 can be
used. Yet another useful algorithm for identifying regions of local sequence similarity
and alignment is the FASTA algorithm as described in Pearson and Lipman (1988)
Proc. Natl. Acad. Sci. USA 85:2444-2448. When using the FASTA algorithm for
comparing nucleotide or amino acid sequences, a PAM120 weight residue table can, for
example, be used with a k-tuple value of 2.

The percent identity between two sequences can be determined using techniques 
similar to those described above, with or without allowing gaps. In calculating percent
identity, only exact matches are counted.

The invention also provides chimeric or fusion proteins corresponding to a 
marker of the invention. As used herein, a "chimeric protein" or "fusion protein" 
comprises all or part (preferably a biologically active part) of a polypeptide 
corresponding to a marker of the invention operably linked to a heterologous
polypeptide (i.e., a polypeptide other than the polypeptide corresponding to the marker).
Within the fusion protein, the term "operably linked" is intended to indicate that the
polypeptide of the invention and the heterologous polypeptide are fused in-frame to each
other. The heterologous polypeptide can be fused to the amino-terminus or the
carboxyl-terminus of the polypeptide of the invention.

One useful fusion protein is a GST fusion protein in which a polypeptide 
corresponding to a marker of the invention is fused to the carboxyl terminus of GST 
sequences. Such fusion proteins can facilitate the purification of a recombinant
polypeptide of the invention. 

In another embodiment, the fusion protein contains a heterologous signal 
sequence at its amino terminus. For example, the native signal sequence of a 
polypeptide corresponding to a marker of the invention can be removed and replaced
with a signal sequence from another protein. For example, the gp67 secretory sequence
of the baculovirus envelope protein can be used as a heterologous signal sequence
(Ausubel et al., ed., Current Protocols in Molecular Biology, John Wiley & Sons, NY,
1992). Other examples of eukaryotic heterologous signal sequences include the
secretory sequences of melittin and human placental alkaline phosphatase (Stratagene;
La Jolla, California). In yet another example, useful prokaryotic heterologous signal
sequences include the phoA secretory signal (Sambrook et al., supra) and the protein A
secretory signal (Pharmacia Biotech; Piscataway, New Jersey).

In yet another embodiment, the fusion protein is an immunoglobulin fusion 
protein in which all or part of a polypeptide corresponding to a marker of the invention
is fused to sequences derived from a member of the immunoglobulin protein family.
The immunoglobulin fusion proteins of the invention can be incorporated into
pharmaceutical compositions and administered to a subject to inhibit an interaction
between a ligand (soluble or membrane-bound) and a protein on the surface of a cell
(receptor), to thereby suppress signal transduction in vivo. The immunoglobulin fusion
protein can be used to affect the bioavailability of a cognate ligand of a polypeptide of
the invention. Inhibition of ligand/receptor interaction can be useful therapeutically,
both for treating proliferative and differentiative disorders and for modulating (e.g.,
promoting or inhibiting) cell survival. Moreover, the immunoglobulin fusion proteins of
the invention can be used as immunogens to produce antibodies directed against a
polypeptide of the invention in a subject, to purify ligands and in screening assays to
identify molecules which inhibit the interaction of receptors with ligands.

Chimeric and fusion proteins of the invention can be produced by standard
recombinant DNA techniques. In another embodiment, the fusion gene can be
synthesized by conventional techniques including automated DNA synthesizers.
Alternatively, PCR amplification of gene fragments can be carried out using anchor
primers which give rise to complementary overhangs between two consecutive gene
fragments which can subsequently be annealed and re-amplified to generate a chimeric
gene sequence (see, e.g., Ausubel et al., supra). Moreover, many expression vectors are
commercially available that already encode a fusion moiety (e.g., a GST polypeptide).
A nucleic acid encoding a polypeptide of the invention can be cloned into such an
expression vector such that the fusion moiety is linked in-frame to the polypeptide of the
invention.

A signal sequence can be used to facilitate secretion and isolation of the secreted 
protein or other proteins of interest. Signal sequences are typically characterized by a 
core of hydrophobic amino acids which are generally cleaved from the mature protein
during secretion in one or more cleavage events. Such signal peptides contain
processing sites that allow cleavage of the signal sequence from the mature proteins as
they pass through the secretory pathway. Thus, the invention pertains to the described
polypeptides having a signal sequence, as well as to polypeptides from which the signal
sequence has been proteolytically cleaved (i.e., the cleavage products). In one
embodiment, a nucleic acid sequence encoding a signal sequence can be operably linked
in an expression vector to a protein of interest, such as a protein which is ordinarily not
secreted or is otherwise difficult to isolate. The signal sequence directs secretion of the
protein, such as from a eukaryotic host into which the expression vector is transformed,
and the signal sequence is subsequently or concurrently cleaved. The protein can then
be readily purified from the extracellular medium by art recognized methods.
Alternatively, the signal sequence can be linked to the protein of interest using a 
sequence which facilitates purification, such as with a GST domain. 

The present invention also pertains to variants of the polypeptides corresponding 
to individual markers of the invention. Such variants have an altered amino acid
sequence which can function as either agonists (mimetics) or as antagonists. Variants
can be generated by mutagenesis, e.g., discrete point mutation or truncation. An agonist
can retain substantially the same, or a subset, of the biological activities of the naturally
occurring form of the protein. An antagonist of a protein can inhibit one or more of the
activities of the naturally occurring form of the protein by, for example, competitively
binding to a downstream or upstream member of a cellular signaling cascade which
includes the protein of interest. Thus, specific biological effects can be elicited by
treatment with a variant of limited function. Treatment of a subject with a variant
having a subset of the biological activities of the naturally occurring form of the protein 
can have fewer side effects in a subject relative to treatment with the naturally occurring 
form of the protein. 

Variants of a protein of the invention which function as either agonists
(mimetics) or as antagonists can be identified by screening combinatorial libraries of
mutants, e.g., truncation mutants, of the protein of the invention for agonist or antagonist
activity. In one embodiment, a variegated library of variants is generated by
combinatorial mutagenesis at the nucleic acid level and is encoded by a variegated gene
library. A variegated library of variants can be produced by, for example, enzymatically
ligating a mixture of synthetic oligonucleotides into gene sequences such that a
degenerate set of potential protein sequences is expressible as individual polypeptides,
or alternatively, as a set of larger fusion proteins (e.g., for phage display). There are a
variety of methods which can be used to produce libraries of potential variants of the
polypeptides of the invention from a degenerate oligonucleotide sequence. Methods for
synthesizing degenerate oligonucleotides are known in the art (see, e.g., Narang, 1983,
Tetrahedron 39:3; Itakura et al., 1984, Annu. Rev. Biochem. 53:323; Itakura et al., 1984,
Science 198:1056; Ike et al., 1983, Nucleic Acid Res. 11:477).
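
As a purely illustrative sketch of what a degenerate oligonucleotide sequence
represents, the following code enumerates every specific sequence encoded by an
oligonucleotide written with the standard IUPAC ambiguity codes. The function name
expand_degenerate and the example inputs are hypothetical and are chosen only for this
example; the sketch is not a description of any synthesis method cited above.

```python
from __future__ import annotations

from itertools import product

# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def expand_degenerate(oligo: str) -> list[str]:
    """List every specific sequence encoded by a degenerate oligonucleotide."""
    try:
        choices = [IUPAC[base] for base in oligo.upper()]
    except KeyError as err:
        raise ValueError(f"unknown nucleotide code: {err.args[0]}") from None
    # Cartesian product over the allowed bases at each position.
    return ["".join(combo) for combo in product(*choices)]


if __name__ == "__main__":
    # A single degenerate codon covering four specific codons.
    print(expand_degenerate("GCN"))
```

The number of encoded sequences grows multiplicatively with each degenerate
position, which is why a short degenerate oligonucleotide can encode a large set of
potential variants.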

In addition, libraries of fragments of the coding sequence of a polypeptide 
corresponding to a marker of the invention can be used to generate a variegated
population of polypeptides for screening and subsequent selection of variants. For
example, a library of coding sequence fragments can be generated by treating a double
stranded PCR fragment of the coding sequence of interest with a nuclease under
conditions wherein nicking occurs only about once per molecule, denaturing the double
stranded DNA, renaturing the DNA to form double stranded DNA which can include
sense/antisense pairs from different nicked products, removing single stranded portions
from reformed duplexes by treatment with S1 nuclease, and ligating the resulting
fragment library into an expression vector. By this method, an expression library can be
derived which encodes amino terminal and internal fragments of various sizes of the
protein of interest. 

Several techniques are known in the art for screening gene products of 
combinatorial libraries made by point mutations or truncation, and for screening cDNA
libraries for gene products having a selected property. The most widely used
techniques, which are amenable to high through-put analysis, for screening large gene
libraries typically include cloning the gene library into replicable expression vectors,
transforming appropriate cells with the resulting library of vectors, and expressing the
combinatorial genes under conditions in which detection of a desired activity facilitates
isolation of the vector encoding the gene whose product was detected. Recursive
ensemble mutagenesis (REM), a technique which enhances the frequency of functional
mutants in the libraries, can be used in combination with the screening assays to identify
variants of a protein of the invention (Arkin and Yourvan, 1992, Proc. Natl. Acad. Sci.
USA 89:7811-7815; Delgrave et al., 1993, Protein Engineering 6(3):327-331).

An isolated polypeptide corresponding to a marker of the invention, or a
fragment thereof, can be used as an immunogen to generate antibodies using standard
techniques for polyclonal and monoclonal antibody preparation. The full-length
polypeptide or protein can be used or, alternatively, the invention provides antigenic
peptide fragments for use as immunogens. The antigenic peptide of a protein of the
invention comprises at least 8 (preferably 10, 15, 20, or 30 or more) amino acid residues
of the amino acid sequence of one of the polypeptides of the invention, and encompasses
an epitope of the protein such that an antibody raised against the peptide forms a specific
immune complex with a marker of the invention to which the protein corresponds.
Preferred epitopes encompassed by the antigenic peptide are regions that are located on
the surface of the protein, e.g., hydrophilic regions. Hydrophobicity sequence analysis,
hydrophilicity sequence analysis, or similar analyses can be used to identify hydrophilic 
regions. 
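
One simple form of such an analysis can be sketched as a sliding-window average of
per-residue hydropathy values. The sketch below assumes the Kyte-Doolittle hydropathy
scale, a window of 7 residues, and a cutoff of -1.0; the scale, window size, cutoff,
and function names are assumptions made for illustration, and any comparable scale or
parameters could be substituted.

```python
from __future__ import annotations

# Kyte-Doolittle hydropathy values (positive = hydrophobic).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def window_means(protein: str, window: int = 7) -> list[float]:
    """Mean hydropathy of each full-length window along the sequence."""
    scores = [KYTE_DOOLITTLE[res] for res in protein.upper()]
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(scores) - window + 1)
    ]


def hydrophilic_windows(
    protein: str, window: int = 7, threshold: float = -1.0
) -> list[int]:
    """0-based start positions of windows whose mean falls below threshold."""
    means = window_means(protein, window)
    return [i for i, mean in enumerate(means) if mean < threshold]


if __name__ == "__main__":
    # Hypothetical peptide: hydrophobic start, charged (hydrophilic) end.
    print(hydrophilic_windows("IIIIIKDEKR", window=5))
```

Windows flagged in this way are only candidate surface regions; the cutoff is
arbitrary and would ordinarily be tuned for the protein under study.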

An immunogen typically is used to prepare antibodies by immunizing a suitable 
(i.e., immunocompetent) subject such as a rabbit, goat, mouse, or other mammal or
vertebrate. An appropriate immunogenic preparation can contain, for example,
recombinantly-expressed or chemically-synthesized polypeptide. The preparation can
further include an adjuvant, such as Freund's complete or incomplete adjuvant, or a
similar immunostimulatory agent. 

Accordingly, another aspect of the invention pertains to antibodies directed 
against a polypeptide of the invention. The terms "antibody" and "antibody substance"
as used interchangeably herein refer to immunoglobulin molecules and immunologically
active portions of immunoglobulin molecules, i.e., molecules that contain an antigen
binding site which specifically binds an antigen, such as a polypeptide of the invention,
e.g., an epitope of a polypeptide of the invention. A molecule which specifically binds
to a given polypeptide of the invention is a molecule which binds the polypeptide, but
does not substantially bind other molecules in a sample, e.g., a biological sample, which
naturally contains the polypeptide. Examples of immunologically active portions of
immunoglobulin molecules include F(ab) and F(ab')2 fragments which can be generated
by treating the antibody with an enzyme such as pepsin. The invention provides
polyclonal and monoclonal antibodies. The term "monoclonal antibody" or
"monoclonal antibody composition", as used herein, refers to a population of antibody
molecules that contain only one species of an antigen binding site capable of 
immunoreacting with a particular epitope. 

Polyclonal antibodies can be prepared as described above by immunizing a 
suitable subject with a polypeptide of the invention as an immunogen. Preferred
polyclonal antibody compositions are ones that have been selected for antibodies
directed against a polypeptide or polypeptides of the invention. Particularly preferred
polyclonal antibody preparations are ones that contain only antibodies directed against a
polypeptide or polypeptides of the invention. Particularly preferred immunogen
compositions are those that contain no other human proteins such as, for example,
immunogen compositions made using a non-human host cell for recombinant expression
of a polypeptide of the invention. In such a manner, the only human epitope or epitopes 
recognized by the resulting antibody compositions raised against this immunogen will 
be present as part of a polypeptide or polypeptides of the invention. 

The antibody titer in the immunized subject can be monitored over time by
standard techniques, such as with an enzyme linked immunosorbent assay (ELISA)
using immobilized polypeptide. If desired, the antibody molecules can be harvested or
isolated from the subject (e.g., from the blood or serum of the subject) and further
purified by well-known techniques, such as protein A chromatography to obtain the IgG
fraction. Alternatively, antibodies specific for a protein or polypeptide of the invention
can be selected (e.g., partially purified) or purified by, e.g., affinity chromatography.
For example, a recombinantly expressed and purified (or partially purified) protein of
the invention is produced as described herein, and covalently or non-covalently coupled
to a solid support such as, for example, a chromatography column. The column can then
be used to affinity purify antibodies specific for the proteins of the invention from a
sample containing antibodies directed against a large number of different epitopes,
thereby generating a substantially purified antibody composition, i.e., one that is
substantially free of contaminating antibodies. By a substantially purified antibody
composition is meant, in this context, that the antibody sample contains at most only
30% (by dry weight) of contaminating antibodies directed against epitopes other than
those of the desired protein or polypeptide of the invention, and preferably at most 20%,
yet more preferably at most 10%, and most preferably at most 5% (by dry weight) of the
sample is contaminating antibodies. A purified antibody composition means that at least
99% of the antibodies in the composition are directed against the desired protein or 
polypeptide of the invention. 

At an appropriate time after immunization, e.g., when the specific antibody titers 
are highest, antibody-producing cells can be obtained from the subject and used to
prepare monoclonal antibodies by standard techniques, such as the hybridoma technique
originally described by Kohler and Milstein (1975) Nature 256:495-497, the human B
cell hybridoma technique (see Kozbor et al., 1983, Immunol. Today 4:72), the EBV-
hybridoma technique (see Cole et al., pp. 77-96 In Monoclonal Antibodies and Cancer
Therapy, Alan R. Liss, Inc., 1985) or trioma techniques. The technology for producing
hybridomas is well known (see generally Current Protocols in Immunology, Coligan et
al. ed., John Wiley & Sons, New York, 1994). Hybridoma cells producing a
monoclonal antibody of the invention are detected by screening the hybridoma culture 
supernatants for antibodies that bind the polypeptide of interest, e.g., using a standard 
ELISA assay. 

Alternative to preparing monoclonal antibody-secreting hybridomas, a
monoclonal antibody directed against a polypeptide of the invention can be identified
and isolated by screening a recombinant combinatorial immunoglobulin library (e.g., an
antibody phage display library) with the polypeptide of interest. Kits for generating and
screening phage display libraries are commercially available (e.g., the Pharmacia
Recombinant Phage Antibody System, Catalog No. 27-9400-01; and the Stratagene
SurfZAP Phage Display Kit, Catalog No. 240612). Additionally, examples of methods
and reagents particularly amenable for use in generating and screening antibody display
libraries can be found in, for example, U.S. Patent No. 5,223,409; PCT Publication No.
WO 92/18619; PCT Publication No. WO 91/17271; PCT Publication No. WO
92/20791; PCT Publication No. WO 92/15679; PCT Publication No. WO 93/01288;
PCT Publication No. WO 92/01047; PCT Publication No. WO 92/09690; PCT
Publication No. WO 90/02809; Fuchs et al. (1991) Bio/Technology 9:1370-1372; Hay et
al. (1992) Hum. Antibod. Hybridomas 3:81-85; Huse et al. (1989) Science 246:1275-
1281; Griffiths et al. (1993) EMBO J. 12:725-734.

Additionally, recombinant antibodies, such as chimeric and humanized 
monoclonal antibodies, comprising both human and non-human portions, which can be
made using standard recombinant DNA techniques, are within the scope of the
invention. A chimeric antibody is a molecule in which different portions are derived
from different animal species, such as those having a variable region derived from a
murine mAb and a human immunoglobulin constant region. (See, e.g., Cabilly et al.,
U.S. Patent No. 4,816,567; and Boss et al., U.S. Patent No. 4,816,397, which are
incorporated herein by reference in their entirety.) Humanized antibodies are antibody
molecules from non-human species having one or more complementarity determining
regions (CDRs) from the non-human species and a framework region from a human
immunoglobulin molecule. (See, e.g., Queen, U.S. Patent No. 5,585,089, which is
incorporated herein by reference in its entirety.) Such chimeric and humanized
monoclonal antibodies can be produced by recombinant DNA techniques known in the
art, for example using methods described in PCT Publication No. WO 87/02671;
European Patent Application 184,187; European Patent Application 171,496; European
Patent Application 173,494; PCT Publication No. WO 86/01533; U.S. Patent No.
4,816,567; European Patent Application 125,023; Better et al. (1988) Science 240:1041-
1043; Liu et al. (1987) Proc. Natl. Acad. Sci. USA 84:3439-3443; Liu et al. (1987) J.
Immunol. 139:3521-3526; Sun et al. (1987) Proc. Natl. Acad. Sci. USA 84:214-218;
Nishimura et al. (1987) Cancer Res. 47:999-1005; Wood et al. (1985) Nature 314:446-
449; Shaw et al. (1988) J. Natl. Cancer Inst. 80:1553-1559; Morrison (1985)
Science 229:1202-1207; Oi et al. (1986) Bio/Techniques 4:214; U.S. Patent 5,225,539;
Jones et al. (1986) Nature 321:522-525; Verhoeyen et al. (1988) Science 239:1534; and
Beidler et al. (1988) J. Immunol. 141:4053-4060.

Antibodies of the invention may be used as therapeutic agents in treating

cancers. In a preferred embodiment, completely human antibodies of the invention are
used for therapeutic treatment of human cancer patients, particularly those having an
ovarian cancer. Such antibodies can be produced, for example, using transgenic mice
which are incapable of expressing endogenous immunoglobulin heavy and light chain
genes, but which can express human heavy and light chain genes. The transgenic mice
are immunized in the normal fashion with a selected antigen, e.g., all or a portion of a
polypeptide corresponding to a marker of the invention. Monoclonal antibodies directed
against the antigen can be obtained using conventional hybridoma technology. The
human immunoglobulin transgenes harbored by the transgenic mice rearrange during B
cell differentiation, and subsequently undergo class switching and somatic mutation.
Thus, using such a technique, it is possible to produce therapeutically useful IgG, IgA
and IgE antibodies. For an overview of this technology for producing human antibodies,
see Lonberg and Huszar (1995) Int. Rev. Immunol. 13:65-93. For a detailed discussion
of this technology for producing human antibodies and human monoclonal antibodies
and protocols for producing such antibodies, see, e.g., U.S. Patent 5,625,126; U.S.
Patent 5,633,425; U.S. Patent 5,569,825; U.S. Patent 5,661,016; and U.S. Patent
5,545,806. In addition, companies such as Abgenix, Inc. (Fremont, CA), can be
engaged to provide human antibodies directed against a selected antigen using 
technology similar to that described above. 

Completely human antibodies which recognize a selected epitope can be
generated using a technique referred to as "guided selection." In this approach a
selected non-human monoclonal antibody, e.g., a murine antibody, is used to guide the
selection of a completely human antibody recognizing the same epitope (Jespers et al.,
1994, Bio/technology 12:899-903).

An antibody directed against a polypeptide corresponding to a marker of the
invention (e.g., a monoclonal antibody) can be used to isolate the polypeptide by
standard techniques, such as affinity chromatography or immunoprecipitation.
Moreover, such an antibody can be used to detect the marker (e.g., in a cellular lysate or
cell supernatant) in order to evaluate the level and pattern of expression of the marker.
The antibodies can also be used diagnostically to monitor protein levels in tissues or
body fluids (e.g., in an ovary-associated body fluid) as part of a clinical testing
procedure, e.g., to determine the efficacy of a given treatment regimen.
Detection can be facilitated by coupling the antibody to a detectable substance. 
Examples of detectable substances include various enzymes, prosthetic groups, 
fluorescent materials, luminescent materials, bioluminescent materials, and radioactive 
materials. Examples of suitable enzymes include horseradish peroxidase, alkaline
phosphatase, β-galactosidase, or acetylcholinesterase; examples of suitable prosthetic
group complexes include streptavidin/biotin and avidin/biotin; examples of suitable
fluorescent materials include umbelliferone, fluorescein, fluorescein isothiocyanate,
rhodamine, dichlorotriazinylamine fluorescein, dansyl chloride or phycoerythrin; an
example of a luminescent material includes luminol; examples of bioluminescent
materials include luciferase, luciferin, and aequorin; and examples of suitable
radioactive material include ¹²⁵I, ¹³¹I, ³⁵S or ³H.

Further, an antibody (or fragment thereof) can be conjugated to a therapeutic 
moiety such as a cytotoxin, a therapeutic agent or a radioactive metal ion. A cytotoxin 
or cytotoxic agent includes any agent that is detrimental to cells. Examples include
taxol, cytochalasin B, gramicidin D, ethidium bromide, emetine, mitomycin, etoposide,
teniposide, vincristine, vinblastine, colchicine, doxorubicin, daunorubicin, dihydroxy
anthracin dione, mitoxantrone, mithramycin, actinomycin D, 1-dehydrotestosterone,
glucocorticoids, procaine, tetracaine, lidocaine, propranolol, and puromycin and analogs
or homologs thereof. Therapeutic agents include, but are not limited to, antimetabolites
(e.g., methotrexate, 6-mercaptopurine, 6-thioguanine, cytarabine, 5-fluorouracil,
dacarbazine), alkylating agents (e.g., mechlorethamine, thiotepa, chlorambucil,
melphalan, carmustine (BCNU) and lomustine (CCNU), cyclophosphamide, busulfan,
dibromomannitol, streptozotocin, mitomycin C, and cis-dichlorodiamine platinum (II)
(DDP) cisplatin), anthracyclines (e.g., daunorubicin (formerly daunomycin) and
doxorubicin), antibiotics (e.g., dactinomycin (formerly actinomycin), bleomycin,
mithramycin, and anthramycin (AMC)), and anti-mitotic agents (e.g., vincristine and
vinblastine).

The conjugates of the invention can be used for modifying a given biological
response; the drug moiety is not to be construed as limited to classical chemical
therapeutic agents. For example, the drug moiety may be a protein or polypeptide
possessing a desired biological activity. Such proteins may include, for example, a toxin
such as abrin, ricin A, pseudomonas exotoxin, or diphtheria toxin; a protein such as
tumor necrosis factor, α-interferon, β-interferon, nerve growth factor, platelet
derived growth factor, tissue plasminogen activator; or, biological response modifiers
such as, for example, lymphokines, interleukin-1 ("IL-1"), interleukin-2 ("IL-2"),
interleukin-6 ("IL-6"), granulocyte macrophage colony stimulating factor ("GM-CSF"),
granulocyte colony stimulating factor ("G-CSF"), or other growth factors.

Techniques for conjugating such a therapeutic moiety to antibodies are well
known; see, e.g., Arnon et al., "Monoclonal Antibodies For Immunotargeting Of Drugs
In Cancer Therapy", in Monoclonal Antibodies And Cancer Therapy, Reisfeld et al.
(eds.), pp. 243-56 (Alan R. Liss, Inc. 1985); Hellstrom et al., "Antibodies For Drug
Delivery", in Controlled Drug Delivery (2nd Ed.), Robinson et al. (eds.), pp. 623-53
(Marcel Dekker, Inc. 1987); Thorpe, "Antibody Carriers Of Cytotoxic Agents In Cancer
Therapy: A Review", in Monoclonal Antibodies '84: Biological And Clinical
Applications, Pinchera et al. (eds.), pp. 475-506 (1985); "Analysis, Results, And Future
Prospective Of The Therapeutic Use Of Radiolabeled Antibody In Cancer Therapy", in
Monoclonal Antibodies For Cancer Detection And Therapy, Baldwin et al. (eds.), pp.
303-16 (Academic Press 1985), and Thorpe et al., "The Preparation And Cytotoxic
Properties Of Antibody-Toxin Conjugates", Immunol. Rev., 62:119-58 (1982).

Alternatively, an antibody can be conjugated to a second antibody to form an 
antibody heteroconjugate as described by Segal in U.S. Patent No. 4,676,980. 

Accordingly, in one aspect, the invention provides substantially purified
antibodies or fragments thereof, and non-human antibodies or fragments thereof, which
antibodies or fragments specifically bind to a polypeptide comprising an amino acid
sequence selected from the group consisting of the amino acid sequences of the present
invention, an amino acid sequence encoded by the cDNA of the present invention, a
fragment of at least 15 amino acid residues of an amino acid sequence of the present
invention, an amino acid sequence which is at least 95% identical to the amino acid
sequence of the present invention (wherein the percent identity is determined using the
ALIGN program of the GCG software package with a PAM120 weight residue table, a
gap length penalty of 12, and a gap penalty of 4) and an amino acid sequence which is
encoded by a nucleic acid molecule which hybridizes to a nucleic acid molecule
consisting of the nucleic acid molecules of the present invention, or a complement
thereof, under conditions of hybridization of 6X SSC at 45°C and washing in 0.2 X
SSC, 0.1% SDS at 65°C. In various embodiments, the substantially purified antibodies
of the invention, or fragments thereof, can be human, non-human, chimeric and/or
humanized antibodies.
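
The "at least 95% identical" criterion above can be illustrated with a minimal sketch. The sketch assumes that a pairwise alignment has already been produced (e.g., by an alignment program run with the stated scoring parameters) and only counts identical positions; it does not reproduce the ALIGN program. The function names, the use of "-" as the gap character, and the choice of the number of alignment columns as the denominator are assumptions made for illustration.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity between two rows of an existing pairwise alignment.

    Assumption: gaps are written as '-', and identity is expressed relative
    to the number of alignment columns (columns that are gaps in both rows
    are ignored). Other denominators are also in common use.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [
        (a, b)
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if not (a == "-" and b == "-")
    ]
    if not columns:
        raise ValueError("alignment has no informative columns")
    # A column counts as a match only if both residues are identical non-gaps.
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


def meets_identity_threshold(aligned_a: str, aligned_b: str,
                             threshold: float = 95.0) -> bool:
    """True if the aligned pair is at least `threshold` percent identical."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

For example, two aligned 20-residue sequences that differ at a single position would be 95% identical under this counting convention and would therefore satisfy the threshold.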

In another aspect, the invention provides non-human antibodies or fragments
thereof, which antibodies or fragments specifically bind to a polypeptide comprising an
amino acid sequence selected from the group consisting of: the amino acid sequence of
the present invention, an amino acid sequence encoded by the cDNA of the present
invention, a fragment of at least 15 amino acid residues of the amino acid sequence of
the present invention, an amino acid sequence which is at least 95% identical to the
amino acid sequence of the present invention (wherein the percent identity is determined
using the ALIGN program of the GCG software package with a PAM120 weight residue
table, a gap length penalty of 12, and a gap penalty of 4) and an amino acid sequence
which is encoded by a nucleic acid molecule which hybridizes to a nucleic acid
molecule consisting of the nucleic acid molecules of the present invention, or a
complement thereof, under conditions of hybridization of 6X SSC at 45°C and washing
in 0.2 X SSC, 0.1% SDS at 65°C. Such non-human antibodies can be goat, mouse,
sheep, horse, chicken, rabbit, or rat antibodies. Alternatively, the non-human antibodies
of the invention can be chimeric and/or humanized antibodies. In addition, the
non-human antibodies of the invention can be polyclonal antibodies or monoclonal
antibodies.

In still a further aspect, the invention provides monoclonal antibodies or 
fragments thereof, which antibodies or fragments specifically bind to a polypeptide 
comprising an amino acid sequence selected from the group consisting of the amino acid 
sequences of the present invention, an amino acid sequence encoded by the cDNA of the 
present invention, a fragment of at least 15 amino acid residues of an amino acid
sequence of the present invention, an amino acid sequence which is at least 95%
identical to an amino acid sequence of the present invention (wherein the percent
identity is determined using the ALIGN program of the GCG software package with a
PAM120 weight residue table, a gap length penalty of 12, and a gap penalty of 4) and an
amino acid sequence which is encoded by a nucleic acid molecule which hybridizes to a
nucleic acid molecule consisting of the nucleic acid molecules of the present invention,
or a complement thereof, under conditions of hybridization of 6X SSC at 45°C and
washing in 0.2 X SSC, 0.1% SDS at 65°C. The monoclonal antibodies can be human, 
humanized, chimeric and/or non-human antibodies. 

The substantially purified antibodies or fragments thereof may specifically bind 
to a signal peptide, a secreted sequence, an extracellular domain, a transmembrane or a
cytoplasmic domain or cytoplasmic membrane of a polypeptide of the invention. In a
particularly preferred embodiment, the substantially purified antibodies or fragments 
thereof, the non-human antibodies or fragments thereof, and/or the monoclonal 
antibodies or fragments thereof, of the invention specifically bind to a secreted sequence 
or an extracellular domain of the amino acid sequences of the present invention. 

Any of the antibodies of the invention can be conjugated to a therapeutic moiety
or to a detectable substance. Non-limiting examples of detectable substances that can be
conjugated to the antibodies of the invention are an enzyme, a prosthetic group, a 
fluorescent material, a luminescent material, a bioluminescent material, and a 
radioactive material. 

The invention also provides a kit containing an antibody of the invention
conjugated to a detectable substance, and instructions for use. Still another aspect of the
invention is a pharmaceutical composition comprising an antibody of the invention and
a pharmaceutically acceptable carrier. In preferred embodiments, the pharmaceutical
composition contains an antibody of the invention, a therapeutic moiety, and a
pharmaceutically acceptable carrier.

Still another aspect of the invention is a method of making an antibody that 
specifically recognizes a polypeptide of the present invention, the method comprising 
immunizing a mammal with a polypeptide. The polypeptide used as an immunogen
comprises an amino acid sequence selected from the group consisting of the amino acid
sequence of the present invention, an amino acid sequence encoded by the cDNA of the
nucleic acid molecules of the present invention, a fragment of at least 15 amino acid
residues of the amino acid sequence of the present invention, an amino acid sequence
which is at least 95% identical to the amino acid sequence of the present invention
(wherein the percent identity is determined using the ALIGN program of the GCG
software package with a PAM120 weight residue table, a gap length penalty of 12, and a
gap penalty of 4) and an amino acid sequence which is encoded by a nucleic acid
molecule which hybridizes to a nucleic acid molecule consisting of the nucleic acid
molecules of the present invention, or a complement thereof, under conditions of 
hybridization of 6X SSC at 45°C and washing in 0.2 X SSC, 0.1% SDS at 65°C. 

After immunization, a sample is collected from the mammal that contains an 
antibody that specifically recognizes the polypeptide. Preferably, the polypeptide is 
recombinantly produced using a non-human host cell. Optionally, the antibodies can be
further purified from the sample using techniques well known to those of skill in the art.
The method can further comprise producing a monoclonal antibody-producing cell from
the cells of the mammal. Optionally, antibodies are collected from the 
antibody-producing cell. 

III. Recombinant Expression Vectors and Host Cells 

Another aspect of the invention pertains to vectors, preferably expression 
vectors, containing a nucleic acid encoding a polypeptide corresponding to a marker of 
the invention (or a portion of such a polypeptide). As used herein, the term "vector"
refers to a nucleic acid molecule capable of transporting another nucleic acid to which it
has been linked. One type of vector is a "plasmid", which refers to a circular
double-stranded DNA loop into which additional DNA segments can be ligated. Another type
of vector is a viral vector, wherein additional DNA segments can be ligated into the viral 
genome. Certain vectors are capable of autonomous replication in a host cell into which
they are introduced (e.g., bacterial vectors having a bacterial origin of replication and
episomal mammalian vectors). Other vectors (e.g., non-episomal mammalian vectors) 
are integrated into the genome of a host cell upon introduction into the host cell, and 
thereby are replicated along with the host genome. Moreover, certain vectors, namely 
expression vectors, are capable of directing the expression of genes to which they are
operably linked. In general, expression vectors of utility in recombinant DNA
techniques are often in the form of plasmids (vectors). However, the invention is
intended to include such other forms of expression vectors, such as viral vectors (e.g.,
replication-defective retroviruses, adenoviruses and adeno-associated viruses), which
serve equivalent functions. 

The recombinant expression vectors of the invention comprise a nucleic acid of 
the invention in a form suitable for expression of the nucleic acid in a host cell. This
means that the recombinant expression vectors include one or more regulatory
sequences, selected on the basis of the host cells to be used for expression, which are
operably linked to the nucleic acid sequence to be expressed. Within a recombinant
expression vector, "operably linked" is intended to mean that the nucleotide sequence of
interest is linked to the regulatory sequence(s) in a manner which allows for expression
of the nucleotide sequence (e.g., in an in vitro transcription/translation system or in a
host cell when the vector is introduced into the host cell). The term "regulatory
sequence" is intended to include promoters, enhancers and other expression control
elements (e.g., polyadenylation signals). Such regulatory sequences are described, for
example, in Goeddel, Methods in Enzymology: Gene Expression Technology vol. 185,
Academic Press, San Diego, CA (1991). Regulatory sequences include those which
direct constitutive expression of a nucleotide sequence in many types of host cell and 
those which direct expression of the nucleotide sequence only in certain host cells (e.g., 
tissue-specific regulatory sequences). It will be appreciated by those skilled in the art 
that the design of the expression vector can depend on such factors as the choice of the
host cell to be transformed, the level of expression of protein desired, and the like. The
expression vectors of the invention can be introduced into host cells to thereby produce 
proteins or peptides, including fusion proteins or peptides, encoded by nucleic acids as 
described herein. 

The recombinant expression vectors of the invention can be designed for
expression of a polypeptide corresponding to a marker of the invention in prokaryotic
(e.g., E. coli) or eukaryotic cells (e.g., insect cells (using baculovirus expression
vectors), yeast cells or mammalian cells). Suitable host cells are discussed further in
Goeddel, supra. Alternatively, the recombinant expression vector can be transcribed
and translated in vitro, for example using T7 promoter regulatory sequences and T7
polymerase.

Expression of proteins in prokaryotes is most often carried out in E. coli with
vectors containing constitutive or inducible promoters directing the expression of either 
fusion or non-fusion proteins. Fusion vectors add a number of amino acids to a protein 
encoded therein, usually to the amino terminus of the recombinant protein. Such fusion
vectors typically serve three purposes: 1) to increase expression of recombinant protein;
2) to increase the solubility of the recombinant protein; and 3) to aid in the purification 
of the recombinant protein by acting as a ligand in affinity purification. Often, in fusion 
expression vectors, a proteolytic cleavage site is introduced at the junction of the fusion 
moiety and the recombinant protein to enable separation of the recombinant protein
from the fusion moiety subsequent to purification of the fusion protein. Such enzymes,
and their cognate recognition sequences, include Factor Xa, thrombin and enterokinase. 
Typical fusion expression vectors include pGEX (Pharmacia Biotech Inc; Smith and 
Johnson, 1988, Gene 67:31-40), pMAL (New England Biolabs, Beverly, MA) and 
pRIT5 (Pharmacia, Piscataway, NJ) which fuse glutathione S-transferase (GST),
maltose E binding protein, or protein A, respectively, to the target recombinant protein.
Examples of suitable inducible non-fusion E. coli expression vectors include 
pTrc (Amann et al., 1988, Gene 69:301-315) and pET 11d (Studier et al., p. 60-89, In
Gene Expression Technology: Methods in Enzymology vol. 185, Academic Press, San
Diego, CA, 1991). Target gene expression from the pTrc vector relies on host RNA
polymerase transcription from a hybrid trp-lac fusion promoter. Target gene expression
from the pET 11d vector relies on transcription from a T7 gn10-lac fusion promoter
mediated by a co-expressed viral RNA polymerase (T7 gn1). This viral polymerase is
supplied by host strains BL21(DE3) or HMS174(DE3) from a resident prophage
harboring a T7 gn1 gene under the transcriptional control of the lacUV5 promoter.

One strategy to maximize recombinant protein expression in E. coli is to express
the protein in a host bacterium with an impaired capacity to proteolytically cleave the
recombinant protein (Gottesman, p. 119-128, In Gene Expression Technology: Methods
in Enzymology vol. 185, Academic Press, San Diego, CA, 1990). Another strategy is to
alter the nucleic acid sequence of the nucleic acid to be inserted into an expression
vector so that the individual codons for each amino acid are those preferentially utilized
in E. coli (Wada et al., 1992, Nucleic Acids Res. 20:2111-2118). Such alteration of
nucleic acid sequences of the invention can be carried out by standard DNA synthesis
techniques. 
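
The codon-alteration strategy can be sketched as a simple back-translation in which each amino acid is assigned a single preferred codon. The table below is illustrative only: it lists one codon per amino acid that is often reported as frequently used in E. coli, and it is an assumption rather than data from this text or from Wada et al. A practical design would draw on a measured codon usage table and would typically also consider factors such as mRNA structure and restriction sites. The names `PREFERRED_CODON` and `back_translate` are hypothetical.

```python
# Illustrative one-codon-per-residue table (an assumption for this sketch).
# Replace it with values from a measured codon usage table for the actual host.
PREFERRED_CODON = {
    "A": "GCG", "R": "CGT", "N": "AAC", "D": "GAT", "C": "TGC",
    "Q": "CAG", "E": "GAA", "G": "GGC", "H": "CAT", "I": "ATT",
    "L": "CTG", "K": "AAA", "M": "ATG", "F": "TTT", "P": "CCG",
    "S": "AGC", "T": "ACC", "W": "TGG", "Y": "TAT", "V": "GTG",
}


def back_translate(protein: str, stop_codon: str = "") -> str:
    """Encode a protein sequence using one preferred codon per amino acid.

    `stop_codon` is appended as given (e.g. "TAA"); it is empty by default.
    """
    codons = []
    for position, residue in enumerate(protein.upper(), start=1):
        try:
            codons.append(PREFERRED_CODON[residue])
        except KeyError:
            raise ValueError(
                f"unsupported residue {residue!r} at position {position}"
            ) from None
    return "".join(codons) + stop_codon
```

Under the table assumed here, the tripeptide Met-Lys-Leu would be encoded as ATGAAACTG. The resulting sequence would then be prepared by DNA synthesis as described above.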

In another embodiment, the expression vector is a yeast expression vector. 
Examples of vectors for expression in the yeast S. cerevisiae include pYepSec1 (Baldari et
al., 1987, EMBO J. 6:229-234), pMFa (Kurjan and Herskowitz, 1982, Cell 30:933-
943), pJRY88 (Schultz et al., 1987, Gene 54:113-123), pYES2 (Invitrogen Corporation,
San Diego, CA), and pPicZ (Invitrogen Corp, San Diego, CA). 

Alternatively, the expression vector is a baculovirus expression vector. 
Baculovirus vectors available for expression of proteins in cultured insect cells (e.g., Sf
9 cells) include the pAc series (Smith et al., 1983, Mol. Cell Biol. 3:2156-2165) and the
pVL series (Luckow and Summers, 1989, Virology 170:31-39).

In yet another embodiment, a nucleic acid of the invention is expressed in 
mammalian cells using a mammalian expression vector. Examples of mammalian 
expression vectors include pCDM8 (Seed, 1987, Nature 329:840) and pMT2PC
(Kaufman et al., 1987, EMBO J. 6:187-195). When used in mammalian cells, the
expression vector's control functions are often provided by viral regulatory elements.
For example, commonly used promoters are derived from polyoma, Adenovirus 2,
cytomegalovirus and Simian Virus 40. For other suitable expression systems for both
prokaryotic and eukaryotic cells see chapters 16 and 17 of Sambrook et al., supra.

In another embodiment, the recombinant mammalian expression vector is
capable of directing expression of the nucleic acid preferentially in a particular cell type
(e.g., tissue-specific regulatory elements are used to express the nucleic acid).
Tissue-specific regulatory elements are known in the art. Non-limiting examples of suitable
tissue-specific promoters include the albumin promoter (liver-specific; Pinkert et al.,
1987, Genes Dev. 1:268-277), lymphoid-specific promoters (Calame and Eaton, 1988,
Adv. Immunol. 43:235-275), in particular promoters of T cell receptors (Winoto and
Baltimore, 1989, EMBO J. 8:729-733) and immunoglobulins (Banerji et al., 1983, Cell
33:729-740; Queen and Baltimore, 1983, Cell 33:741-748), neuron-specific promoters
(e.g., the neurofilament promoter; Byrne and Ruddle, 1989, Proc. Natl. Acad. Sci. USA
86:5473-5477), pancreas-specific promoters (Edlund et al., 1985, Science 230:912-916),
and mammary gland-specific promoters (e.g., milk whey promoter; U.S. Patent No.
4,873,316 and European Application Publication No. 264,166).
Developmentally-regulated promoters are also encompassed, for example the murine hox promoters
(Kessel and Gruss, 1990, Science 249:374-379) and the α-fetoprotein promoter (Camper
and Tilghman, 1989, Genes Dev. 3:537-546). 

The invention further provides a recombinant expression vector comprising a
DNA molecule of the invention cloned into the expression vector in an antisense
orientation. That is, the DNA molecule is operably linked to a regulatory sequence in a
manner which allows for expression (by transcription of the DNA molecule) of an RNA
molecule which is antisense to the mRNA encoding a polypeptide of the invention.
Regulatory sequences operably linked to a nucleic acid cloned in the antisense
orientation can be chosen which direct the continuous expression of the antisense RNA
molecule in a variety of cell types, for instance viral promoters and/or enhancers, or
regulatory sequences can be chosen which direct constitutive, tissue-specific or cell type
specific expression of antisense RNA. The antisense expression vector can be in the
form of a recombinant plasmid, phagemid, or attenuated virus in which antisense nucleic
acids are produced under the control of a high efficiency regulatory region, the activity
of which can be determined by the cell type into which the vector is introduced. For a
discussion of the regulation of gene expression using antisense genes see Weintraub et
al., 1986, Trends in Genetics, Vol. 1(1).

Another aspect of the invention pertains to host cells into which a recombinant
expression vector of the invention has been introduced. The terms "host cell" and
"recombinant host cell" are used interchangeably herein. It is understood that such 
terms refer not only to the particular subject cell but to the progeny or potential progeny 
of such a cell. Because certain modifications may occur in succeeding generations due 
to either mutation or environmental influences, such progeny may not, in fact, be
identical to the parent cell, but are still included within the scope of the term as used
herein. 

A host cell can be any prokaryotic (e.g., E. coli) or eukaryotic cell (e.g., insect
cells, yeast or mammalian cells). 

Vector DNA can be introduced into prokaryotic or eukaryotic cells via
conventional transformation or transfection techniques. As used herein, the terms
"transformation" and "transfection" are intended to refer to a variety of art-recognized
techniques for introducing foreign nucleic acid into a host cell, including calcium
phosphate or calcium chloride co-precipitation, DEAE-dextran-mediated transfection,
lipofection, or electroporation. Suitable methods for transforming or transfecting host
cells can be found in Sambrook et al. (supra), and other laboratory manuals.

For stable transfection of mammalian cells, it is known that, depending upon the 
expression vector and transfection technique used, only a small fraction of cells may 
integrate the foreign DNA into their genome. In order to identify and select these 
integrants, a gene that encodes a selectable marker (e.g., for resistance to antibiotics) is 
generally introduced into the host cells along with the gene of interest. Preferred 
selectable markers include those which confer resistance to drugs, such as G418, 
hygromycin and methotrexate. Cells stably transfected with the introduced nucleic acid 
can be identified by drug selection (e.g., cells that have incorporated the selectable
marker gene will survive, while the other cells die). 

A host cell of the invention, such as a prokaryotic or eukaryotic host cell in 
culture, can be used to produce a polypeptide corresponding to a marker of the 
invention. Accordingly, the invention further provides methods for producing a 
polypeptide corresponding to a marker of the invention using the host cells of the 
invention. In one embodiment, the method comprises culturing the host cell of the
invention (into which a recombinant expression vector encoding a polypeptide of the
invention has been introduced) in a suitable medium such that the marker is produced. 
In another embodiment, the method further comprises isolating the marker polypeptide 
from the medium or the host cell. 

The host cells of the invention can also be used to produce nonhuman transgenic 
animals. For example, in one embodiment, a host cell of the invention is a fertilized 
oocyte or an embryonic stem cell into which sequences encoding a polypeptide
corresponding to a marker of the invention have been introduced. Such host cells can
then be used to create non-human transgenic animals in which exogenous sequences
encoding a marker protein of the invention have been introduced into their genome or
homologous recombinant animals in which endogenous gene(s) encoding a polypeptide
corresponding to a marker of the invention have been altered. Such animals
are useful for studying the function and/or activity of the polypeptide corresponding to 
the marker and for identifying and/or evaluating modulators of polypeptide activity. As 
used herein, a "transgenic animal" is a non-human animal, preferably a mammal, more
preferably a rodent such as a rat or mouse, in which one or more of the cells of the
animal includes a transgene. Other examples of transgenic animals include non-human 
primates, sheep, dogs, cows, goats, chickens, amphibians, etc. A transgene is exogenous 
DNA which is integrated into the genome of a cell from which a transgenic animal 
develops and which remains in the genome of the mature animal, thereby directing the
expression of an encoded gene product in one or more cell types or tissues of the 
transgenic animal. As used herein, an "homologous recombinant animal" is a non- 
human animal, preferably a mammal, more preferably a mouse, in which an endogenous 
gene has been altered by homologous recombination between the endogenous gene and
an exogenous DNA molecule introduced into a cell of the animal, e.g., an embryonic
cell of the animal, prior to development of the animal. 

A transgenic animal of the invention can be created by introducing a nucleic acid 
encoding a polypeptide corresponding to a marker of the invention into the male 
pronuclei of a fertilized oocyte, e.g., by microinjection or retroviral infection, and allowing
the oocyte to develop in a pseudopregnant female foster animal. Intronic sequences and
polyadenylation signals can also be included in the transgene to increase the efficiency 
of expression of the transgene. A tissue-specific regulatory sequence(s) can be operably 
linked to the transgene to direct expression of the polypeptide of the invention to 
particular cells. Methods for generating transgenic animals via embryo manipulation
and microinjection, particularly animals such as mice, have become conventional in the
art and are described, for example, in U.S. Patent Nos. 4,736,866 and 4,870,009, U.S. 
Patent No. 4,873,191 and in Hogan, Manipulating the Mouse Embryo, Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1986. Similar methods are used for 
production of other transgenic animals. A transgenic founder animal can be identified
based upon the presence of the transgene in its genome and/or expression of mRNA
encoding the transgene in tissues or cells of the animals. A transgenic founder animal 
can then be used to breed additional animals carrying the transgene. Moreover, 
transgenic animals carrying the transgene can further be bred to other transgenic animals 
carrying other transgenes. 

To create a homologous recombinant animal, a vector is prepared which
contains at least a portion of a gene encoding a polypeptide corresponding to a marker of
the invention into which a deletion, addition or substitution has been introduced to
thereby alter, e.g., functionally disrupt, the gene. In a preferred embodiment, the vector
is designed such that, upon homologous recombination, the endogenous gene is
functionally disrupted (i.e., no longer encodes a functional protein; also referred to as a
"knock out" vector). Alternatively, the vector can be designed such that, upon
homologous recombination, the endogenous gene is mutated or otherwise altered but
still encodes functional protein (e.g., the upstream regulatory region can be altered to
thereby alter the expression of the endogenous protein). In the homologous
recombination vector, the altered portion of the gene is flanked at its 5' and 3' ends by
additional nucleic acid of the gene to allow for homologous recombination to occur
between the exogenous gene carried by the vector and an endogenous gene in an
embryonic stem cell. The additional flanking nucleic acid sequences are of sufficient
length for successful homologous recombination with the endogenous gene. Typically,
several kilobases of flanking DNA (both at the 5' and 3' ends) are included in the vector
(see, e.g., Thomas and Capecchi, 1987, Cell 51:503 for a description of homologous
recombination vectors). The vector is introduced into an embryonic stem cell line (e.g.,
by electroporation) and cells in which the introduced gene has homologously
recombined with the endogenous gene are selected (see, e.g., Li et al., 1992, Cell
69:915). The selected cells are then injected into a blastocyst of an animal (e.g., a
mouse) to form aggregation chimeras (see, e.g., Bradley, Teratocarcinomas and
Embryonic Stem Cells: A Practical Approach, Robertson, Ed., IRL, Oxford, 1987, pp.
113-152). A chimeric embryo can then be implanted into a suitable pseudopregnant
female foster animal and the embryo brought to term. Progeny harboring the
homologously recombined DNA in their germ cells can be used to breed animals in
which all cells of the animal contain the homologously recombined DNA by germline
transmission of the transgene. Methods for constructing homologous recombination
vectors and homologous recombinant animals are described further in Bradley (1991)
Current Opinion in Bio/Technology 2:823-829 and in PCT Publication Nos. WO
90/11354, WO 91/01140, WO 92/0968, and WO 93/04169.

In another embodiment, transgenic non-human animals can be produced which
contain selected systems which allow for regulated expression of the transgene. One
example of such a system is the cre/loxP recombinase system of bacteriophage P1. For
a description of the cre/loxP recombinase system, see, e.g., Lakso et al. (1992) Proc.
Natl. Acad. Sci. USA 89:6232-6236. Another example of a recombinase system is the
FLP recombinase system of Saccharomyces cerevisiae (O'Gorman et al., 1991, Science
251:1351-1355). If a cre/loxP recombinase system is used to regulate expression of the
transgene, animals containing transgenes encoding both the Cre recombinase and a
selected protein are required. Such animals can be provided through the construction of
"double" transgenic animals, e.g., by mating two transgenic animals, one containing a 
transgene encoding a selected protein and the other containing a transgene encoding a 
recombinase. 

Clones of the non-human transgenic animals described herein can also be 
produced according to the methods described in Wilmut et al. (1997) Nature 385:810-
813 and PCT Publication Nos. WO 97/07668 and WO 97/07669.



IV. Pharmaceutical Compositions 

The nucleic acid molecules, polypeptides, and antibodies (also referred to herein
as "active compounds") corresponding to a marker of the invention can be incorporated
into pharmaceutical compositions suitable for administration. Such compositions 
typically comprise the nucleic acid molecule, protein, or antibody and a 
pharmaceutically acceptable carrier. As used herein the language "pharmaceutically 
acceptable carrier" is intended to include any and all solvents, dispersion media,
coatings, antibacterial and antifungal agents, isotonic and absorption delaying agents,
and the like, compatible with pharmaceutical administration. The use of such media and 
agents for pharmaceutically active substances is well known in the art. Except insofar as 
any conventional media or agent is incompatible with the active compound, use thereof 
in the compositions is contemplated. Supplementary active compounds can also be
incorporated into the compositions.

The invention includes methods for preparing pharmaceutical compositions for
modulating the expression or activity of a polypeptide or nucleic acid corresponding to a
marker of the invention. Such methods comprise formulating a pharmaceutically
acceptable carrier with an agent which modulates expression or activity of a polypeptide
or nucleic acid corresponding to a marker of the invention. Such compositions can
further include additional active agents. Thus, the invention further includes methods
for preparing a pharmaceutical composition by formulating a pharmaceutically
acceptable carrier with an agent which modulates expression or activity of a polypeptide
or nucleic acid corresponding to a marker of the invention and one or more additional
active compounds.

The invention also provides methods (also referred to herein as
"screening assays") for identifying modulators, i.e., candidate or test compounds or
agents (e.g., peptides, peptidomimetics, peptoids, small molecules or other drugs) which
(a) bind to the marker, or (b) have a modulatory (e.g., stimulatory or inhibitory) effect
on the activity of the marker or, more specifically, (c) have a modulatory effect on the
interactions of the marker with one or more of its natural substrates (e.g., peptide,
protein, hormone, co-factor, or nucleic acid), or (d) have a modulatory effect on the
expression of the marker. Such assays typically comprise a reaction between the marker
and one or more assay components. The other components may be either the test
compound itself, or a combination of test compound and a natural binding partner of the
marker.

The test compounds of the present invention may be obtained from any available
source, including systematic libraries of natural and/or synthetic compounds. Test
compounds may also be obtained by any of the numerous approaches in combinatorial
library methods known in the art, including: biological libraries; peptoid libraries
(libraries of molecules having the functionalities of peptides, but with a novel,
non-peptide backbone which are resistant to enzymatic degradation but which nevertheless
remain bioactive; see, e.g., Zuckermann et al., 1994, J. Med. Chem. 37:2678-85);
spatially addressable parallel solid phase or solution phase libraries; synthetic library
methods requiring deconvolution; the 'one-bead one-compound' library method; and
synthetic library methods using affinity chromatography selection. The biological
library and peptoid library approaches are limited to peptide libraries, while the other
four approaches are applicable to peptide, non-peptide oligomer or small molecule
libraries of compounds (Lam, 1997, Anticancer Drug Des. 12:145).

Examples of methods for the synthesis of molecular libraries can be found in the
art, for example in: DeWitt et al. (1993) Proc. Natl. Acad. Sci. U.S.A. 90:6909; Erb et
al. (1994) Proc. Natl. Acad. Sci. USA 91:11422; Zuckermann et al. (1994) J. Med.
Chem. 37:2678; Cho et al. (1993) Science 261:1303; Carell et al. (1994) Angew. Chem.
Int. Ed. Engl. 33:2059; Carell et al. (1994) Angew. Chem. Int. Ed. Engl. 33:2061; and in
Gallop et al. (1994) J. Med. Chem. 37:1233.

Libraries of compounds may be presented in solution (e.g., Houghten, 1992,
Biotechniques 13:412-421), or on beads (Lam, 1991, Nature 354:82-84), chips (Fodor,
1993, Nature 364:555-556), bacteria and/or spores (Ladner, USP 5,223,409), plasmids
(Cull et al., 1992, Proc. Natl. Acad. Sci. USA 89:1865-1869) or on phage (Scott and Smith,
1990, Science 249:386-390; Devlin, 1990, Science 249:404-406; Cwirla et al., 1990,
Proc. Natl. Acad. Sci. 87:6378-6382; Felici, 1991, J. Mol. Biol. 222:301-310; Ladner,
supra).

In one embodiment, the invention provides assays for screening candidate or test 
compounds which are substrates of a marker or biologically active portion thereof. In 
another embodiment, the invention provides assays for screening candidate or test 
compounds which bind to a marker or biologically active portion thereof. Determining 
the ability of the test compound to directly bind to a marker can be accomplished, for 
example, by coupling the compound with a radioisotope or enzymatic label such that 
binding of the compound to the marker can be determined by detecting the labeled 
marker compound in a complex. For example, compounds (e.g., marker substrates) can 
be labeled with 125I, 35S, 14C, or 3H, either directly or indirectly, and the radioisotope
detected by direct counting of radioemission or by scintillation counting. Alternatively, 
assay components can be enzymatically labeled with, for example, horseradish 
peroxidase, alkaline phosphatase, or luciferase, and the enzymatic label detected by 
determination of conversion of an appropriate substrate to product. 

In another embodiment, the invention provides assays for screening candidate or 
test compounds which modulate the activity of a marker or a biologically active portion 
thereof. In all likelihood, the marker can, in vivo, interact with one or more molecules, 
such as but not limited to, peptides, proteins, hormones, cofactors and nucleic acids. For 
the purposes of this discussion, such cellular and extracellular molecules are referred to 
herein as "binding partners" or marker "substrate". 

One necessary embodiment of the invention in order to facilitate such screening
is the use of the marker to identify its natural in vivo binding partners. There are many
ways to accomplish this which are known to one skilled in the art. One example is the
use of the marker protein as "bait protein" in a two-hybrid assay or three-hybrid assay
(see, e.g., U.S. Patent No. 5,283,317; Zervos et al., 1993, Cell 72:223-232; Madura et al.,
1993, J. Biol. Chem. 268:12046-12054; Bartel et al., 1993, Biotechniques 14:920-924;
Iwabuchi et al., 1993, Oncogene 8:1693-1696; Brent WO94/10300) in order to identify
other proteins which bind to or interact with the marker (binding partners) and,
therefore, are possibly involved in the natural function of the marker. Such marker
binding partners are also likely to be involved in the propagation of signals by the
marker or downstream elements of a marker-mediated signaling pathway. Alternatively,
such marker binding partners may also be found to be inhibitors of the marker.

The two-hybrid system is based on the modular nature of most transcription
factors, which consist of separable DNA-binding and activation domains. Briefly, the
assay utilizes two different DNA constructs. In one construct, the gene that encodes a
marker protein is fused to a gene encoding the DNA binding domain of a known
transcription factor (e.g., GAL-4). In the other construct, a DNA sequence, from a
library of DNA sequences, that encodes an unidentified protein ("prey" or "sample") is
fused to a gene that codes for the activation domain of the known transcription factor. If
the "bait" and the "prey" proteins are able to interact, in vivo, forming a
marker-dependent complex, the DNA-binding and activation domains of the transcription factor
are brought into close proximity. This proximity allows transcription of a reporter gene
(e.g., LacZ) which is operably linked to a transcriptional regulatory site responsive to
the transcription factor. Expression of the reporter gene can be readily detected and cell
colonies containing the functional transcription factor can be isolated and used to obtain
the cloned gene which encodes the protein which interacts with the marker protein.

In a further embodiment, assays may be devised through the use of the invention
for the purpose of identifying compounds which modulate (e.g., affect either positively
or negatively) interactions between a marker and its substrates and/or binding partners.
Such compounds can include, but are not limited to, molecules such as antibodies,
peptides, hormones, oligonucleotides, nucleic acids, and analogs thereof. Such
compounds may also be obtained from any available source, including systematic
libraries of natural and/or synthetic compounds. The preferred assay components for use
in this embodiment are an ovarian cancer marker identified herein, the known binding
partner and/or substrate of same, and the test compound. Test compounds can be
supplied from any source.

The basic principle of the assay systems used to identify compounds that
interfere with the interaction between the marker and its binding partner involves
preparing a reaction mixture containing the marker and its binding partner under
conditions and for a time sufficient to allow the two products to interact and bind, thus
forming a complex. In order to test an agent for inhibitory activity, the reaction mixture
is prepared in the presence and absence of the test compound. The test compound can
be initially included in the reaction mixture, or can be added at a time subsequent to the
addition of the marker and its binding partner. Control reaction mixtures are incubated
without the test compound or with a placebo. The formation of any complexes between
the marker and its binding partner is then detected. The formation of a complex in the
control reaction, but less or no such formation in the reaction mixture containing the test
compound, indicates that the compound interferes with the interaction of the marker and
its binding partner. Conversely, the formation of more complex in the presence of
compound than in the control reaction indicates that the compound may enhance
interaction of the marker and its binding partner.
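
The comparison between the control reaction and the reaction containing the test compound can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name, the use of a single numeric "complex signal" per reaction, and the 10% tolerance band are hypothetical and are not specified in the text above.

```python
def classify_test_compound(control_signal, test_signal, tolerance=0.10):
    """Compare complex formation with and without a test compound.

    control_signal: complex signal measured in the control reaction
    test_signal:    complex signal measured with the test compound present
    tolerance:      hypothetical fractional change treated as "no effect"
    """
    if control_signal <= 0:
        raise ValueError("control reaction must show complex formation")
    # Fractional change in complex formation relative to the control.
    change = (test_signal - control_signal) / control_signal
    if change < -tolerance:
        return "interferes"   # less or no complex than in the control
    if change > tolerance:
        return "enhances"     # more complex than in the control
    return "no clear effect"
```

In practice the tolerance would be chosen from the replicate-to-replicate variability of the particular assay rather than fixed in advance.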

The assay for compounds that interfere with the interaction of the marker with its
binding partner may be conducted in a heterogeneous or homogeneous format.
Heterogeneous assays involve anchoring either the marker or its binding partner onto a
solid phase and detecting complexes anchored to the solid phase at the end of the
reaction. In homogeneous assays, the entire reaction is carried out in a liquid phase. In
either approach, the order of addition of reactants can be varied to obtain different
information about the compounds being tested. For example, test compounds that
interfere with the interaction between the markers and the binding partners (e.g., by
competition) can be identified by conducting the reaction in the presence of the test
substance, i.e., by adding the test substance to the reaction mixture prior to or
simultaneously with the marker and its interactive binding partner. Alternatively, test
compounds that disrupt preformed complexes, e.g., compounds with higher binding
constants that displace one of the components from the complex, can be tested by adding
the test compound to the reaction mixture after complexes have been formed. The
various formats are briefly described below.

In a heterogeneous assay system, either the marker or its binding partner is
anchored onto a solid surface or matrix, while the other corresponding non-anchored
component may be labeled, either directly or indirectly. In practice, microtitre plates are
often utilized for this approach. The anchored species can be immobilized by a number
of methods, either non-covalent or covalent, that are typically well known to one who
practices the art. Non-covalent attachment can often be accomplished simply by coating
the solid surface with a solution of the marker or its binding partner and drying.
Alternatively, an immobilized antibody specific for the assay component to be anchored
can be used for this purpose. Such surfaces can often be prepared in advance and stored.

In related embodiments, a fusion protein can be provided which adds a domain
that allows one or both of the assay components to be anchored to a matrix. For
example, glutathione-S-transferase/marker fusion proteins or
glutathione-S-transferase/binding partner can be adsorbed onto glutathione sepharose beads (Sigma
Chemical, St. Louis, MO) or glutathione derivatized microtiter plates, which are then
combined with the test compound or the test compound and either the non-adsorbed
marker or its binding partner, and the mixture incubated under conditions conducive to
complex formation (e.g., physiological conditions). Following incubation, the beads or
microtiter plate wells are washed to remove any unbound assay components, and the
immobilized complex is assessed either directly or indirectly, for example, as described
above. Alternatively, the complexes can be dissociated from the matrix, and the level of
marker binding or activity determined using standard techniques.

Other techniques for immobilizing proteins on matrices can also be used in the
screening assays of the invention. For example, either a marker or a marker binding
partner can be immobilized utilizing conjugation of biotin and streptavidin. Biotinylated
marker protein or target molecules can be prepared from biotin-NHS
(N-hydroxy-succinimide) using techniques known in the art (e.g., biotinylation kit, Pierce Chemicals,
Rockford, IL), and immobilized in the wells of streptavidin-coated 96 well plates (Pierce
Chemical). In certain embodiments, the protein-immobilized surfaces can be prepared in
advance and stored.

In order to conduct the assay, the corresponding partner of the immobilized assay
component is exposed to the coated surface with or without the test compound. After
the reaction is complete, unreacted assay components are removed (e.g., by washing)
and any complexes formed will remain immobilized on the solid surface. The detection
of complexes anchored on the solid surface can be accomplished in a number of ways.
Where the non-immobilized component is pre-labeled, the detection of label
immobilized on the surface indicates that complexes were formed. Where the
non-immobilized component is not pre-labeled, an indirect label can be used to detect
complexes anchored on the surface; e.g., using a labeled antibody specific for the
initially non-immobilized species (the antibody, in turn, can be directly labeled or
indirectly labeled with, e.g., a labeled anti-Ig antibody). Depending upon the order of
addition of reaction components, test compounds which modulate (inhibit or enhance)
complex formation or which disrupt preformed complexes can be detected.

In an alternate embodiment of the invention, a homogeneous assay may be used.
This is typically a reaction, analogous to those mentioned above, which is conducted in
a liquid phase in the presence or absence of the test compound. The formed complexes
are then separated from unreacted components, and the amount of complex formed is
determined. As mentioned for heterogeneous assay systems, the order of addition of
reactants to the liquid phase can yield information about which test compounds
modulate (inhibit or enhance) complex formation and which disrupt preformed
complexes.

In such a homogeneous assay, the reaction products may be separated from
unreacted assay components by any of a number of standard techniques, including but
not limited to: differential centrifugation, chromatography, electrophoresis and
immunoprecipitation. In differential centrifugation, complexes of molecules may be
separated from uncomplexed molecules through a series of centrifugal steps, due to the
different sedimentation equilibria of complexes based on their different sizes and
densities (see, for example, Rivas, G., and Minton, A.P., Trends Biochem Sci 1993
Aug;18(8):284-7). Standard chromatographic techniques may also be utilized to separate
complexed molecules from uncomplexed ones. For example, gel filtration
chromatography separates molecules based on size, and through the utilization of an
appropriate gel filtration resin in a column format, for example, the relatively larger
complex may be separated from the relatively smaller uncomplexed components.
Similarly, the relatively different charge properties of the complex as compared to the
uncomplexed molecules may be exploited to differentially separate the complex from
the remaining individual reactants, for example through the use of ion-exchange
chromatography resins. Such resins and chromatographic techniques are well known to
one skilled in the art (see, e.g., Heegaard, 1998, J. Mol. Recognit. 11:141-148; Hage and
Tweed, 1997, J. Chromatogr. B. Biomed. Sci. Appl., 699:499-525). Gel electrophoresis
may also be employed to separate complexed molecules from unbound species (see,
e.g., Ausubel et al. (eds.), In: Current Protocols in Molecular Biology, J. Wiley & Sons,
New York. 1999). In this technique, protein or nucleic acid complexes are separated
based on size or charge, for example. In order to maintain the binding interaction during
the electrophoretic process, nondenaturing gels in the absence of reducing agent are
typically preferred, but conditions appropriate to the particular interactants will be well
known to one skilled in the art. Immunoprecipitation is another common technique
utilized for the isolation of a protein-protein complex from solution (see, e.g., Ausubel et
al. (eds.), In: Current Protocols in Molecular Biology, J. Wiley & Sons, New York.
1999). In this technique, all proteins binding to an antibody specific to one of the
binding molecules are precipitated from solution by conjugating the antibody to a
polymer bead that may be readily collected by centrifugation. The bound assay
components are released from the beads (through a specific proteolysis event or other
technique well known in the art which will not disturb the protein-protein interaction in
the complex), and a second immunoprecipitation step is performed, this time utilizing
antibodies specific for the correspondingly different interacting assay component. In this
manner, only formed complexes should remain attached to the beads. Variations in
complex formation in both the presence and the absence of a test compound can be
compared, thus offering information about the ability of the compound to modulate
interactions between the marker and its binding partner.

Also within the scope of the present invention are methods for direct detection of
interactions between the marker and its natural binding partner and/or a test compound
in a homogeneous or heterogeneous assay system without further sample manipulation.
For example, the technique of fluorescence energy transfer may be utilized (see, e.g.,
Lakowicz et al., U.S. Patent No. 5,631,169; Stavrianopoulos et al., U.S. Patent No.
4,868,103). Generally, this technique involves the addition of a fluorophore label on a
first 'donor' molecule (e.g., marker or test compound) such that its emitted fluorescent
energy will be absorbed by a fluorescent label on a second, 'acceptor' molecule (e.g.,
marker or test compound), which in turn is able to fluoresce due to the absorbed energy.
Alternately, the 'donor' protein molecule may simply utilize the natural fluorescent
energy of tryptophan residues. Labels are chosen that emit different wavelengths of
light, such that the 'acceptor' molecule label may be differentiated from that of the
'donor'. Since the efficiency of energy transfer between the labels is related to the
distance separating the molecules, spatial relationships between the molecules can be
assessed. In a situation in which binding occurs between the molecules, the fluorescent
emission of the 'acceptor' molecule label in the assay should be maximal. An FET
binding event can be conveniently measured through standard fluorometric detection
means well known in the art (e.g., using a fluorimeter). A test substance which either
enhances or hinders participation of one of the species in the preformed complex will
result in the generation of a signal variant to that of background. In this way, test
substances that modulate interactions between a marker and its binding partner can be
identified in controlled assays.
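
The dependence of energy transfer on donor-acceptor distance can be made concrete with the standard Förster relation, E = 1 / (1 + (r/R0)^6). The text above states only that efficiency is related to distance; the use of the Förster model, the function name, and the example distances below are assumptions made for illustration.

```python
def fret_efficiency(distance_nm, forster_radius_nm):
    """Energy transfer efficiency under the Forster model.

    distance_nm:       donor-acceptor separation r
    forster_radius_nm: R0, the separation at which efficiency is 50%
                       (depends on the specific donor/acceptor pair)
    """
    if distance_nm < 0 or forster_radius_nm <= 0:
        raise ValueError("distance must be >= 0 and R0 must be > 0")
    # Efficiency falls off with the sixth power of r / R0.
    return 1.0 / (1.0 + (distance_nm / forster_radius_nm) ** 6)


# Hypothetical label pair with R0 = 5 nm: bound partners (about 2.5 nm apart)
# transfer energy efficiently, while unbound molecules (10 nm apart) barely do.
bound = fret_efficiency(2.5, 5.0)
unbound = fret_efficiency(10.0, 5.0)
```

Because of the sixth-power dependence, acceptor emission changes sharply as the labeled molecules associate or dissociate, which is what makes the signal useful as a binding readout.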

In another embodiment, modulators of marker expression are identified in a
method wherein a cell is contacted with a candidate compound and the expression of
mRNA or protein, corresponding to a marker in the cell, is determined. The level of
expression of mRNA or protein in the presence of the candidate compound is compared
to the level of expression of mRNA or protein in the absence of the candidate
compound. The candidate compound can then be identified as a modulator of marker
expression based on this comparison. For example, when expression of marker mRNA
or protein is greater (statistically significantly greater) in the presence of the candidate
compound than in its absence, the candidate compound is identified as a stimulator of
marker mRNA or protein expression. Conversely, when expression of marker mRNA
or protein is less (statistically significantly less) in the presence of the candidate
compound than in its absence, the candidate compound is identified as an inhibitor of
marker mRNA or protein expression. The level of marker mRNA or protein expression
in the cells can be determined by methods described herein for detecting marker mRNA
or protein.
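
One way to make the "statistically significantly greater or less" comparison concrete is an exact permutation test on replicate expression measurements. The text above does not prescribe a particular statistical test; the choice of a permutation test, the function names, and the 0.05 threshold are assumptions for this sketch.

```python
from itertools import combinations
from statistics import mean


def permutation_p_value(treated, untreated):
    """Exact two-sided permutation test on the difference of group means."""
    pooled = list(treated) + list(untreated)
    indices = range(len(pooled))
    observed = abs(mean(treated) - mean(untreated))
    total = 0
    extreme = 0
    # Enumerate every way of relabeling the pooled values into two groups.
    for chosen in combinations(indices, len(treated)):
        chosen_set = set(chosen)
        group_a = [pooled[i] for i in chosen]
        group_b = [pooled[i] for i in indices if i not in chosen_set]
        total += 1
        if abs(mean(group_a) - mean(group_b)) >= observed - 1e-12:
            extreme += 1
    return extreme / total


def classify_modulator(treated, untreated, alpha=0.05):
    """Label a candidate compound from expression with and without it."""
    if permutation_p_value(treated, untreated) >= alpha:
        return "no significant effect"
    return "stimulator" if mean(treated) > mean(untreated) else "inhibitor"
```

With this test, at least four replicates per group are needed before a two-sided p-value below 0.05 is attainable, since three against three yields only 20 possible relabelings.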

In another aspect, the invention pertains to a combination of two or more of the
assays described herein. For example, a modulating agent can be identified using a
cell-based or a cell free assay, and the ability of the agent to modulate the activity of a
marker protein can be further confirmed in vivo, e.g., in a whole animal model for
cellular transformation and/or tumorigenesis.

This invention further pertains to novel agents identified by the above-described
screening assays. Accordingly, it is within the scope of this invention to further use an
agent identified as described herein in an appropriate animal model. For example, an
agent identified as described herein (e.g., a marker modulating agent, an antisense
marker nucleic acid molecule, a marker-specific antibody, or a marker-binding
partner) can be used in an animal model to determine the efficacy, toxicity, or side
effects of treatment with such an agent. Alternatively, an agent identified as described
herein can be used in an animal model to determine the mechanism of action of such an
agent. Furthermore, this invention pertains to uses of novel agents identified by the
above-described screening assays for treatments as described herein.

It is understood that appropriate doses of small molecule agents and protein or
polypeptide agents depend upon a number of factors within the knowledge of the
ordinarily skilled physician, veterinarian, or researcher. The dose(s) of these agents will
vary, for example, depending upon the identity, size, and condition of the subject or
sample being treated, further depending upon the route by which the composition is to
be administered, if applicable, and the effect which the practitioner desires the agent to
have upon the nucleic acid or polypeptide of the invention. Exemplary doses of a small
molecule include milligram or microgram amounts per kilogram of subject or sample
weight (e.g., about 1 microgram per kilogram to about 500 milligrams per kilogram,
about 100 micrograms per kilogram to about 5 milligrams per kilogram, or about 1
microgram per kilogram to about 50 micrograms per kilogram). Exemplary doses of a
protein or polypeptide include gram, milligram or microgram amounts per kilogram of
subject or sample weight (e.g., about 1 microgram per kilogram to about 5 grams per
kilogram, about 100 micrograms per kilogram to about 500 milligrams per kilogram, or
about 1 milligram per kilogram to about 50 milligrams per kilogram). It is furthermore
understood that appropriate doses of one of these agents depend upon the potency of the
agent with respect to the expression or activity to be modulated. Such appropriate doses
can be determined using the assays described herein. When one or more of these agents
is to be administered to an animal (e.g., a human) in order to modulate expression or
activity of a polypeptide or nucleic acid of the invention, a physician, veterinarian, or
researcher can, for example, prescribe a relatively low dose at first, subsequently
increasing the dose until an appropriate response is obtained. In addition, it is
understood that the specific dose level for any particular animal subject will depend
upon a variety of factors including the activity of the specific agent employed, the age,
body weight, general health, gender, and diet of the subject, the time of administration,
the route of administration, the rate of excretion, any drug combination, and the degree
of expression or activity to be modulated.
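
The per-kilogram ranges above translate into absolute amounts by simple multiplication with the subject or sample weight. The following sketch shows that arithmetic only; the function names, the unit table, and the 60 kg example weight are hypothetical, and the result is not a dosing recommendation.

```python
# Conversion factors from each dose unit to milligrams.
UNIT_TO_MG = {"ug": 0.001, "mg": 1.0, "g": 1000.0}


def total_dose_mg(dose_per_kg, unit, weight_kg):
    """Convert a per-kilogram dose into a total amount in milligrams."""
    if weight_kg <= 0:
        raise ValueError("weight must be positive")
    return dose_per_kg * UNIT_TO_MG[unit] * weight_kg


def dose_range_mg(low, low_unit, high, high_unit, weight_kg):
    """Return (lowest, highest) total dose in milligrams for a weight."""
    return (
        total_dose_mg(low, low_unit, weight_kg),
        total_dose_mg(high, high_unit, weight_kg),
    )


# Exemplary small-molecule range from the text: about 100 micrograms/kg
# to about 5 milligrams/kg, applied to a hypothetical 60 kg subject.
low_mg, high_mg = dose_range_mg(100, "ug", 5, "mg", 60.0)
```

For the 60 kg example this works out to roughly 6 mg at the low end and 300 mg at the high end of that range.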

A pharmaceutical composition of the invention is formulated to be compatible
with its intended route of administration. Examples of routes of administration include
parenteral, e.g., intravenous, intradermal, subcutaneous, oral (e.g., inhalation),
transdermal (topical), transmucosal, and rectal administration. Solutions or suspensions
used for parenteral, intradermal, or subcutaneous application can include the following
components: a sterile diluent such as water for injection, saline solution, fixed oils,
polyethylene glycols, glycerine, propylene glycol or other synthetic solvents;
antibacterial agents such as benzyl alcohol or methyl parabens; antioxidants such as
ascorbic acid or sodium bisulfite; chelating agents such as ethylenediamine-tetraacetic
acid; buffers such as acetates, citrates or phosphates and agents for the adjustment of
tonicity such as sodium chloride or dextrose. pH can be adjusted with acids or bases,
such as hydrochloric acid or sodium hydroxide. The parenteral preparation can be
enclosed in ampules, disposable syringes or multiple dose vials made of glass or plastic.

Pharmaceutical compositions suitable for injectable use include sterile aqueous
solutions (where water soluble) or dispersions and sterile powders for the
extemporaneous preparation of sterile injectable solutions or dispersions. For
intravenous administration, suitable carriers include physiological saline, bacteriostatic
water, Cremophor EL (BASF; Parsippany, NJ) or phosphate buffered saline (PBS). In
all cases, the composition must be sterile and should be fluid to the extent that easy
syringability exists. It must be stable under the conditions of manufacture and storage
and must be preserved against the contaminating action of microorganisms such as
bacteria and fungi. The carrier can be a solvent or dispersion medium containing, for
example, water, ethanol, polyol (for example, glycerol, propylene glycol, and liquid
polyethylene glycol, and the like), and suitable mixtures thereof. The proper fluidity can
be maintained, for example, by the use of a coating such as lecithin, by the maintenance
of the required particle size in the case of dispersion and by the use of surfactants.
Prevention of the action of microorganisms can be achieved by various antibacterial and 
antifungal agents, for example, parabens, chlorobutanol, phenol, ascorbic acid, 
thimerosal, and the like. In many cases, it will be preferable to include isotonic agents, 
for example, sugars, polyalcohols such as mannitol, sorbitol, or sodium chloride in the
composition. Prolonged absorption of the injectable compositions can be brought about 
by including in the composition an agent which delays absorption, for example, 
aluminum monostearate and gelatin. 

Sterile injectable solutions can be prepared by incorporating the active
compound (e.g., a polypeptide or antibody) in the required amount in an appropriate
solvent with one or a combination of ingredients enumerated above, as required,
followed by filtered sterilization. Generally, dispersions are prepared by incorporating
the active compound into a sterile vehicle which contains a basic dispersion medium,
and then incorporating the required other ingredients from those enumerated above. In
the case of sterile powders for the preparation of sterile injectable solutions, the
preferred methods of preparation are vacuum drying and freeze-drying, which yield a
powder of the active ingredient plus any additional desired ingredient from a previously
sterile-filtered solution thereof.

Oral compositions generally include an inert diluent or an edible carrier. They
can be enclosed in gelatin capsules or compressed into tablets. For the purpose of oral
therapeutic administration, the active compound can be incorporated with excipients and
used in the form of tablets, troches, or capsules. Oral compositions can also be prepared
using a fluid carrier for use as a mouthwash, wherein the compound in the fluid carrier is
applied orally and swished and expectorated or swallowed.

Pharmaceutically compatible binding agents and/or adjuvant materials can be
included as part of the composition. The tablets, pills, capsules, troches, and the like can
contain any of the following ingredients, or compounds of a similar nature: a binder
such as microcrystalline cellulose, gum tragacanth or gelatin; an excipient such as starch
or lactose; a disintegrating agent such as alginic acid, Primogel, or corn starch; a
lubricant such as magnesium stearate or Sterotes; a glidant such as colloidal silicon
dioxide; a sweetening agent such as sucrose or saccharin; or a flavoring agent such as
peppermint, methyl salicylate, or orange flavoring.

For administration by inhalation, the compounds are delivered in the form of an
aerosol spray from a pressurized container or dispenser which contains a suitable
propellant, e.g., a gas such as carbon dioxide, or a nebulizer.

Systemic administration can also be by transmucosal or transdermal means. For
transmucosal or transdermal administration, penetrants appropriate to the barrier to be
permeated are used in the formulation. Such penetrants are generally known in the art,
and include, for example, for transmucosal administration, detergents, bile salts, and
fusidic acid derivatives. Transmucosal administration can be accomplished through the
use of nasal sprays or suppositories. For transdermal administration, the active
compounds are formulated into ointments, salves, gels, or creams as generally known in
the art.

The compounds can also be prepared in the form of suppositories (e.g., with
conventional suppository bases such as cocoa butter and other glycerides) or retention 
enemas for rectal delivery. 

In one embodiment, the active compounds are prepared with carriers that will
protect the compound against rapid elimination from the body, such as a controlled
release formulation, including implants and microencapsulated delivery systems.
Biodegradable, biocompatible polymers can be used, such as ethylene vinyl acetate,
polyanhydrides, polyglycolic acid, collagen, polyorthoesters, and polylactic acid.
Methods for preparation of such formulations will be apparent to those skilled in the art.
The materials can also be obtained commercially from Alza Corporation and Nova
Pharmaceuticals, Inc. Liposomal suspensions (including liposomes having monoclonal
antibodies incorporated therein or thereon) can also be used as pharmaceutically
acceptable carriers. These can be prepared according to methods known to those skilled
in the art, for example, as described in U.S. Patent No. 4,522,811.

It is especially advantageous to formulate oral or parenteral compositions in 
dosage unit form for ease of administration and uniformity of dosage. Dosage unit form 
as used herein refers to physically discrete units suited as unitary dosages for the subject 
to be treated; each unit containing a predetermined quantity of active compound 
calculated to produce the desired therapeutic effect in association with the required 
pharmaceutical carrier. The specification for the dosage unit forms of the invention is 
dictated by and directly dependent on the unique characteristics of the active compound 
and the particular therapeutic effect to be achieved, and the limitations inherent in the art 
of compounding such an active compound for the treatment of individuals. 

For antibodies, the preferred dosage is 0.1 mg/kg to 100 mg/kg of body weight 
(generally 10 mg/kg to 20 mg/kg). If the antibody is to act in the brain, a dosage of 50 
mg/kg to 100 mg/kg is usually appropriate. Generally, partially human antibodies and 
fully human antibodies have a longer half-life within the human body than other 
antibodies. Accordingly, lower dosages and less frequent administration are often 
possible. Modifications such as lipidation can be used to stabilize antibodies and to 
enhance uptake and tissue penetration (e.g., into the ovarian epithelium). A method for 
lipidation of antibodies is described by Cruikshank et al. (1997) J. Acquired Immune 
Deficiency Syndromes and Human Retrovirology 14:193. 

The nucleic acid molecules corresponding to a marker of the invention can be 
inserted into vectors and used as gene therapy vectors. Gene therapy vectors can be 
delivered to a subject by, for example, intravenous injection, local administration (U.S. 
Patent 5,328,470), or by stereotactic injection (see, e.g., Chen et al., 1994, Proc. Natl. 
Acad. Sci. USA 91:3054-3057). The pharmaceutical preparation of the gene therapy 
vector can include the gene therapy vector in an acceptable diluent, or can comprise a 
slow release matrix in which the gene delivery vehicle is imbedded. Alternatively, 
where the complete gene delivery vector can be produced intact from recombinant cells, 
e.g., retroviral vectors, the pharmaceutical preparation can include one or more cells 
which produce the gene delivery system. 

The pharmaceutical compositions can be included in a container, pack, or 
dispenser together with instructions for administration. 

V. Predictive Medicine 

The present invention pertains to the field of predictive medicine in which 
diagnostic assays, prognostic assays, pharmacogenomics, and monitoring clinical trials 
are used for prognostic (predictive) purposes to thereby treat an individual 
prophylactically. Accordingly, one aspect of the present invention relates to diagnostic 
assays for determining the level of expression of polypeptides or nucleic acids 
corresponding to one or more markers of the invention, in order to determine whether 
an individual is at risk of developing ovarian cancer. Such assays can be used for 
prognostic or predictive purposes to thereby prophylactically treat an individual prior to 
the onset of the cancer. 

Yet another aspect of the invention pertains to monitoring the influence of agents 
(e.g., drugs or other compounds administered either to inhibit ovarian cancer or to treat 
or prevent any other disorder (i.e., in order to understand any ovarian carcinogenic 
effects that such treatment may have)) on the expression or activity of a marker of the 
invention in clinical trials. These and other agents are described in further detail in the 
following sections. 

A. Diagnostic Assays 

An exemplary method for detecting the presence or absence of a polypeptide or 
nucleic acid corresponding to a marker of the invention in a biological sample involves 
obtaining a biological sample (e.g., an ovary-associated body fluid) from a test subject 
and contacting the biological sample with a compound or an agent capable of detecting 
the polypeptide or nucleic acid (e.g., mRNA, genomic DNA, or cDNA). The detection 
methods of the invention can thus be used to detect mRNA, protein, cDNA, or genomic 
DNA, for example, in a biological sample in vitro as well as in vivo. For example, in 
vitro techniques for detection of mRNA include Northern hybridizations and in situ 
hybridizations. In vitro techniques for detection of a polypeptide corresponding to a 
marker of the invention include enzyme linked immunosorbent assays (ELISAs), 
Western blots, immunoprecipitations and immunofluorescence. In vitro techniques for 
detection of genomic DNA include Southern hybridizations. Furthermore, in vivo 
techniques for detection of a polypeptide corresponding to a marker of the invention 
include introducing into a subject a labeled antibody directed against the polypeptide. 
For example, the antibody can be labeled with a radioactive marker whose presence and 
location in a subject can be detected by standard imaging techniques. 

A general principle of such diagnostic and prognostic assays involves preparing 
a sample or reaction mixture that may contain a marker, and a probe, under appropriate 
conditions and for a time sufficient to allow the marker and probe to interact and bind, 
thus forming a complex that can be removed and/or detected in the reaction mixture. 
These assays can be conducted in a variety of ways. 

For example, one method to conduct such an assay would involve anchoring the 
marker or probe onto a solid phase support, also referred to as a substrate, and detecting 
target marker/probe complexes anchored on the solid phase at the end of the reaction. 
In one embodiment of such a method, a sample from a subject, which is to be assayed 
for presence and/or concentration of marker, can be anchored onto a carrier or solid 
phase support. In another embodiment, the reverse situation is possible, in which the 
probe can be anchored to a solid phase and a sample from a subject can be allowed to 
react as an unanchored component of the assay. 

There are many established methods for anchoring assay components to a solid 
phase. These include, without limitation, marker or probe molecules which are 
immobilized through conjugation of biotin and streptavidin. Such biotinylated assay 
components can be prepared from biotin-NHS (N-hydroxy-succinimide) using 
techniques known in the art (e.g., biotinylation kit, Pierce Chemical, Rockford, IL), and 
immobilized in the wells of streptavidin-coated 96 well plates (Pierce Chemical). In 
certain embodiments, the surfaces with immobilized assay components can be prepared 
in advance and stored. 

Other suitable carriers or solid phase supports for such assays include any 
material capable of binding the class of molecule to which the marker or probe belongs. 
Well-known supports or carriers include, but are not limited to, glass, polystyrene, 
nylon, polypropylene, polyethylene, dextran, amylases, natural and modified 
celluloses, polyacrylamides, gabbros, and magnetite. 

In order to conduct assays with the above mentioned approaches, the non-immobilized 
component is added to the solid phase upon which the second component 
is anchored. After the reaction is complete, uncomplexed components may be removed 
(e.g., by washing) under conditions such that any complexes formed will remain 
immobilized upon the solid phase. The detection of marker/probe complexes anchored 
to the solid phase can be accomplished by a number of methods outlined herein. 

In a preferred embodiment, the probe, when it is the unanchored assay 
component, can be labeled for the purpose of detection and readout of the assay, either 
directly or indirectly, with detectable labels discussed herein and which are well-known 
to one skilled in the art. 

It is also possible to directly detect marker/probe complex formation without 
further manipulation or labeling of either component (marker or probe), for example by 
utilizing the technique of fluorescence energy transfer (see, for example, Lakowicz et 
al., U.S. Patent No. 5,631,169; Stavrianopoulos et al., U.S. Patent No. 4,868,103). A 
fluorophore label on the first, 'donor' molecule is selected such that, upon excitation 
with incident light of appropriate wavelength, its emitted fluorescent energy will be 
absorbed by a fluorescent label on a second 'acceptor' molecule, which in turn is able to 
fluoresce due to the absorbed energy. Alternately, the 'donor' protein molecule may 
simply utilize the natural fluorescent energy of tryptophan residues. Labels are chosen 
that emit different wavelengths of light, such that the 'acceptor' molecule label may be 
differentiated from that of the 'donor'. Since the efficiency of energy transfer between 
the labels is related to the distance separating the molecules, spatial relationships 
between the molecules can be assessed. In a situation in which binding occurs between 
the molecules, the fluorescent emission of the 'acceptor' molecule label in the assay 
should be maximal. An FET binding event can be conveniently measured through 
standard fluorometric detection means well known in the art (e.g., using a fluorimeter). 
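The distance dependence described above is commonly modeled with the Förster relation, E = 1 / (1 + (r / R0)^6), where r is the donor-acceptor separation and R0 is the separation at which transfer is 50% efficient. The specification does not state this formula; it is used here as an assumption, and the function name and the 5 nm value for R0 in the usage line are purely illustrative. A minimal sketch in Python:

```python
def fret_efficiency(distance_nm: float, forster_radius_nm: float) -> float:
    """Energy-transfer efficiency under the Forster relation.

    E = 1 / (1 + (r / R0)**6). This model is an assumption of the sketch;
    the source only says that efficiency depends on distance.
    """
    if distance_nm < 0 or forster_radius_nm <= 0:
        raise ValueError("distance must be >= 0 and Forster radius must be > 0")
    ratio = distance_nm / forster_radius_nm
    return 1.0 / (1.0 + ratio ** 6)


# Illustrative use: efficiency falls steeply as donor and acceptor move apart.
for r in (2.0, 5.0, 10.0):
    print(r, round(fret_efficiency(r, forster_radius_nm=5.0), 4))
```

Under this model a bound donor/acceptor pair (small r) gives near-maximal acceptor emission, which is consistent with the statement that acceptor fluorescence should be maximal when binding occurs.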

In another embodiment, determination of the ability of a probe to recognize a 
marker can be accomplished without labeling either assay component (probe or marker) 
by utilizing a technology such as real-time Biomolecular Interaction Analysis (BIA) 
(see, e.g., Sjolander, S. and Urbaniczky, C., 1991, Anal. Chem. 63:2338-2345 and 
Szabo et al., 1995, Curr. Opin. Struct. Biol. 5:699-705). As used herein, "BIA" or 
"surface plasmon resonance" is a technology for studying biospecific interactions in real 
time, without labeling any of the interactants (e.g., BIAcore). Changes in the mass at the 
binding surface (indicative of a binding event) result in alterations of the refractive index 
of light near the surface (the optical phenomenon of surface plasmon resonance (SPR)), 
resulting in a detectable signal which can be used as an indication of real-time reactions 
between biological molecules. 

Alternatively, in another embodiment, analogous diagnostic and prognostic 
assays can be conducted with marker and probe as solutes in a liquid phase. In such an 
assay, the complexed marker and probe are separated from uncomplexed components by 
any of a number of standard techniques, including but not limited to: differential 
centrifugation, chromatography, electrophoresis and immunoprecipitation. In 
differential centrifugation, marker/probe complexes may be separated from 
uncomplexed assay components through a series of centrifugal steps, due to the different 
sedimentation equilibria of complexes based on their different sizes and densities (see, 
for example, Rivas, G., and Minton, A.P., 1993, Trends Biochem Sci. 18(8):284-7). 
Standard chromatographic techniques may also be utilized to separate complexed 
molecules from uncomplexed ones. For example, gel filtration chromatography 
separates molecules based on size, and through the utilization of an appropriate gel 
filtration resin in a column format, for example, the relatively larger complex may be 
separated from the relatively smaller uncomplexed components. Similarly, the 
relatively different charge properties of the marker/probe complex as compared to the 
uncomplexed components may be exploited to differentiate the complex from 
uncomplexed components, for example through the utilization of ion-exchange 
chromatography resins. Such resins and chromatographic techniques are well known to 
one skilled in the art (see, e.g., Heegaard, N.H., 1998, J. Mol. Recognit. Winter 
11(1-6):141-8; Hage, D.S., and Tweed, S.A., J. Chromatogr. B Biomed. Sci. Appl. 1997 Oct 
10;699(1-2):499-525). Gel electrophoresis may also be employed to separate 
complexed assay components from unbound components (see, e.g., Ausubel et al., ed., 
Current Protocols in Molecular Biology, John Wiley & Sons, New York, 1987-1999). 
In this technique, protein or nucleic acid complexes are separated based on size or 
charge, for example. In order to maintain the binding interaction during the 
electrophoretic process, non-denaturing gel matrix materials and conditions in the 
absence of reducing agent are typically preferred. Conditions appropriate to the 
particular assay and components thereof will be well known to one skilled in the art. 

In a particular embodiment, the level of mRNA corresponding to the marker can 
be determined both by in situ and by in vitro formats in a biological sample using 
methods known in the art. The term "biological sample" is intended to include tissues, 
cells, biological fluids and isolates thereof, isolated from a subject, as well as tissues, 
cells and fluids present within a subject. Many expression detection methods use 
isolated RNA. For in vitro methods, any RNA isolation technique that does not select 
against the isolation of mRNA can be utilized for the purification of RNA from ovarian 
cells (see, e.g., Ausubel et al., ed., Current Protocols in Molecular Biology, John Wiley 
& Sons, New York 1987-1999). Additionally, large numbers of tissue samples can 
readily be processed using techniques well known to those of skill in the art, such as, for 
example, the single-step RNA isolation process of Chomczynski (1989, U.S. Patent No. 
4,843,155). 

The isolated mRNA can be used in hybridization or amplification assays that 
include, but are not limited to, Southern or Northern analyses, polymerase chain reaction 
analyses and probe arrays. One preferred diagnostic method for the detection of mRNA 
levels involves contacting the isolated mRNA with a nucleic acid molecule (probe) that 
can hybridize to the mRNA encoded by the gene being detected. The nucleic acid probe 
can be, for example, a full-length cDNA, or a portion thereof, such as an oligonucleotide 
of at least 7, 15, 30, 50, 100, 250 or 500 nucleotides in length and sufficient to 
specifically hybridize under stringent conditions to an mRNA or genomic DNA encoding 
a marker of the present invention. Other suitable probes for use in the diagnostic assays 
of the invention are described herein. Hybridization of an mRNA with the probe 
indicates that the marker in question is being expressed. 
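As a rough computational analogue of the probe criterion above, the following Python sketch checks whether an oligonucleotide probe meets a minimum length and is perfectly complementary to some stretch of a target sequence. Perfect reverse-complementarity is used here only as a crude stand-in for "specifically hybridize under stringent conditions"; real hybridization tolerates mismatches and depends on temperature and salt. The function names, the DNA-alphabet representation of the mRNA, and the example sequence are all hypothetical.

```python
# Complement table for the DNA alphabet; the mRNA is written in its
# cDNA-sense form (T instead of U) to keep the sketch simple.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def probe_can_pair(target_sense: str, probe: str, min_length: int = 15) -> bool:
    """True if the probe is long enough and pairs perfectly with the target.

    min_length mirrors one of the lengths listed in the text (7, 15, 30, ...);
    exact matching is a simplifying assumption, not the patent's test.
    """
    if len(probe) < min_length:
        return False
    return reverse_complement(probe) in target_sense.upper()


# Illustrative use with a made-up target sequence.
target = "ATGGCCATTGTAATGGGCCGCTGAAAGGGTGCCCGATAG"
probe = reverse_complement(target[5:25])  # 20-nt antisense probe
print(probe_can_pair(target, probe))
```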

In one format, the mRNA is immobilized on a solid surface and contacted with a 
probe, for example by running the isolated mRNA on an agarose gel and transferring the 
mRNA from the gel to a membrane, such as nitrocellulose. In an alternative format, the 
probe(s) are immobilized on a solid surface and the mRNA is contacted with the 
probe(s), for example, in an Affymetrix gene chip array. A skilled artisan can readily 
adapt known mRNA detection methods for use in detecting the level of mRNA encoded 
by the markers of the present invention. 

An alternative method for determining the level of mRNA corresponding to a 
marker of the present invention in a sample involves the process of nucleic acid 
amplification, e.g., by rtPCR (the experimental embodiment set forth in Mullis, 1987, 
U.S. Patent No. 4,683,202), ligase chain reaction (Barany, 1991, Proc. Natl. Acad. Sci. 
USA, 88:189-193), self sustained sequence replication (Guatelli et al., 1990, Proc. Natl. 
Acad. Sci. USA 87:1874-1878), transcriptional amplification system (Kwoh et al., 1989, 
Proc. Natl. Acad. Sci. USA 86:1173-1177), Q-Beta Replicase (Lizardi et al., 1988, 
Bio/Technology 6:1197), rolling circle replication (Lizardi et al., U.S. Patent No. 
5,854,033) or any other nucleic acid amplification method, followed by the detection of 
the amplified molecules using techniques well known to those of skill in the art. These 
detection schemes are especially useful for the detection of nucleic acid molecules if 
such molecules are present in very low numbers. As used herein, amplification primers 
are defined as being a pair of nucleic acid molecules that can anneal to 5' or 3' regions 
of a gene (plus and minus strands, respectively, or vice-versa) and contain a short region 
in between. In general, amplification primers are from about 10 to 30 nucleotides in 
length and flank a region from about 50 to 200 nucleotides in length. Under appropriate 
conditions and with appropriate reagents, such primers permit the amplification of a 
nucleic acid molecule comprising the nucleotide sequence flanked by the primers. 
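The size ranges given above for amplification primers (about 10 to 30 nucleotides) and for the flanked region (about 50 to 200 nucleotides) can be checked mechanically for a candidate primer pair. The Python sketch below is one way to do so under simplifying assumptions: the template is given as a single sense strand, primers must match exactly, and the function names and sequences are hypothetical.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def flanked_region_length(template: str, forward: str, reverse: str):
    """Length of the region between the two primer sites, or None.

    The forward primer is searched on the sense strand; the reverse primer
    anneals to the sense strand, so its reverse complement is searched
    downstream of the forward primer site.
    """
    template = template.upper()
    start = template.find(forward.upper())
    if start == -1:
        return None
    forward_end = start + len(forward)
    reverse_site = template.find(reverse_complement(reverse), forward_end)
    if reverse_site == -1:
        return None
    return reverse_site - forward_end


def primers_in_typical_ranges(template: str, forward: str, reverse: str) -> bool:
    """Check the 10-30 nt primer length and 50-200 nt flanked-region ranges."""
    length = flanked_region_length(template, forward, reverse)
    if length is None:
        return False
    primers_ok = all(10 <= len(p) <= 30 for p in (forward, reverse))
    return primers_ok and 50 <= length <= 200
```

The text says "about", so a practical tool would likely treat these bounds as soft limits rather than the hard cutoffs used in this sketch.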

For in situ methods, mRNA does not need to be isolated from the ovarian cells 
prior to detection. In such methods, a cell or tissue sample is prepared/processed using 
known histological methods. The sample is then immobilized on a support, typically a 
glass slide, and then contacted with a probe that can hybridize to mRNA that encodes 
the marker. 

As an alternative to making determinations based on the absolute expression 
level of the marker, determinations may be based on the normalized expression level of 
the marker. Expression levels are normalized by correcting the absolute expression 
level of a marker by comparing its expression to the expression of a gene that is not a 
marker, e.g., a housekeeping gene that is constitutively expressed. Suitable genes for 
normalization include housekeeping genes such as the actin gene, or epithelial cell- 
specific genes. This normalization allows the comparison of the expression level in one 
sample, e.g., a patient sample, to another sample, e.g., a non-ovarian cancer sample, or 
between samples from different sources. 
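One simple way to realize the normalization described above is to divide the marker's absolute expression level by the level of the reference (e.g., housekeeping) gene measured in the same sample. The text does not prescribe a specific formula, so the ratio used in this Python sketch is an assumption, and the function name and numbers are hypothetical.

```python
def normalized_expression(marker_level: float, reference_level: float) -> float:
    """Marker level expressed relative to a reference gene in the same sample.

    Using a plain ratio is an assumption of this sketch; the source only says
    the marker level is corrected by comparison with a non-marker gene.
    """
    if reference_level <= 0:
        raise ValueError("reference gene level must be positive")
    return marker_level / reference_level


# Two samples loaded with different amounts of RNA become comparable:
patient = normalized_expression(marker_level=400.0, reference_level=100.0)
control = normalized_expression(marker_level=50.0, reference_level=50.0)
print(patient, control)
```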

Alternatively, the expression level can be provided as a relative expression level. 
To determine a relative expression level of a marker, the level of expression of the 
marker is determined for 10 or more samples of normal versus cancer cell isolates, 
preferably 50 or more samples, prior to the determination of the expression level for the 
sample in question. The mean expression level of each of the genes assayed in the 
larger number of samples is determined and this is used as a baseline expression level 
for the marker. The expression level of the marker determined for the test sample 
(absolute level of expression) is then divided by the mean expression value obtained for 
that marker. This provides a relative expression level. 

Preferably, the samples used in the baseline determination will be from ovarian 
cancer or from non-ovarian cancer cells of ovarian tissue. The choice of the cell source 
is dependent on the use of the relative expression level. Using expression found in 
normal tissues as a mean expression score aids in validating whether the marker assayed 
is ovarian specific (versus normal cells). In addition, as more data is accumulated, the 
mean expression value can be revised, providing improved relative expression values 
based on accumulated data. Expression data from ovarian cells provides a means for 
grading the severity of the ovarian cancer state. 
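The relative expression calculation described above (mean of a baseline set of at least 10 samples, then test level divided by that mean) can be sketched in Python as follows. The function names and sample values are hypothetical; the minimum of 10 baseline samples follows the text, and the baseline can simply be recomputed as more data accumulate.

```python
from statistics import mean


def baseline_expression(baseline_levels, min_samples: int = 10) -> float:
    """Mean expression of the marker across the baseline samples."""
    if len(baseline_levels) < min_samples:
        raise ValueError(f"need at least {min_samples} baseline samples")
    return mean(baseline_levels)


def relative_expression(test_level: float, baseline_levels, min_samples: int = 10) -> float:
    """Test-sample expression divided by the baseline mean."""
    baseline = baseline_expression(baseline_levels, min_samples)
    if baseline == 0:
        raise ValueError("baseline mean is zero; relative level is undefined")
    return test_level / baseline


# Illustrative use with made-up values for ten baseline samples.
baseline = [10.0] * 5 + [30.0] * 5
print(relative_expression(60.0, baseline))
```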

In another embodiment of the present invention, a polypeptide corresponding to 
a marker is detected. A preferred agent for detecting a polypeptide of the invention is an 
antibody capable of binding to a polypeptide corresponding to a marker of the invention, 
preferably an antibody with a detectable label. Antibodies can be polyclonal, or more 
preferably, monoclonal. An intact antibody, or a fragment thereof (e.g., Fab or F(ab')2) 
can be used. The term "labeled", with regard to the probe or antibody, is intended to 
encompass direct labeling of the probe or antibody by coupling (i.e., physically linking) 
a detectable substance to the probe or antibody, as well as indirect labeling of the probe 
or antibody by reactivity with another reagent that is directly labeled. Examples of 
indirect labeling include detection of a primary antibody using a fluorescently labeled 
secondary antibody and end-labeling of a DNA probe with biotin such that it can be 
detected with fluorescently labeled streptavidin. 

Proteins from ovarian cells can be isolated using techniques that are well known 
to those of skill in the art. The protein isolation methods employed can, for example, be 
such as those described in Harlow and Lane (Harlow and Lane, 1988, Antibodies: A 
Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New 
York). 

A variety of formats can be employed to determine whether a sample contains a 
protein that binds to a given antibody. Examples of such formats include, but are not 
limited to, enzyme immunoassay (EIA), radioimmunoassay (RIA), Western blot 
analysis and enzyme linked immunosorbent assay (ELISA). A skilled artisan can 
readily adapt known protein/antibody detection methods for use in determining whether 
ovarian cells express a marker of the present invention. 

In one format, antibodies, or antibody fragments, can be used in methods such as 
Western blots or immunofluorescence techniques to detect the expressed proteins. In 
such uses, it is generally preferable to immobilize either the antibody or proteins on a 
solid support. Suitable solid phase supports or carriers include any support capable of 
binding an antigen or an antibody. Well-known supports or carriers include glass, 
polystyrene, polypropylene, polyethylene, dextran, nylon, amylases, natural and 
modified celluloses, polyacrylamides, gabbros, and magnetite. 

One skilled in the art will know many other suitable carriers for binding antibody 
or antigen, and will be able to adapt such support for use with the present invention. For 
example, protein isolated from ovarian cells can be subjected to polyacrylamide gel 
electrophoresis and immobilized onto a solid phase support such as nitrocellulose. The 
support can then be washed with suitable buffers followed by treatment with the 
detectably labeled antibody. The solid phase support can then be washed with the buffer 
a second time to remove unbound antibody. The amount of bound label on the solid 
support can then be detected by conventional means. 

The invention also encompasses kits for detecting the presence of a polypeptide 
or nucleic acid corresponding to a marker of the invention in a biological sample (e.g., an 
ovary-associated body fluid such as a urine sample). Such kits can be used to determine 
if a subject is suffering from or is at increased risk of developing ovarian cancer. For 
example, the kit can comprise a labeled compound or agent capable of detecting a 
polypeptide or an mRNA encoding a polypeptide corresponding to a marker of the 
invention in a biological sample and means for determining the amount of the 
polypeptide or mRNA in the sample (e.g., an antibody which binds the polypeptide or an 
oligonucleotide probe which binds to DNA or mRNA encoding the polypeptide). Kits 
can also include instructions for interpreting the results obtained using the kit. 

For antibody-based kits, the kit can comprise, for example: (1) a first antibody 
(e.g., attached to a solid support) which binds to a polypeptide corresponding to a 
marker of the invention; and, optionally, (2) a second, different antibody which binds to 
either the polypeptide or the first antibody and is conjugated to a detectable label. 

For oligonucleotide-based kits, the kit can comprise, for example: (1) an 
oligonucleotide, e.g., a detectably labeled oligonucleotide, which hybridizes to a nucleic 
acid sequence encoding a polypeptide corresponding to a marker of the invention or (2) 
a pair of primers useful for amplifying a nucleic acid molecule corresponding to a 
marker of the invention. The kit can also comprise, e.g., a buffering agent, a 
preservative, or a protein stabilizing agent. The kit can further comprise components 
necessary for detecting the detectable label (e.g., an enzyme or a substrate). The kit can 
also contain a control sample or a series of control samples which can be assayed and 
compared to the test sample. Each component of the kit can be enclosed within an 
individual container and all of the various containers can be within a single package, 
along with instructions for interpreting the results of the assays performed using the kit. 

B. Pharmacogenomics 

Agents or modulators which have a stimulatory or inhibitory effect on expression 
of a marker of the invention can be administered to individuals to treat (prophylactically 
or therapeutically) ovarian cancer in the patient. In conjunction with such treatment, the 
pharmacogenomics (i.e., the study of the relationship between an individual's genotype 
and that individual's response to a foreign compound or drug) of the individual may be 
considered. Differences in metabolism of therapeutics can lead to severe toxicity or 
therapeutic failure by altering the relation between dose and blood concentration of the 
pharmacologically active drug. Thus, the pharmacogenomics of the individual permits 
the selection of effective agents (e.g., drugs) for prophylactic or therapeutic treatments 
based on a consideration of the individual's genotype. Such pharmacogenomics can 
further be used to determine appropriate dosages and therapeutic regimens. 
Accordingly, the level of expression of a marker of the invention in an individual can be 
determined to thereby select appropriate agent(s) for therapeutic or prophylactic 
treatment of the individual. 

Pharmacogenomics deals with clinically significant variations in the response to 
drugs due to altered drug disposition and abnormal action in affected persons. See, e.g., 
Linder (1997) Clin. Chem. 43(2):254-266. In general, two types of pharmacogenetic 
conditions can be differentiated. Genetic conditions transmitted as a single factor 
altering the way drugs act on the body are referred to as "altered drug action." Genetic 
conditions transmitted as single factors altering the way the body acts on drugs are 
referred to as "altered drug metabolism". These pharmacogenetic conditions can occur 
either as rare defects or as polymorphisms. For example, glucose-6-phosphate 
dehydrogenase (G6PD) deficiency is a common inherited enzymopathy in which the 
main clinical complication is hemolysis after ingestion of oxidant drugs (anti-malarials, 
sulfonamides, analgesics, nitrofurans) and consumption of fava beans. 

As an illustrative embodiment, the activity of drug metabolizing enzymes is a 
major determinant of both the intensity and duration of drug action. The discovery of 
genetic polymorphisms of drug metabolizing enzymes (e.g., N-acetyltransferase 2 (NAT 2) 
and cytochrome P450 enzymes CYP2D6 and CYP2C19) has provided an explanation 
as to why some patients do not obtain the expected drug effects or show exaggerated 
drug response and serious toxicity after taking the standard and safe dose of a drug. 
These polymorphisms are expressed in two phenotypes in the population, the extensive 
metabolizer (EM) and poor metabolizer (PM). The prevalence of PM is different among 
different populations. For example, the gene coding for CYP2D6 is highly polymorphic 
and several mutations have been identified in PM, which all lead to the absence of 
functional CYP2D6. Poor metabolizers of CYP2D6 and CYP2C19 quite frequently 
experience exaggerated drug response and side effects when they receive standard 
doses. If a metabolite is the active therapeutic moiety, a PM will show no therapeutic 
response, as demonstrated for the analgesic effect of codeine mediated by its CYP2D6- 
formed metabolite morphine. The other extreme are the so called ultra-rapid 
metabolizers who do not respond to standard doses. Recently, the molecular basis of 
ultra-rapid metabolism has been identified to be due to CYP2D6 gene amplification. 

Thus, the level of expression of a marker of the invention in an individual can be 
determined to thereby select appropriate agent(s) for therapeutic or prophylactic 
treatment of the individual. In addition, pharmacogenetic studies can be used to apply 
genotyping of polymorphic alleles encoding drug-metabolizing enzymes to the 
identification of an individual's drug responsiveness phenotype. This knowledge, when 
applied to dosing or drug selection, can avoid adverse reactions or therapeutic failure 
and thus enhance therapeutic or prophylactic efficiency when treating a subject with a 
modulator of expression of a marker of the invention. 






C. Monitoring Clinical Trials 

Monitoring the influence of agents (e.g., drug compounds) on the level of 
expression of a marker of the invention can be applied not only in basic drug screening, 
but also in clinical trials. For example, the effectiveness of an agent to affect marker 
expression can be monitored in clinical trials of subjects receiving treatment for ovarian 
cancer. In a preferred embodiment, the present invention provides a method for 
monitoring the effectiveness of treatment of a subject with an agent (e.g., an agonist, 
antagonist, peptidomimetic, protein, peptide, nucleic acid, small molecule, or other drug 
candidate) comprising the steps of (i) obtaining a pre-administration sample from a 
subject prior to administration of the agent; (ii) detecting the level of expression of one 
or more selected markers of the invention in the pre-administration sample; (iii) 
obtaining one or more post-administration samples from the subject; (iv) detecting the 
level of expression of the marker(s) in the post-administration samples; (v) comparing 
the level of expression of the marker(s) in the pre-administration sample with the level 
of expression of the marker(s) in the post-administration sample or samples; and (vi) 
altering the administration of the agent to the subject accordingly. For example, 
increased administration of the agent can be desirable to increase expression of the 
marker(s) to higher levels than detected, i.e., to increase the effectiveness of the agent. 
Alternatively, decreased administration of the agent can be desirable to decrease 
effectiveness of the agent. 
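Steps (v) and (vi) of the monitoring method amount to comparing pre- and post-administration expression levels and adjusting administration in response. The Python sketch below illustrates that comparison for an agent assumed to raise marker expression, as in the example given in the text. The fold-change summary, the target fold change, the tolerance, and all names and values are hypothetical illustrations, not part of the described method.

```python
from statistics import mean


def fold_changes(pre, post_samples):
    """Per-marker ratio of mean post-administration level to pre level.

    pre:          dict mapping marker name -> pre-administration level
    post_samples: list of dicts, one per post-administration sample
    """
    if not post_samples:
        raise ValueError("at least one post-administration sample is required")
    changes = {}
    for marker, pre_level in pre.items():
        if pre_level <= 0:
            raise ValueError(f"pre-administration level for {marker} must be positive")
        post_mean = mean(sample[marker] for sample in post_samples)
        changes[marker] = post_mean / pre_level
    return changes


def suggest_adjustment(fold_change: float, target_fold: float, tolerance: float = 0.1) -> str:
    """Direction of change in administration for an expression-raising agent."""
    if fold_change < target_fold - tolerance:
        return "increase"
    if fold_change > target_fold + tolerance:
        return "decrease"
    return "maintain"


# Illustrative use with made-up levels for one marker.
changes = fold_changes({"marker_A": 2.0}, [{"marker_A": 3.0}, {"marker_A": 5.0}])
print(changes, suggest_adjustment(changes["marker_A"], target_fold=3.0))
```

For an agent intended to lower marker expression, the two directions returned by the hypothetical `suggest_adjustment` would be reversed.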

D. Electronic Apparatus Readable Media and Arrays 

Electronic apparatus readable media comprising a marker of the present 
invention are also provided. As used herein, "electronic apparatus readable media" refers 
to any suitable medium for storing, holding or containing data or information that can be 
read and accessed directly by an electronic apparatus. Such media can include, but are 
not limited to: magnetic storage media, such as floppy discs, hard disc storage medium, 
and magnetic tape; optical storage media such as compact disc; electronic storage media 
such as RAM, ROM, EPROM, EEPROM and the like; general hard disks and hybrids of 
these categories such as magnetic/optical storage media. The medium is adapted or 
configured for having recorded thereon a marker of the present invention. 

As used herein, the term "electronic apparatus" is intended to include any 
suitable computing or processing apparatus or other device configured or adapted for 
storing data or information. Examples of electronic apparatus suitable for use with the 
present invention include stand-alone computing apparatus; networks, including a local 
area network (LAN), a wide area network (WAN), Internet, Intranet, and Extranet; 
electronic appliances such as personal digital assistants (PDAs), cellular phones, pagers 
and the like; and local and distributed processing systems. 

As used herein, "recorded" refers to a process for storing or encoding 
information on the electronic apparatus readable medium. Those skilled in the art can 
readily adopt any of the presently known methods for recording information on known 
media to generate manufactures comprising the markers of the present invention. 
A variety of software programs and formats can be used to store the marker 

10 information of the present invention on the electronic apparatus readable medium. For 
example, the nucleic acid sequence corresponding to the markers can be represented in a 
word processing text file, formatted in commercially-available software such as 
WordPerfect and Microsoft Word, or represented in the form of an ASCII file, stored in 
a database application, such as DB2, Sybase, Oracle, or the like, as well as in other 

1 5 forms. Any number of dataprocessor structuring formats (e.g. , text file or database) may 
be employed in order to obtain or create a medium having recorded thereon the the , 
markers of the present invention. 
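
As an illustration of the two structuring formats named above (text file and database), the following Python sketch writes marker sequences as FASTA-style ASCII text and also stores them in a database. SQLite is used here only because it ships with Python; it stands in for the DB2, Sybase, or Oracle applications mentioned in the text. The record identifiers, the sequences, and the table layout are hypothetical placeholders, not data from Tables 1-3.

```python
import sqlite3


def to_fasta(records):
    """Render (identifier, sequence) pairs as FASTA-style ASCII text."""
    return "".join(f">{seq_id}\n{sequence}\n" for seq_id, sequence in records)


def store_in_database(records):
    """Store the same records in an in-memory SQLite database and return the connection."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE marker (seq_id TEXT PRIMARY KEY, sequence TEXT NOT NULL)"
    )
    conn.executemany("INSERT INTO marker VALUES (?, ?)", records)
    conn.commit()
    return conn


# Placeholder records; real marker sequences would come from the sequence tables.
records = [("SEQ_ID_NO_1", "ATGGCCTTAGGC"), ("SEQ_ID_NO_2", "GGCATTACGATC")]

fasta_text = to_fasta(records)
conn = store_in_database(records)
row = conn.execute(
    "SELECT sequence FROM marker WHERE seq_id = ?", ("SEQ_ID_NO_2",)
).fetchone()
```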

By providing the markers of the invention in readable form, one can routinely
access the marker sequence information for a variety of purposes. For example, one
skilled in the art can use the nucleotide or amino acid sequences of the present invention
in readable form to compare a target sequence or target structural motif with the
sequence information stored within the data storage means. Search means are used to
identify fragments or regions of the sequences of the invention which match a particular
target sequence or target motif.
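
A simple "search means" of the kind described above can be sketched in Python as an exact-match scan. This is only an illustration under stated assumptions: the stored sequences are held in a dictionary, matching is exact and case-insensitive, and the function name `find_motif` is hypothetical. Practical sequence comparison would normally use an alignment tool such as BLAST.

```python
def find_motif(sequences, motif):
    """Return (sequence id, 0-based start) for every exact occurrence of motif.

    Overlapping occurrences are reported; comparison is case-insensitive.
    """
    motif = motif.upper()
    hits = []
    for seq_id, sequence in sequences.items():
        sequence = sequence.upper()
        start = sequence.find(motif)
        while start != -1:
            hits.append((seq_id, start))
            start = sequence.find(motif, start + 1)
    return hits


# Placeholder stored sequences, not taken from the sequence tables.
stored = {"A": "ATGATGCC", "B": "CCGG"}
atg_hits = find_motif(stored, "ATG")
gg_hits = find_motif(stored, "gg")
```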

The present invention therefore provides a medium for holding instructions for
performing a method for determining whether a subject has ovarian cancer or a
pre-disposition to ovarian cancer, wherein the method comprises the steps of determining
the presence or absence of a marker and, based on the presence or absence of the marker,
determining whether the subject has ovarian cancer or a pre-disposition to ovarian
cancer and/or recommending a particular treatment for ovarian cancer or pre-ovarian
cancer condition.

The present invention further provides, in an electronic system and/or in a
network, a method for determining whether a subject has ovarian cancer or a
pre-disposition to ovarian cancer associated with a marker, wherein the method comprises
the steps of determining the presence or absence of the marker and, based on the
presence or absence of the marker, determining whether the subject has ovarian cancer
or a pre-disposition to ovarian cancer, and/or recommending a particular treatment for
the ovarian cancer or pre-ovarian cancer condition. The method may further comprise
the step of receiving phenotypic information associated with the subject and/or acquiring
from a network phenotypic information associated with the subject.

The present invention also provides, in a network, a method for determining
whether a subject has ovarian cancer or a pre-disposition to ovarian cancer associated
with a marker, said method comprising the steps of receiving information associated
with the marker, receiving phenotypic information associated with the subject, acquiring
information from the network corresponding to the marker and/or ovarian cancer, and,
based on one or more of the phenotypic information, the marker, and the acquired
information, determining whether the subject has ovarian cancer or a pre-disposition to
ovarian cancer. The method may further comprise the step of recommending a
particular treatment for the ovarian cancer or pre-ovarian cancer condition.
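
The data flow of the network method above (marker information, phenotypic information, and information acquired from the network feeding a single determination) can be sketched as follows. Every field name, the `min_signals` threshold, and the combining rule are hypothetical placeholders chosen for illustration. The specification does not define a decision rule, and this toy rule is not a clinical criterion.

```python
def assess_subject(marker_info, phenotype, acquired, min_signals=2):
    """Combine three information sources into one flag (toy rule, illustration only).

    marker_info: received information associated with the marker
    phenotype:   received phenotypic information associated with the subject
    acquired:    information acquired from the network about the marker
    """
    signals = []
    if marker_info.get("present"):
        signals.append("marker present in sample")
    if phenotype.get("family_history"):
        signals.append("phenotype: family history reported")
    if marker_info.get("marker_id") in acquired.get("cancer_associated_markers", ()):
        signals.append("network record associates marker with ovarian cancer")
    return {"flagged": len(signals) >= min_signals, "signals": signals}


result = assess_subject(
    {"marker_id": "M1", "present": True},
    {"family_history": False},
    {"cancer_associated_markers": {"M1"}},
)
```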

The present invention also provides a business method for determining whether a
subject has ovarian cancer or a pre-disposition to ovarian cancer, said method
comprising the steps of receiving information associated with the marker, receiving
phenotypic information associated with the subject, acquiring information from the
network corresponding to the marker and/or ovarian cancer, and based on one or more
of the phenotypic information, the marker, and the acquired information, determining
whether the subject has ovarian cancer or a pre-disposition to ovarian cancer. The
method may further comprise the step of recommending a particular treatment for the
ovarian cancer or pre-ovarian cancer condition.

The invention also includes an array comprising a marker of the present
invention. The array can be used to assay expression of one or more genes in the array.
In one embodiment, the array can be used to assay gene expression in a tissue to
ascertain tissue specificity of genes in the array. In this manner, up to about 7600 genes
can be simultaneously assayed for expression. This allows a profile to be developed
showing a battery of genes specifically expressed in one or more tissues.
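
The tissue-specificity profile described above can be sketched in Python. The example assumes array read-outs are available as a nested dictionary of gene, tissue, and signal, and that a gene counts as "expressed" in a tissue when its signal meets a cutoff. The gene names, signal values, and the `threshold` parameter are hypothetical.

```python
def tissue_profile(expression, threshold=1.0):
    """Map each tissue to the genes expressed (signal >= threshold) in that tissue only."""
    profile = {}
    for gene, levels in expression.items():
        expressed_in = [tissue for tissue, level in levels.items() if level >= threshold]
        if len(expressed_in) == 1:
            profile.setdefault(expressed_in[0], []).append(gene)
    return profile


# Placeholder array signals in arbitrary units.
expression = {
    "geneA": {"ovary": 5.0, "liver": 0.1},
    "geneB": {"ovary": 3.0, "liver": 2.0},
    "geneC": {"ovary": 0.0, "liver": 4.0},
}
profile = tissue_profile(expression)
```

Because the signal values are kept rather than reduced to present or absent, the same structure also supports the quantitative comparisons of expression level within a tissue.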

In addition to such qualitative determination, the invention allows the
quantitation of gene expression. Thus, not only tissue specificity, but also the level of
expression of a battery of genes in the tissue is ascertainable. Thus, genes can be
grouped on the basis of their tissue expression per se and level of expression in that
tissue. This is useful, for example, in ascertaining the relationship of gene expression
between or among tissues. Thus, one tissue can be perturbed and the effect on gene
expression in a second tissue can be determined. In this context, the effect of one cell
type on another cell type in response to a biological stimulus can be determined. Such a
determination is useful, for example, to know the effect of cell-cell interaction at the
level of gene expression. If an agent is administered therapeutically to treat one cell
type but has an undesirable effect on another cell type, the invention provides an assay
to determine the molecular basis of the undesirable effect and thus provides the
opportunity to co-administer a counteracting agent or otherwise treat the undesired
effect. Similarly, even within a single cell type, undesirable biological effects can be
determined at the molecular level. Thus, the effects of an agent on expression of other
than the target gene can be ascertained and counteracted.

In another embodiment, the array can be used to monitor the time course of
expression of one or more genes in the array. This can occur in various biological
contexts, as disclosed herein, for example development of ovarian cancer, progression of
ovarian cancer, and processes, such as cellular transformation, associated with ovarian
cancer.

The array is also useful for ascertaining the effect of the expression of a gene on
the expression of other genes in the same cell or in different cells. This provides, for
example, for a selection of alternate molecular targets for therapeutic intervention if the
ultimate or downstream target cannot be regulated.

The array is also useful for ascertaining differential expression patterns of one or
more genes in normal and abnormal cells. This provides a battery of genes that could
serve as a molecular target for diagnosis or therapeutic intervention.
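
Differential expression between normal and abnormal cells, as described above, is commonly summarized as a log2 fold change. The following Python sketch assumes array signals are non-negative numbers in matching units for both cell populations. The pseudocount, the cutoff `min_abs_log2`, and the gene names are hypothetical choices for illustration, not values from the specification.

```python
import math


def log2_fold_changes(normal, abnormal, pseudocount=1.0):
    """log2((abnormal + pseudocount) / (normal + pseudocount)) for genes present in both."""
    return {
        gene: math.log2((abnormal[gene] + pseudocount) / (normal[gene] + pseudocount))
        for gene in normal
        if gene in abnormal
    }


def differentially_expressed(normal, abnormal, min_abs_log2=1.0):
    """Genes whose absolute log2 fold change meets the cutoff (1.0 means two-fold)."""
    changes = log2_fold_changes(normal, abnormal)
    return sorted(gene for gene, value in changes.items() if abs(value) >= min_abs_log2)


# Placeholder signals in arbitrary units.
normal = {"g1": 3.0, "g2": 7.0}
abnormal = {"g1": 15.0, "g2": 7.0}
changes = log2_fold_changes(normal, abnormal)
candidates = differentially_expressed(normal, abnormal)
```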

E. Surrogate Markers

The markers of the invention may serve as surrogate markers for one or more
disorders or disease states or for conditions leading up to disease states, and in
particular, ovarian cancer. As used herein, a "surrogate marker" is an objective
biochemical marker which correlates with the absence or presence of a disease or
disorder, or with the progression of a disease or disorder (e.g., with the presence or
absence of a tumor). The presence or quantity of such markers is independent of the
disease. Therefore, these markers may serve to indicate whether a particular course of
treatment is effective in lessening a disease state or disorder. Surrogate markers are of
particular use when the presence or extent of a disease state or disorder is difficult to
assess through standard methodologies (e.g., early stage tumors), or when an assessment
of disease progression is desired before a potentially dangerous clinical endpoint is
reached (e.g., an assessment of cardiovascular disease may be made using cholesterol
levels as a surrogate marker, and an analysis of HIV infection may be made using HIV
RNA levels as a surrogate marker, well in advance of the undesirable clinical outcomes
of myocardial infarction or fully-developed AIDS). Examples of the use of surrogate
markers in the art include: Koomen et al. (2000) J. Mass. Spectrom. 35: 258-264; and
James (1994) AIDS Treatment News Archive 209.

The markers of the invention are also useful as pharmacodynamic markers. As
used herein, a "pharmacodynamic marker" is an objective biochemical marker which
correlates specifically with drug effects. The presence or quantity of a
pharmacodynamic marker is not related to the disease state or disorder for which the
drug is being administered; therefore, the presence or quantity of the marker is
indicative of the presence or activity of the drug in a subject. For example, a
pharmacodynamic marker may be indicative of the concentration of the drug in a
biological tissue, in that the marker is either expressed or transcribed or not expressed or
transcribed in that tissue in relationship to the level of the drug. In this fashion, the
distribution or uptake of the drug may be monitored by the pharmacodynamic marker.
Similarly, the presence or quantity of the pharmacodynamic marker may be related to
the presence or quantity of the metabolic product of a drug, such that the presence or
quantity of the marker is indicative of the relative breakdown rate of the drug in vivo.
Pharmacodynamic markers are of particular use in increasing the sensitivity of detection
of drug effects, particularly when the drug is administered in low doses. Since even a
small amount of a drug may be sufficient to activate multiple rounds of marker
transcription or expression, the amplified marker may be in a quantity which is more
readily detectable than the drug itself. Also, the marker may be more easily detected
due to the nature of the marker itself; for example, using the methods described herein,
antibodies may be employed in an immune-based detection system for a protein marker,
or marker-specific radiolabeled probes may be used to detect a mRNA marker.
Furthermore, the use of a pharmacodynamic marker may offer mechanism-based
prediction of risk due to drug treatment beyond the range of possible direct
observations. Examples of the use of pharmacodynamic markers in the art include:
Matsuda et al. US 6,033,862; Hattis et al. (1991) Env. Health Perspect. 90: 229-238;
Schentag (1999) Am. J. Health-Syst. Pharm. 56 Suppl. 3: S21-S24; and Nicolau (1999)
Am. J. Health-Syst. Pharm. 56 Suppl. 3: S16-S20.

The markers of the invention are also useful as pharmacogenomic markers. As
used herein, a "pharmacogenomic marker" is an objective biochemical marker which
correlates with a specific clinical drug response or susceptibility in a subject (see, e.g.,
McLeod et al. (1999) Eur. J. Cancer 35(12): 1650-1652). The presence or quantity of
the pharmacogenomic marker is related to the predicted response of the subject to a
specific drug or class of drugs prior to administration of the drug. By assessing the
presence or quantity of one or more pharmacogenomic markers in a subject, a drug
therapy which is most appropriate for the subject, or which is predicted to have a greater
degree of success, may be selected. For example, based on the presence or quantity of
RNA or protein for specific tumor markers in a subject, a drug or course of treatment
may be selected that is optimized for the treatment of the specific tumor likely to be
present in the subject. Similarly, the presence or absence of a specific sequence
mutation in marker DNA may correlate with drug response. The use of
pharmacogenomic markers therefore permits the application of the most appropriate
treatment for each subject without having to administer the therapy.



VI. Experimental Protocol

A. Subtracted Libraries 

Subtracted libraries are generated using a PCR based method that allows the 
isolation of clones expressed at higher levels in one population of mRNA (tester)
compared to another population (driver). Both tester and driver mRNA populations are 
converted into cDNA by reverse transcription, and then PCR amplified using the 
SMART PCR kit from Clontech. Tester and driver cDNAs are then hybridized using 
the PCR-Select cDNA subtraction kit from Clontech. This technique results in both 
subtraction and normalization, which is an equalization of copy number of low- 
abundance and high-abundance sequences. After generation of the subtractive libraries, 
a group of 96 or more clones from each library is tested to confirm differential 
expression by reverse Southern hybridization. 

To create the subtracted libraries, a first group of regular cDNA libraries was
constructed. Library johOa was constructed from a pool of 5 normal ovarian epithelial
cell cultures. Library johOb was constructed from a pool of 5 ascites short cultured
samples from ovarian cancer patients. Library johOc was constructed from a pool of 6
serous late stage (III/IV) tumor samples. Three subtracted libraries were generated from
tumor samples. Library johOd was a subtracted ascites library, where the tester was
johOb, and the driver was johOa. The johOe and the johOf library were both subtracted
stage III/IV serous tumor libraries. The tester for both of these libraries was johOc, and
the driver was a pooled RNA from normal tissues. The tissues used for this driver pool
were: kidney, small intestine, prostate, lung, heart, muscle, spleen, pancreas, liver, and
lymphocyte. Library cMhOg was the same as the johOc and johOf libraries, with the
exception that normal ovary was added to the driver. cMhOh, i, j, and k are all stage I/II
subtracted libraries made from pooled tumor RNAs of different histological types
(h=serous, i=endometrioid, j=clear cell, k=mucinous). The driver was the same for these
4 libraries. It consisted of normal ovarian epithelial RNA and PBML RNA.

Of the markers listed in Table 1, SEQ ID NOS: 1-129, 916-1029, 1566-1571 and 1607-1865
were identified in library johOa. Markers identified in johOb include SEQ ID NOS:
130-177, 1030-1081, 1572-1574, and 1866-1974. Markers identified in johOc include
SEQ ID NOS: 178-269, 1082-1120, 1575-1577, and 1975-2060. Markers identified in
johOd include SEQ ID NOS: 270-370, 1121-1304, 1578-1592, and 2061-2244. Markers
identified in johOe include SEQ ID NOS: 371-611, 1305-1416, 1593-1596 and 2245-2487.
Markers identified in johOf include SEQ ID NOS: 612-915, 1417-1565, 1597-1606,
and 2488-2871.

Of the markers listed in Table 1A, SEQ ID NOS: 2872-2976,
3817-3898, 4438-4443 and 4474-4675 were identified in library cMhOg. Markers
identified in cMhOh include SEQ ID NOS: 2977-3376, 3899-4072, 4444-4455, and
4676-5303. Markers identified in cMhOi include SEQ ID NOS: 3377-3495, 4073-4158,
4456-4460, and 5304-5637. Markers identified in cMhOj include SEQ ID NOS: 3496-3742,
4195-4390, 4461-4468, and 5638-6197. Markers identified in cMhOk include
SEQ ID NOS: 3743-3816, 4391-4437, 4469-4473 and 6198-6398.
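
The assignment of Table 1 SEQ ID NOS to source libraries given above can be captured as a small lookup table. The Python sketch below transcribes only the Table 1 ranges (libraries johOa through johOf) from the preceding text. The names `LIBRARY_RANGES` and `library_for` are hypothetical, and the Table 1A libraries could be added in the same way.

```python
# SEQ ID NO ranges (inclusive) for each Table 1 library, transcribed from the text above.
LIBRARY_RANGES = {
    "johOa": [(1, 129), (916, 1029), (1566, 1571), (1607, 1865)],
    "johOb": [(130, 177), (1030, 1081), (1572, 1574), (1866, 1974)],
    "johOc": [(178, 269), (1082, 1120), (1575, 1577), (1975, 2060)],
    "johOd": [(270, 370), (1121, 1304), (1578, 1592), (2061, 2244)],
    "johOe": [(371, 611), (1305, 1416), (1593, 1596), (2245, 2487)],
    "johOf": [(612, 915), (1417, 1565), (1597, 1606), (2488, 2871)],
}


def library_for(seq_id):
    """Return the Table 1 library in which a SEQ ID NO was identified, or None."""
    for library, ranges in LIBRARY_RANGES.items():
        if any(low <= seq_id <= high for low, high in ranges):
            return library
    return None


# Sanity check: the transcribed ranges should cover SEQ ID NOS 1-2871 exactly once.
covered = sum(high - low + 1 for ranges in LIBRARY_RANGES.values() for low, high in ranges)
```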


VII. Summary Of The Data Provided In The Tables 

Tables 1, 1A, 2 and 3 are being filed concurrently herewith on a compact disc in
lieu of paper copies. The compact disc submitted is formatted from an IBM-PC and is
compatible with MS-Windows. The disc contains the following four (4) files:
Table1.text, containing 1,223kb, Table1A.text, containing 1,582kb, Table2.text,
containing 10,600kb, and Table3.text, containing 568kb. The material on the compact
disc, namely Tables 1, 1A, 2 and 3, is expressly incorporated by reference.

Tables 1 and 1A show 6398 novel nucleotide sequences. These 6398 novel
sequences were determined to be novel through various BLAST searches of available
databases. Of these novel markers, SEQ ID NOS: 1566-1606 and 4438-4473 are
preferred, SEQ ID NOS: 916-1565 and 3817-4437 are more preferred, and SEQ ID
NOS: 1-915 and 2872-3816 are most preferred.

The sequences of Tables 1 and 1A were re-interpreted and vector sequences
removed, and those sequences are set forth in Table 2.

Table 3 correlates the SEQ ID NOS. from Tables 1 and 1A with those of Table 2.

The contents of all references, patents, published patent applications, and 
databases cited throughout this application are hereby incorporated by reference. 



Other Embodiments 

Those skilled in the art will recognize, or be able to ascertain using no more than 
routine experimentation, many equivalents to the specific embodiments of the invention 
described herein. Such equivalents are intended to be encompassed by the following
claims. 

What is claimed is: 

Claims

1. An isolated nucleic acid molecule comprising a nucleotide sequence of
Tables 1-2, or a complement thereof.

2. A vector which contains the nucleic acid molecule of claim 1.

3. A host cell which contains the nucleic acid molecule of claim 1.

4. An isolated polypeptide which is encoded by a nucleic acid molecule
comprising a nucleotide sequence of Tables 1-2.

5. An antibody which selectively binds to a polypeptide of claim 4.

6. A method for producing a polypeptide comprising culturing the host cell
of claim 3 under conditions in which the nucleic acid molecule is expressed.

7. A method for detecting the presence of a polypeptide of claim 4 in a
sample comprising:

a) contacting the sample with a compound which selectively binds to
the polypeptide; and

b) determining whether the compound binds to the polypeptide in
the sample to thereby detect the presence of a polypeptide of claim 4 in the
sample.

8. A kit comprising a compound which selectively binds to the polypeptide
of claim 4.

9. A method for detecting the presence of a nucleic acid molecule of claim 1
in a sample comprising:

a) contacting the sample with a nucleic acid probe or primer which
selectively hybridizes to the nucleic acid molecule; and

b) determining whether the nucleic acid probe or primer binds to a
nucleic acid molecule in the sample to thereby detect the presence of a nucleic
acid molecule of claim 1 in the sample.

10. The method of claim 9, wherein the sample comprises mRNA molecules
and is contacted with a nucleic acid probe.

11. The method of claim 9, wherein the sample is isolated from ovarian
tissue.

12. The method of claim 9, wherein the sample is a tumor sample.

13. A kit comprising a compound which selectively hybridizes to a nucleic
acid molecule of claim 1.

14. A method of assessing whether a patient is afflicted with ovarian cancer,
the method comprising comparing:

a) the level of expression of a marker in a patient sample, wherein
the marker is selected from the group consisting of the markers listed in Tables
1-2, and

b) the normal level of expression of the marker in a control
non-ovarian cancer sample,

wherein a significant difference between the level of expression of the marker in
the patient sample and the normal level is an indication that the patient is afflicted with
ovarian cancer.

15. The method of claim 14, wherein the marker corresponds to a secreted
protein.

16. The method of claim 14, wherein the marker corresponds to a transcribed
polynucleotide or portion thereof, wherein the polynucleotide comprises the marker.

17. The method of claim 14, wherein the sample comprises cells obtained
from the patient.

18. The method of claim 17, wherein the sample is an ovarian tissue sample.

19. The method of claim 14, wherein the sample is an ovary-associated body

20. The method of claim 14, wherein the level of expression of the marker in
the sample is assessed by detecting the presence in the sample of a protein or protein
fragment corresponding to the marker.

21. The method of claim 20, wherein the presence of the protein or protein
fragment is detected using a reagent which specifically binds with the protein or protein
fragment.

22. The method of claim 21, wherein the reagent is selected from the group
consisting of an antibody, an antibody derivative, and an antibody fragment.

23. The method of claim 14, wherein the level of expression of the marker in
the sample is assessed by detecting the presence in the sample of a transcribed
polynucleotide or portion thereof, wherein the transcribed polynucleotide comprises the
marker.

24. The method of claim 23, wherein the transcribed polynucleotide is an
mRNA.

25. The method of claim 23, wherein the transcribed polynucleotide is a
cDNA.

26. The method of claim 23, wherein the step of detecting further comprises
amplifying the transcribed polynucleotide.

27. The method of claim 14, wherein the level of expression of the marker in
the sample is assessed by detecting the presence in the sample of a transcribed
polynucleotide which anneals with the marker or anneals with a portion of a
polynucleotide wherein the polynucleotide comprises the marker, under stringent
hybridization conditions.

28. The method of claim 14, wherein the level of expression of the marker in
the sample differs from the normal level of expression of the marker in a patient not
afflicted with ovarian cancer by a factor of at least about 2.

29. The method of claim 14, wherein the level of expression of the marker in
the sample differs from the normal level of expression of the marker in a patient not
afflicted with ovarian cancer by a factor of at least about 5.

30. The method of claim 14, comprising comparing:

a) the level of expression in the sample of each of a plurality of
markers independently selected from the markers listed in Tables 1-2, and

b) the normal level of expression of each of the plurality of markers
in samples of the same type obtained from control humans not afflicted with
ovarian cancer,

wherein the level of expression of more than one of the markers is significantly
altered, relative to the corresponding normal levels of expression of the markers, is an
indication that the patient is afflicted with ovarian cancer.

31. The method of claim 30, wherein the level of expression of each of the
markers is significantly altered, relative to the corresponding normal levels of
expression of the markers, is an indication that the patient is afflicted with ovarian
cancer.

32. The method of claim 30, wherein the plurality comprises at least three of
the markers.



33. The method of claim 30, wherein the plurality comprises at least five of
the markers.

34. A method for monitoring the progression of ovarian cancer in a patient,
the method comprising:

a) detecting in a patient sample at a first point in time, the expression
of a marker, wherein the marker is selected from the group consisting of the
markers listed in Tables 1-2;

b) repeating step a) at a subsequent point in time; and

c) comparing the level of expression detected in steps a) and b), and
therefrom monitoring the progression of ovarian cancer.

35. The method of claim 34, wherein the marker corresponds to a secreted
protein.

36. The method of claim 34, wherein the marker corresponds to a transcribed
polynucleotide or portion thereof, wherein the polynucleotide comprises the marker.

37. The method of claim 34, wherein the sample comprises cells obtained
from the patient.

38. The method of claim 37, wherein the patient sample is an ovarian tissue
sample.

39. The method of claim 34, wherein between the first point in time and the
subsequent point in time, the patient has undergone surgery to remove ovarian tissue.

40. A method of assessing the efficacy of a test compound for inhibiting
ovarian cancer in a patient, the method comprising comparing:

a) expression of a marker in a first sample obtained from the patient
and exposed to the test compound, wherein the marker is selected from the group
consisting of the markers listed in Tables 1-2, and

b) expression of the marker in a second sample obtained from the
patient, wherein the sample is not exposed to the test compound,

wherein a significantly lower level of expression of the marker in the first
sample, relative to the second sample, is an indication that the test compound is
efficacious for inhibiting ovarian cancer in the patient.

41. The method of claim 40, wherein the first and second samples are
portions of a single sample obtained from the patient.

42. The method of claim 40, wherein the first and second samples are
portions of pooled samples obtained from the patient.

43. A method of assessing the efficacy of a therapy for inhibiting ovarian
cancer in a patient, the method comprising comparing:

a) expression of a marker in the first sample obtained from the
patient prior to providing at least a portion of the therapy to the patient, wherein
the marker is selected from the group consisting of the markers listed in Tables
1-2, and

b) expression of the marker in a second sample obtained from the
patient following provision of the portion of the therapy,

wherein a significantly lower level of expression of the marker in the second
sample, relative to the first sample, is an indication that the therapy is efficacious for
inhibiting ovarian cancer in the patient.

44. A method of selecting a composition for inhibiting ovarian cancer in a
patient, the method comprising:

a) obtaining a sample comprising cancer cells from the patient;

b) separately exposing aliquots of the sample in the presence of a
plurality of test compositions;

c) comparing expression of a marker in each of the aliquots, wherein
the marker is selected from the group consisting of the markers listed in Tables
1-2; and

d) selecting one of the test compositions which alters the level of
expression of the marker in the aliquot containing that test composition, relative
to other test compositions.

45. A method of inhibiting ovarian cancer in a patient, the method
comprising:

a) obtaining a sample comprising cancer cells from the patient;

b) separately maintaining aliquots of the sample in the presence of a
plurality of test compositions;

c) comparing expression of a marker in each of the aliquots, wherein
the marker is selected from the group consisting of the markers listed in Tables
1-2; and

d) administering to the patient at least one of the test compositions
which alters the level of expression of the marker in the aliquot containing that
test composition, relative to other test compositions.

46. A kit for assessing whether a patient is afflicted with ovarian cancer, the
kit comprising reagents for assessing expression of a marker selected from the group
consisting of the markers listed in Tables 1-2.

47. A kit for assessing the presence of ovarian cancer cells, the kit
comprising a nucleic acid probe wherein the probe specifically binds with a transcribed
polynucleotide corresponding to a marker selected from the group consisting of the
markers listed in Tables 1-2.



WO 01/070979 PCT/US01/09126 


48. A kit for assessing the suitability of each of a plurality of compounds for
inhibiting ovarian cancer in a patient, the kit comprising:

a) the plurality of compounds; and

b) a reagent for assessing expression of a marker selected from the
group consisting of the markers listed in Tables 1-2.

49. A method of making an isolated hybridoma which produces an antibody
useful for assessing whether a patient is afflicted with ovarian cancer, the method
comprising:

isolating a protein or protein fragment corresponding to a marker selected
from the group consisting of the markers listed in Tables 1-2;

immunizing a mammal using the isolated protein or protein fragment;

isolating splenocytes from the immunized mammal;

fusing the isolated splenocytes with an immortalized cell line to form
hybridomas; and

screening individual hybridomas for production of an antibody which
specifically binds with the protein or protein fragment to isolate the hybridoma.

50. An antibody produced by a hybridoma made by the method of claim 42.

51. A kit for assessing the presence of human ovarian cancer cells, the kit
comprising an antibody, wherein the antibody specifically binds with a protein or
protein fragment corresponding to a marker selected from the group consisting of the
markers listed in Tables 1-2.


52. A method of assessing the ovarian cell carcinogenic potential of a test
compound, the method comprising:

a) maintaining separate aliquots of ovarian cells in the presence and
absence of the test compound; and

b) comparing expression of a marker in each of the aliquots, wherein
the marker is selected from the group consisting of the markers listed in Tables
1-2,

wherein a significantly altered level of expression of the marker in the aliquot
maintained in the presence of the test compound, relative to the aliquot maintained in the
absence of the test compound, is an indication that the test compound possesses human
ovarian cell carcinogenic potential.

53. A kit for assessing the ovarian cell carcinogenic potential of a test
compound, the kit comprising ovarian cells and a reagent for assessing expression of a
marker, wherein the marker is selected from the group consisting of the markers listed in
Tables 1-2.

54. A method of inhibiting ovarian cancer in a patient at risk for developing
ovarian cancer, the method comprising inhibiting expression of a gene corresponding to
a marker selected from the markers listed in Tables 1-2.

55. A method of treating a patient afflicted with ovarian cancer, the method
comprising providing to cells of the patient an antisense oligonucleotide complementary
to a polynucleotide corresponding to a marker selected from the markers listed in Tables
1-2.

56. A method of inhibiting ovarian cancer in a patient at risk for developing
ovarian cancer, the method comprising decreasing expression of a gene corresponding to
a marker selected from the markers listed in Tables 1-2.

57. A method for determining whether ovarian cancer has metastasized in a 
patient, the method comprising comparing:

a) the level of expression of a marker in a patient sample, wherein
the marker is selected from the group consisting of the markers listed in Tables 
1-2, and 

b) the normal level or non-metastatic level of expression of the 
marker in a control sample,
wherein a significant difference between the level of expression in the patient
sample and the normal level or non-metastatic level is an indication that the ovarian
cancer has metastasized.



58. The method of claim 57, wherein the marker corresponds to a secreted
protein.

59. The method of claim 57, wherein the marker corresponds to a transcribed 
polynucleotide or portion thereof, wherein the polynucleotide comprises the marker. 


60. The method of claim 57, wherein the sample comprises cells obtained 
from the patient. 



61. The method of claim 60, wherein the patient sample is an ovarian tissue
sample.

62. A method for assessing the aggressiveness or indolence of ovarian cancer 
comprising comparing: 

a) the level of expression of a marker in a sample, wherein at least 
one marker is selected from the markers of Tables 1-2, and

b) the normal level of expression of the marker in a control sample, 
wherein a significant difference between the level of expression in the sample
and the normal level is an indication that the cancer is aggressive or indolent.

63. The method of claim 62, wherein the marker corresponds to a secreted
protein.

64. The method of claim 62, wherein the marker corresponds to a transcribed
polynucleotide or portion thereof, wherein the polynucleotide comprises the marker. 


65. The method of claim 62, wherein the sample comprises cells obtained 
from the patient. 
66. The method of claim 65, wherein the patient sample is an ovarian tissue
sample. 
TABLE 1



Sequence 1 

ACGCGTCCGGGAGACACAAAGCAGGAAGCCTCCGGGAGACCAGAGCTGGGTGCAGACATA 

CACACACACATACACACAGACACACAGAGTCACACACACTCACACACACTCTCTCTCTCT 

CTCCCTCTGTCTTTCTCTCTCTCTCTCTCTCTCTGTCCTTTCCTCCTGGACAGATCCACA 

GTTATACACAGAAACAAACACACACGCACGTAGAGAAGTGATTCACAAACACTTAAAGAC 

ATAAATCACAGGTGCAAAGCCATACCTGGGCTCAAAAACCTCCAAGAGAACGCAGCCTCA 

GACCCACCCAGGGGCCAGGGGCCAGGCTGTTTGCGAGACCAGCCCAGGCGGGACCCAGGC 

TTGCACGGGCAGGTACACGACATTCTTGGGCATACGCAGCCCGCCTGGCCGGAGCTGTGG 

GAGTCCTCAGCCCCAAGACCCAGCAGGCGTCTGAGGCCTGCCCACTAAGGAGGAGGAGTC 

ATTGCTGCCATCATTTATCATNCCCTCCCCAGCCACAGTCTGAGGAGCCCNTGTNCACCC 
TTCCCA 

Sequence 2 

CGTCCGGAAAGCTGGTGGGAATGCTAAGTTCCGAGAGTTCCTGGAGTCTCAGGAGGATTA 

CGATCCTTGCTGGTCCTTGCAGGAGAAGTACAACAGCAGAGCCGCGGCCCTCTTTAGGGA 

TAAGGTGGTCGCTCTGGCCGAAGGCAGAGAGTGGTCTTTGGAGTCATCACCTGCCCAGAA 

CTGGACCCTACCTNAGCCCANGACGCTGCCGTCCATGGTGCACCGAGTCTCTGGCCAGCC 

GCAGAGTGTGACCGCCTCCTNGGACAAGGCTTTTGAAGACTGGCTGAATGATGACCTCGG 

CTCCTATCAAGGGGCCCAGGGGAATCGCTACGTGGGGTTTGGGAACACGCCACCGCCTCA 

NAAGAAAGAAGATGACTTTCTCAACAACGCCATGTCCTCCCTGTACTCGGGCTGGA 

Sequence 3 

NCCACGCGTCCGGGACGCGGGCGCCAGGTGCACAGCCCCAGTCCGCTGCGGGCGGGCGTC 

GACATCTGCCGCGTGAGCGCGAGCTGGAGCTACACCGCTTTCGTGACCCGTGGAGGCCGC 

TTGGAGCTGTCGGGCTCAGCCAGCGGCGCGGCGGGCCGCTGCAAGGACGCGTGGGCCTCG 

GAGGGGCTCCTCGCGGTGCTGCGCGCCGGGCCGGGGCCGGAGGCGTTACTGCAGGTCTGG 

GCGGCCGAATCGGCGCTGCGTGGGGAGCCATTGTGGGCCCAGAATGGTGGTGCCCGAGGC

CGAAGGGGGAAGGACGATCCGGCCGGGTGAAGGCCCAAGCTTGGGGAGGCTACCCCTTGC 

TTGCCCTGCGGCCCCGTGCCCCTACGTGAGCCCCCGCGGGCCGCCCTTCTACCGGCCTTT 

TGGCTTCCGGGAGCTTGCGGGGCAACGCCAACTTGGGAGCTGGGGCCGCCAGCAACCCCG 

TTGCTTGTTGGACGCTGTGCCCAAAGGTGGTTTCTGGGGGGCGGGG 

Sequence 4 

AGTCNCCACGCGTCCGGGAATTGANGCCGCGGGGCGGGCGGCGGCGGCGGCTGGGCGGCG 

GCCGGGACCCAGCGGGCCAGGTGGGGACGGCGCGGAGCGGGTGCGGGAGATGCCGTGCGG 

GACTGGGGCCACCTGAGCCGCCCGCCTCGTCCCCGCCTTCTGTGGGAAGGATGTGCGCGC 

GGATGGCCGGTCGCACAACAGCGGCCCCTCGGGGGCCCTACGGCCCCTGGCTCTGCCTCC 

TGGTGGCCCTCGCCCTGGGACCGTCGTGAGAGTGGACTGTGGCCAGGCTCCCCTGGACCC 

TGTCTACCTGCCGGCAGCCCTGGAGCTCCTAGACGCCCCTGAACACT 

Sequence 5 

TCNCCACGCGTCCGACTGTATGTATTCTGGATACAGGGGATACTGGGCTCGCTATGTGTG 

TGGAGCCATCCCTTCCTTGCCCCAGCCCCACCTCCCTCTCAAACCCTCTCTGGCTCTTTC 

TGAGCTTCCTTTCCTGCTCCCCAGCTTGCCCAGTGCTCAGTGCCCCACTTGGCTCTTTTG 

CTACTTCGGGTCAGGTGGAGCCTCTTGGGAATGTGAAGTGCCTTACAGAAAGATTGCACT 

TCAAGAGGAGAGGCTGCAGGGAGCCATCCTAAACCCAGAGGCCTGGAGCTTACCCGTGTC

ACTTTACTTTTGTACACAAGGGGGTCTCCTTAGTGCCCTCGAGAAGGGATTCTTGGGCeC 

TGAGCTTCTACTCCTGAGGCCACCTTCTGTGCAAGCCCCAAGCTCCCTCAACTCTAGGCT 

TGGAGTCCTCAGTGGGGAAAAGCCCTGNTTTGGGG 

Sequence 6 

CGTCCGCGTCCTCGCTGCGGAAAGTTGGGGCAACCTGTTGCTAGTCTGGTCGTTGGTGAC 
AGCGAGGCTTCCGCGCTCGCfGCTGGTGAGCAGCCCCGGCGTGCCCCGCGGGCTGGAAGA 
GGCGGCGGCGTGATGCGGCCCGTGGACGCGCCCCGCGCCGGCCCGCGAGAACCTGGCCTC 
CCTGGAGCGCGAGCGCGCCCGGGCGCACTGGCGGGCCCGCAGGAAGCTGCTGGAGATCCA 



GAGCCTGCTCGACGCCATCAAGAGTGAGGTGGAGGCAGAGGAGCGGGGCGCCCGGGCCCC 

AGCACCCCGCCCCGCGTGCGGAGGCTGAGGAGNCGGGTGGCTCGGCTGTGCGCCCGAAGC 

AGAGAGGAAGGCTGCGGAAGCGGCGCGGATGGGGCAGGCGGGATCGTGGGAGCTGCACNN 

ACCGGATCGCCGGCTTGCGAGTGCTGCTGAGCCGGCGAGGCCCNCGGGTCTGGAAGCGGA 

ANCGCGCGGG 

Sequence 7



NTTCGGGAGTCGACCACGCGTCCGCGCGCTGGAGGAGTGGAGCAAGCACCCGGCCCGGCC 

CTGGGGGCTGACAGTCGGCAAAGTTTGGCCCGAAGAGGAAGTGGTCTCAAACCCCGGCAG 

GTGGCGACCAGGCCAGACCAGGGGCGCTCGCTGCCTGCGGGCGGGCTGTAGGCGAGGGCG 

CGCCCCAGTGCCGAGACCCGGGGCTTCAGGAGCCGGCCCCGGGAGAGAAGAGTGCGGCGG 

CGGACGGAGAAAACAACTCCAAAGTTGGCGAAAGGCACCGCCCCTACTCCCGGGCTTGCC 

GCCGCCTCCCCGCCCCCAGCCCTGGCATCCAGAGTACGGGTCGAGCCCGGGCCATGGAGC 

CCCCCTGGGGAAGGCGGCACCAGGGGAGCCTTGGGCGCCCNGGGCTTCGGCCGCGACCCC 

ATTTGGGGTAGACCACMGAMGCTTCGGGACCCTTTCGGCACCTTTGGACAGCCAAGAA

TGGCTGNTGGGCACCCTTTCTTCT 

Sequence 8 

CCCCGCGTCCGGAGCACGCAAAGGGAATAAATTGTAATTAGGTGGTGGGTGGCTAAAAAT 

GACAATGCAAAGGTGTTGGATTAAAAAAAAAATCTGGTAGTAGAGGGAAATTATGGAGGA

I 1 I I I AAAAAGGTTMTGATAATATCCATCTACTTATGTMCTTTTTTTGGAGATACCTG 

ATAATAGTGTAGAGTGCATTGGAGAGGAAMGTAGGAGTTGTAAAGACCATTTTGGATAA 

ACTTTGAAGCAAGGGATMTGGCCTCMCCMGGTAGTGGTGCTGAAGATTGTTTACATA 

MTAAGCAGATACAMTAGAAGGGATTmCMGTGGCATTGTMCTGCACTTTTCAAAG 

GTTATTTGCCAMMTCAAATTAACGGTATCTTCAAAAATCATGTTTGATGGATGTATCA 

TCAAGGGCTTTCTTAAAATTTTGTTGAAAGCCAAGGAA 

Sequence 9 

CGCCTTCCCGGGAAGTTTGGAGGGCCCCGNAGGGGAAGCCCCCGCGNCTTCNGGGGGCCN 

GNCGGGCTTGGAAGGCMNCCCCACCCCMGTTTCCCCGCCNANGGANTNCAATGAANCT 

TGACCGGGGCCCCCCGGMCCCCNCGCTNGNCTTNTTNGGGGGTGGTTCCTTGGGTTCCG 

GTGGGGGGGAACCCCAAGTGCTTTTTCAAGGCCCCGGCCGGGCGCGGGGCCCCGGAAAGG 

GCCTTTCAAGTTCTTNCCTTTCCCCGGNTTGAAAGAAGGNAAAAGGCCGGGAANGGAAAC 

CNGGGNAAAACCCGCNGGGNNGGGGCGGCCTTCNNCCGCCNGGGCGCCCCTTGCCNGGGG 

GGGGGNAAAGGGGNCAMGTTTTCCCNGGGGCCCCCGGGGCCCCGGCNGNCCCTTTNAAN 

TCAAGGGGGCCGGGNNCGGNCTTTCCCCAANNCGGCCAAGTTCCTTCAAAGGGGCCCCCC 

CGGGNTTTGGGCCCGGNCCGGGNCNGNMCCTTGGGGMGGAAMAATTCAAAMGTTTT 

GTTGGCCCGGTTTCCGGGTNGGAATTGCCCCCCNAAAATTTGNAAGCCGGGGGGGGGGNN 

NCCTTGGGGGCCCCTTCTTTGCCCCCNTTTAAAGGGAANGGGCNAAAACCTTTNCCCAAC 

CGNCCAANNGCCCCCGTNAAAAAANGGGCCGGCCTTNNTTTTGNCCGGGGCCCCNAANAA

GGGCCTTTTCGNTTTTTTCGGGTTTTTTCCCCCGG 

TTTTTGGCTTTTAAAGGGGGG 

Sequence 10 

NCGCGTCCGCCGCATTGTGGCCAAGTGCCATGAGGAGCAGCTGGATCATTCTGTCCAGTC 

ATATATTAAGTTCGTGTTCAAGACCAGGGCATGCAAGGAGAGGACTGTACATGAGGAACT 

GNCTAMAATGTGACTGGTCTTTTGAAATCAAATGACTCAACAACAGTAAAGCATGTCCT 

AAAGCATTCCTGGTTCTTCTTTGCMTTATCCTAAAATCGATGGCACAGCACTTGATTGA 

CACAAATAAMTCCAGCTTCCCCGGCCTCAGAGATTTCCTGAATCTTACCAAAATGAATT 

GGACMTCTTGNCATGGTCCTATCCGACCATGTGATTTGGGAAATACAAGGATGCACTTG 

MGAAACANGAAGGGCAMCCACAGCGTTGCCAGATTTCTCAAGCGCTGCTTTAC 

Sequence 11 

CGCCNCGCGTCCGGCTTCCTAGAAGAGCACAGTCCCTTAAAGCACCCTCTATTGCTACAA 

TTAAAAGTCTAGCAGATTGTAACTTTAGTTACACAAGTTCTAGAGATGCTTTTGGCTATG 

CTACACTGAAAAGACTACAGCAACAAAGAATGCATCCATCCTTATCTCACTCTGAAGCTT 
TGGCATCTCCAGCAAAAGATGTGCTATTTACTGATACCATCACCATGAAGGCCAACAGTT

TTGAGTCCAGATTAACACCAAGCAGGTTCATGAAAGCCTTAAGTTATGCATCATTAGATA 

AAGAAGATTTATTGAGTCCTATTAATCAAAATACCCTGCAACCGATCTTCCTCAGTGCGG 

GCCATGGNGTCCAGTGCCACATNGGGGGGGTCAGAATGATTACATTGGGCTTGCTCTCCC 

GGNGGATATAMTGATATATTTCANGGTMGGGTATTTCTTATTTTTAGACAAAAAACAT 

CCCCNCATGATGATCCAGGNGCCAGAGCATTTGCCCTGAATGCAGGAGGGCTTTCATNTG 

GNACTGGGNGGGCTTTGNAAAAAATTTTTT 

Sequence 12 

TTCGGGAGTCGACCACGCGTCCGCCAAGTCCTGCGATGATGGACTCAACACCTTCCGCGA 

CGAGGGCCGGGTTCTGCGGCGCCTGCCAAACCGCATACCCAGCCTGCGGATGCTCCGGAG 

CTTCTTCACCGACGGGTCCTTGGATAGCTGGGGCACCTCTGAAGATGCTGACGCTCCTTC 

TAAGCGACACTCAACCTCTGACCTCTCAGATGCGACCTTCAGCGATATCAGGAGAGAAGG 

CTGGTTGTATTATAAGCAGATTCTCACCAAGAAGGGGAAGGCTGAGGACCGGGATGACAT 

GCTGGGCTGGATCAGAGCGATCCGGGAGAACAGCAGGGCCGAGGGCGAGGACCCCCGGCT 

GTGCCAACCAAGCTCTTGATCAGCAAGAAGCTTAATGATTATCGCAAAGTGAGCCATAGC 

TCTGGGCCCAAAGCTTGATTTCTTCCC 

Sequence 13 

GTATTAATGTTCTCAGGCATGAAGCAGAATTTTACGGGATCACTCCATTAGTAAGAAGGC 

TTCTCTTATGTGMGMTTGGAGCGTTCCTCTTGTGGCAGTGTCCTTTTTCATGGTTACT 

TGCCCCCACCAGGTATTCCTAGTNCGTAAAATAAACAACACAGTCAGATCTGCTGATTCT 

AGGAATGGTCTAAATTCTACAGAAGGTGAAGCCCGGGGAAATGGTACACAGCCTGTTCTC 

TCTGGAACGGGAGAAGAAACTGTTAGGCTAGGATTTCCTGTGGATCCACNAAAGGTGCTA

ATAGTAGCTGGCCATCACAACTGGATTGTAGCTGCATATGCCCATTTTGCTGTGTGTTAC 

AGAATCAAAGAATNTTNANGATGGCAGCAAGTGNTTACGAGCCCATATTTGGATTGGACT 

ATCGAACGAGTAGNTTTAAATGCAAAGGTGGATGGAGGGCCACATGGAGACAAAAGACAA 

AAATG 

Sequence 14 

GCCNCGCGTCCGAAAAAATTAAGAGAAGGCCTGGCGGCCGGTCTGAAGTCATCTATAATT 

ATGTACAACGCCCCTTCATCCAGATGTCATGGGAAAAGGAAGAAGGGAAGAGTCGCCATG 

TGGATTTCTAGTGTGTTCGAAGCAAATCCCTCACGAATCTGGTAGCTGCTGGAGATGATG 

TCTTGGAGGACCAGGAGATATTAATGCATCACCCACCCCAAGTGGATGAACTTGACCGGC 

TAAATGCCCCACTTTCTCAGATGGCTTCTAACGACTTTCAGGATTAGGGCCAGCTGTGGG 

TCTACTCCTTGTTGGAGCCCATCTCACCTGGGATGCCTGCAGCCAGCCCTCCCTCGTGAT 

TTGTCTCACCTTGAGTAGGAGACATGCTTCTCCCCTAACCTTTTCCTTTCTGCCATAATT

AACATATGTCCTTTTCAGTAAGTCCATGCCTCTGGCAGGGGATGAAAGAAGTACTCACTG 

GGTAATTAGCTACCATCTTTGCAGCACCCTGGTMCTTGAAAAATTT 

Sequence 15 

TCGGGAGTCGACCCCGCGTCCGCCGAGCGGGGCGGCGCGGCTGGCGGGGCCGGCGGCCGG 

CTGAAGCGAGAGCGCGACGCGACGCGACCGCGGCTTCCCGAGCTGCGCCTGGCCGNCCAG 

CGCCGCGGNCCGCCCGAGGCCTGGAGGGGTCCGGGCCGCCGTCCATGGTCGCGGCGTCCT 

GAGGCGGGGGACGCGCCCGGCGCCCCCGGCCCTCCTNCGCCTCCTCCCGCGGGGCGGGCG 

GCCTCCTCCGGCGCCTNCCCGCGCCCGCCCGCCGNTCGCCGCCGCCTCCCTCCCTCCTTC 

CCTGCGGCTCCCCCGGCTTTCGGAGCCCGGGGGCGGCCTGTGGCGCGCGGAGCCCGCGCC 

GGACTGCGCCTNTTTGGACCTTGAGGGGAAACATGCGTTTGCCNTGGATCGTTTGAAATT 

CT GAGTT T GGGAT CCCCGNCCGGCCGNCTGGCTTTTTCGCCCGCGGG I 1 1 I I ICCTTTTT 

TCCTTTTGC I I I 1 1 1 I I CCTTCTT 

Sequence 16 

NGCCCCCGCGTCCGTTTTMTTATTTTGTTNGAGCCTGCANAGTAANGTTNTTAAAAATA 
TAACGTTCATACGCATTTTMTTAACTTTGAAAGTTCATATGCATCAGAAAATTTATGAA 
AATTTGAATGAAAAAAATTTCATCTATTTATTTTTCTAATTTTAATGGCAAATTTACACT 
ATTATGGCTGATAATTCTGTGAACTTACCTTCTTGTTGACTGATTCTTTTTCCCTTAATC 
CCAGCTTTAAGGAGATAGGTGAAGTTATTGTACAAAGTTAAGTGATACCATAAAGTATAT

ATTATAAAGTCATACATGGCTTTTGGACAGTNTTATATTTCAGTTGCAGTGCTGCATTCC 

ATTAMTTTCATAAMTGCTAGGGAAMTGTGTTTGATAAAATTTThJTGCAGTGAGAAAT 

GACAGACTGAGTGCCTGACAATTTAAGCCACATATGAMGTATGCAAGTAAAGANTTCAG 

GTCCTTAATGTCATCTATATCATGGTATAAAAG 

Sequence 17 

CCACGCGTCCGGACGAGACGAGCCCACTAGTGTCCCCCGAGCGGGCCCAACCCCCGGACT 

ACACCTTCCCGTCGGGCTCGGGCGCTCACTTTCCGCAGGTGCCCGGGGGCGCGGTCCGAG 

TGGCTGGCGGCGGCCGGCTCGGGCCCCTNTCCGCCGGGCTCGCCGGGCCACGACCGCTGA 

GCGGCAGCCACTGTTGGATCGGGCCCGGGGCGCGGNGGCCCAGGGCCAGACCCAAACCGT 

GGCGGCGCAGGCCCAGGCTCTGGCCGTTCANGCCGNGGCGGCAGTCCACGCCGATCAGGC 

CCACCGNGAGCGGAACGAG 

Sequence 18 

GGGAGTCGACCNCGCGTCCGGGCGGTGGGTGTCCGCTTCTCTCTGCTCTTCGACTGCACC 

GCACTCGCGCGTGACCCTGACTCCCCCTAGTCAGCTCAGCGGTGCTGCCATGGCGTGGCG 

GCGGCGCGAAGCCGGCGTCGGGGCTCGCGGCGTGTTGGCTCTGGCGTTGCTCGCCCTGGC 

CCTGTGCGTGCCCGGGGCCCGGGGCCGGGCTCTCGAGTGGTTCTCGGCCGTGGTAAACAT 

CGAGTACGTGGACCCGCAGACCAACCTGACGGTGTGGAGCGTCTCGGAGAGTGGCCGCTT 

CGGCGACAGCTCGCCCAAGGAGGGCGCGCATGGCTTGGTGGGGCGTCCCGTGGGCGCCCG 

GCGGAGACCTCGAGGGCTGCGCGCCCGACACGCGCTTTTTTCGTGCCCGAGCCCCGCGGC 

CGAGGGGCCCGCGCCCTTGGGTCGCCCTGGTGGCTCGTGGGGGCTTGCACCTTTAAGGAC 

AAGGTGCTTGTGGCGGGCGCCGAGG 

Sequence 19 

NATGTNGNNCNAAAAAGGCCNGCNTTANAGGCCAGGAAACNCGTAAAAAGGGCNCGCGTT

GCTGTGCGTCT7TTCCATAGGCTCGCGNCCCCCTGACCNAGTCATCATCAAAAAATCCGA 

CNGCTCAAGTCATGAGGTTGGCCGAAAACTCCGACAGGGACTTNTAANAGNATACCCANG 

GGCGNTTTCCCCCTGGGAAGGCTCCCTTCGTGGCGCNTCTCNNTGTTTCCAGACCCCTGC 

CCCGCTTTACNCGGNATTACCCTNGTCCCCGCCNTTTTCTTCCCTTTCGNGGAAAGCGGT 

NGGGCGCCTTCTCNTCAATTAGGCTTCACCGCCTGNTAANGGTATTCTCAAGTTNCGGNT

GTANGGGTGCCGTTTTCGCTTCCAAAGNCTGGGGCCTTNTGTGCCACCGGAAACCCCCCC 

Sequence 20 

TTCGGGAGTCGACCNCGCGTCCGCCTGGAGCCGCCAGAGTTTCCGCACCCGGGAGGGAGA 

TGCGGCCGGGGCTCAGGCTCCTTGCAGTTGTAATTTAGATTCGAGAAGTGGTTTATCCTT 

TGACTGGAAAAGAAAAGTAGCTGCAGTATTCCCCCAGCACTTGCTGAGAGCATGCCGTAT 

GCCAGGCTGTGAGGCTCGAGAGACAAGCAGTGGAAGAGTTGCGGCCTGT7TCATCTCTGG 

ATTGTAAATCTGAGCCTCCTTCTGGCCCCTGGAAGGGGACAGCATCACGATGGAATGATT 

CCTAACCAGCATAATGCTGGAGCCGGGAGCCACCAACCTGCAGTTTTCAGAATGGCCGTG 

TTGGACACTGATTTGGATCACATTCTTCCATCTtCTGTTCTTCCTCCATTCTGGGCTAAG 

TTAGTAGTGGGATCGG7TGCCATTGTGTGTTTTGCACGCAGCTATGATGGAGACTTTGTC 

TTTGATGACTCAGAAGCTATTGTTAACAATAAGG 

Sequence 21 

CGACCACGCGTCCGGCAGCCGCGGGGCGGGCGGCGGCGGCGGCGGCGGCGGCCGGGACCC 

AGCGGGCCAGGTGGGGACGGCGCGGAGCGGGTGCGGGAGATGCCGTGCGGGACTGGGGCC 

ACCTGAGCCGCCCGCCTCGTCCCCGCCTTCTGTGGGAAGGATGTGCGCGCGGATGGCCGG 

TCGCACAACAGCGGCCCCTCGGGGGCCCTACGGCCCCTGGCTCTGCCTCCTGGTGGCCCT 

CGCCCTGGACGTCGTGAGAGTGGACTGTGGCCAGGCTCCCCTGGACCCTGTCTACCTGCC 

GGCAGCCCTGGAGCTCCTAGACGCCCCTGAACACTTCCGTGTGCAGCAGGTGGGCCACTA 

CCCACCTGCCAACTCCTCTCTGAGCTCCCGATCTGAGACCTTTCTGCTCCTACAGCCCTG 

GCCCAAGGCCCAGCCACTTCTTCGGGCCTTCTACCCAA 

Sequence 22 
CGCGTCCGCCCGGTGCCTCCGCCCATGGAACGCGCGGAGTGGCGCCGCCGCGGCTACGCG

CCGCTGCTCTATCTGCAGTCACACTGCGACGTGCCAGCGGACCGGGACCGCTACGTGCGC 

GAGCTCATGCGCCACATCCCGGTAGACTCCTACGGGAAATGCCTGCAGAATCGGGAGCTG 

CCTACCGCGCGGCTACAGGACACAGCCACGGCCACCACCGAGGATCCAGAGCTCTTGGCT 

TTCTTGTCCCGCTATAAGTTCCACTTGGCCCTGGAAAATGCCATCTGTAACGACTACATG 

ACAGAAAAACTGTGGCGTCCCATGCACCTGGGCCGCTGTGCCCGTGTACCGCGGTTCTCC 

CTCTGTGAGGGACTGGATGCCGAACAATCACTCCGTCATCCTGATTGATGATTTTGAGTC 

TCCTCAGAAGCTGGCAGAGTTTATTGACTTTCTGGACAAGAATGATGAGGAGTATATGAA 

ATACCTGGCATACAAGCAACCT 

Sequence 23 

CGCGTCCGGCTGGGCGAATNAGGGATTCCGGTTCACAATGGATGCTGATAAAGAGAAAGA 

TTTGCAGAMTTTCTTAAAAATGTGGATGAAATCTCCAATTTAATTCAGGAGATGAATTC 

TGATGACCCAGTTGTGCAACAGAAAGCTGTCCTGGAGACAGAAAAGAGACTACTGCTTAT 

GGAGGAAGACCAGGAGGAGGATGAATGCAGGACCACCTTGAACAAGACTATGATCAGTCC 

TCCACAAACTGCTCTGAAGAGTGCAGAAGAAATAAACTCAGAGGCCTTCTTGGCATCTGT 

GGAGAAGGATGCAAAGGAACGAGCCAAGAGAAGAAGGGAAAACAAAGTCTTGGCGGATGC 

CCTAAAAGAAAAAGGGAATGAAGCATTTGCTGAAGGCAATTATGAAACAGCTATCCTGCG 

CTACAGTGAGGGGTTTGGAGAAGCTGAAGGACATGAAAGTGCTGTACACCAACCGAGCCC 

AGGCTTATATGAAACTTGAGGA 

Sequence 24 

GGGAGTCGACCNCGCGTCCGCTCCCTCTGAGTTGCGCTGGGCTTGGCTGCTGCACCATGA 

CCCTGGAGGCGATCCGCTACTCGCGGGGCTCCCTGCAGATCCTAGACCAGCTGCTGCTGC 

CCAAGCAGAGCCGCTACGAGGCGGTGGGCTCGGTGCACCAGGCCTGGGAGGCCATCCGCG 

CCATGAAGGTGCGGGGCGCCCCGGCCATAGCCCTGGTGGGCTGTCTCAGCCTCGCCGTGG 

AGCTGCAGGCGGGCGCCGGGGGACCGGGACTCGCCGCGCTCGTGGCCTTCGTGCGCGACA 

AGCTGAGCTTCCTCGTCACCGCCCGGCCCACCGCTGTCAACATGGCCCGCGCCGCCCGCG 

ACCTGGCTTGATGTTGCAGCCCGGGAGGCCGAACGGGAGGGGCGCTACGGAAGAGGCCGG 

TCCGGGAGAGAGTGATCTGCTGCACCGAGGACATGCTGGAGAAAGACCTCAGAGACAACC 
GAAGCATTG 

Sequence 25 

GGAGTCGACCNCGCGTCCGGGATAACGAAGCTGCTACCATGATGATGGCTGATCTCATGT 

TCAGAAAACAAGACTATGAACAAGCAGTGTTTCATTTACAGCAGCTTTTAGAACGTAAGC 

CAGACTCCTCGAGTTCAGGGATCACACCATATTCCCAATATCAGACAAAATGCCACACAC 

ATGGATGTGGCAACATAGATGTTTATTGGTTGAATGGATCAGTGAATGACTGTAAAACAC 

CAAGTCAATTAAAAACACAGCAGGAGAATCGCTTGAACCTGGGAGGTGGAGGTTGCCGTG 

AGCCAAGATCACACCACTGCACTCCAGCCTCGGTGACAGAGTGAGACTTGGTCTCAAAAA 

CAACCACAAMTTTTAAATAATTATATGACATTATCTCGTTTGATTGATCTCCTAAGAAG 

ATGTGGAAAACTCGA 

Sequence 26 

ACCGTCCGGGGCCATCCAGGAGAGCCTCCTACCAGCACAGAAGGCCTGTGCCCCAGCGCC

CTGAGCGAGACAAGCCGTTTTGATAATGACTTGCAGCTAGCCATGGAGCTCTCTGCCAAA 

GAGCTGGAGGAATGGGAGCTCCGGCTCCAGGAGGAAGAGGCTGAGCTCCAGCAAGTCTTA 

CAGCTGTCACTCACTGACAAATAGACCTTTCAGCCTGTGAGCCTCTGCACAAAGCAGAGG 

CTGTGGGCTGTCACAGATGCTGTGTCAACCAGGGCCXJTAGGGCTAAGGGCCTGCACCTTG 

CGTGCATGCAGCAGGCAACAACTGCCCCTTCTTTATGCAGAGGTGCAGAACCAGGGACTC 

CTGGGCCCATCCAGGCTGCTTCCTTGGGGTGG 

Sequence 27 

NCCNCGCGTCCGGCCGGCGATGCCGCGCCCCCGGGCCGGGCTGTAGCGGGGCCGCGGCTG 
GACGTGTGCGCCGGGCAGGCGGGACATGGAGGTGGTGGACGAGACGGAGGCGCTGCAGCG 
CTTCTTCGAAGGCCACGACATCAACGGTGCCCTGGAGCCCTCCAACATAGACACCAGCAT 
CCTGGAGGAGTACATCAGCAAGGAGGATGCCTCCGACCTCACACTGCCGGACTCTCCCCC 
AGACTCGGGCTCCGAGGCCTACTCCCCCCAGCAGGTGAATGAGCCCCACCTNCTGCGCAC

GATAACCCCTGAGACACTGTGCCACGTGGGGAGTGCCCTTC 

Sequence 28 

CGCGTCCGCNAGGGAGGGCCGAGCGAGGCGCAGGCAACCGGGCAGCAGGCATGATGGCCT 

CGCCTAGTGACTCCAGCCGCTCGCTGACCAGCCGGCCCAGCACCAGGGGCCTTACCCACC 

TNCGCCTNCACCGACCCTGGCTGCAGGCCCTGCTTACGCTGGGGCTGGTCCAAGTGCTCC 

TGGGCATACTGGTGGTCACCTTCAGCATGGTGGCCTCTTCCGTCACCACCACCGAGAGCA 

TNAAGAGGTCCTGCCCGTCTTGGG 

Sequence 29 

ACGCGTCCGATTTTAGTNGCAAGGAGTCCATGTGTTCAACTCCAGCATTTCCTGTGTCTC 

CAGAGACACCGTATGTGAAAACAGCGCTGCGCCATCCTCCGTTCAGCCCACCTGAGCCCC 

CGCTGAGCAGCCCAGCCAGTCAGCACAAAGGAGGACGTGAACCACGAAGCTGCCCTGAGA 

CGCTCACTCACGCTGTGGGGATGTCAGAGAGCCCCATCGGACeCAAATCCACGATGCTCC 

GGGCTGATGCGTCCTCGACGCCCTCCTTTCAGCAGGCTTTTGCTTCTTCCTGCACCATTT 

CCAGCAACGGCCCTGGGCAGAGGAGAGAGAGCTCCTTCTTCTGCAGAACGCCAGTGGGTG 

GAGAGCAGNCCCAAGCCCATGGGTTTCCCTGCTGG 

Sequence 30 

CCGGTTNCTTGTNNGTTNATTGATTTAAAATAGAATATCAATTGAATTTAGAAAATTCTC 

AAAAGCCAGTTTAATGCTGTTCATCTTTTAAGGCCAAAAAAAGTTTAATCCAGAGGCAGT 

CTTTCATTCTGCACTMTTTATAATTTAGATCAAAGAACTAATTATATATCTCAAATTTA 

ATAATAAAAAGGTATAGTAATGAGAATTAAATTTATGGTAAATTATATAACTCAGAATGT 

TAAAGTAACTTGGAMTTCTTMTCTMGTTAAGTATCTTTTTATTTCTTACTTGTCCTG 

TTrGATTTATTAAGGGAMAGMMTTTTAAGGAGTTGCCAGTATTTCTTTGTCTATTGA 

AAGTGGAATGTTTATTCACCCTTATTTATATACTTAAAAGACATTGTATTGGCCTGGTCT 

CGAACTCCTGACCTCAAAGGTGGATCCACCCACCTCGGCTTCCCAAAGTGCTGGGATTA 

Sequence 31 

GCCGGTGATTTTGAACAATTCTGAAATATTTCAGGTAAGATTAATAACATCCAATTACAA

ATATATGTTTCAATATTTTATACGTATGTCTACTTTGAAAGTTAAACCAATAGTATAGAA 

AGCCTAAGAATGAACACTGATTGGACATACTCACAGAAATTAAGGGAAAAACACATATTG 

TAAAATTCCTGTCMTGTTTGAGTAGMTACAGAAGTACATAGCAGTCTTCAATTTTTAA 

ACACAATTATGGGCTTATAACTGGACGTGACATGCATCATTTATTAGAACAATATTATTT 

ATTTATACTAAGTAAGGATATAAGATCACAGAAGCTTAGTGTTATAACGGAGACTTCACA 

GACATTCATAACTAATGTTTTCTAAGGCAAATAAGGGGCATAAACCAGAACTCATGGGTC 

AGTGCCAGAGGTAAGTATAAAAAGGTTATGTATGAAAGACATTTATTTATAGGAGAATTT 

CTGAGGGATTCTATGCCTTTTCAACTTA 

Sequence 32 

NCGTCCGGGAAACTGGTTCNGATGGTGTCTGCCCAGGAGCGCCTGACACGCACCTTCACA 

CGCAGCAGCCACACCTACACCCGCACGGAGCGCACGGAGATCAGCAAGACGCGGGGCGGG 

GAGACAAAGCGCGAGGTGCGGGTGGAGGAGTCCACCCAGGTCGGCGGGGACCCCTTCCCT 

GCTGTGTTTGGGGACTTCCTGGGCCGGGAGCGCCTGGGATCCTTCGGCAGCATCACCCGG 

CAGCAGGAGGGTGAGGCCAGCTCTCAGGACATGACTGCACAGGTGACCAGCCCATCGGGC 

AAGGTGGAAGCCGCAGAGATCGTNGAGGGCGAGGACAGCGCCTACAGCGTGCGCTTTGTG 

CCCCAGGAAATGGGGCCCCATACGGTCGCTGTCAAGTACCGTGGCCNGCACGTGCCCGGC 

AGCCCCTTTCAGTTTACTGTGGGGCCGCTNGGTNGAAAGGTGGTGCCCACAAGGTGCGGG 

CCCGGAGGCAC 

Sequence 33 

CCGCGTCCGCAGGAAATTGTTAAAAATAATTTGGGGGTGTTTATTGGGGAAGGAAACAGG 

GCCTTGACAGTGGAGGACTTGGAAGACATGTAATTTAAGATATAGAGTATGATTGTTGGA 

AAATAAGCATGGAGATCCAGAAGGAATCTTAAGAGTTTTTCTATGCAAGTGAAGATGGAA

GAAAATATGTATTTTACAAMGATAAATTACAAGTACCTTATTTGCTTTGCAAAATAACT

TATCATGTTCTTCCACTATTTTTATTATATTTTMTTTTAATGAAACTTATATAACATTT 
ACACTAMTTTTAMTACATGGCTCMGACAAAAAMTGGGGAAMTATTTTTTAAAAAA
TCACCCCAAATCCTGGTACTCAGACATAACCACTATTCAAACTTGGCAGAGTAATATTTT 
TCCTGTCTGCATGTGTGCCACNATGTGTGCATGCATACCACAAAGTAGATGTTTTACTAT 
ATCTCCTGGGTTATTATCTGC I I I I I ICCC 
Sequence 34 

ACGCGTCCGGGACAAGAGAAACATAAGCNGGAAAAAAGGAGGAGGAATAAACACACGCCT 

GTCCATAATAAAACTCGCTCTTGAAGACTCAGCGGCAGCCCTGCACCGGAGACTGACGAC 

TTGCGCGGCTGTGACCTCCGCCCTGCAGCGGACCCTCGACTGCCCTGCACTGCGGCTCTG 

GAGGCCCCGACTCAGTGCATGGGAAAGAAATCCTCACTATCAGAAAACAGAGGGGCAATC 

TGCTGCTCTCCCTTTCCGGCCAAACACGTCACCCATCAACCGGATACCTACCAAGAGGCT 

TTCAGAGGAGGCGCCCAAGGTCTCCCAGGCCCGCCCCTCCCCAATCACGCTCCGCTCAGC 

CCCCTCMCTTTTGGCCTCCGGGAAGTTCGCAGCGTNTCTCACGCTTGGCAGGAAGTTCC 

CGCCMGGCTTTCGGAAAATCCTTTAAAAAGCAACGCTTGCGCTGGGCGGGGCTTTGGTG 

Sequence 35 

CCCCGCGTCCGGTAGATTGCTTGTGGCTGGCAGTGAAGATGGTGGAGTTCAACTTTTTGA 

TATAAGTGGGAGGGCTCCCCTCAGGCAGTTTGAAGGCCATACAAAGTAAGAGACAGTTGG 

TTTCTGTGTGTTCTGGTTTTATTTTGTTGTAAGCT C I Mil l I CTCTGGACTTTGGTTAA 

AAAGATAGAGATCAGTTTTATGGAGATTATTTGCCTATAGGTACTATATTTCCTGATTGT 

TCTAAGAGTGCTTAACTTGGGTTCCGTGGTCCAGTTTCATGGGGCTTATGAATTCCCTAG 

AATTGTATGTGATATTTTAGGAAATACACGTTTATCTAGGGAGCTACTCTGTAGCTTTTG 

GTTAACTTTAGTGGGGTCTGTGGCCCAGCTGAGATTATGAATTACTGACCTGAAGACAAC 

CTTACAGCTGGTAATGACAGCTCTATAGGCCTGTACTGTCTTAGAGGCTCTTATGTTGAA 

GTCAAGTANGAAGGTGGATTTTCTTCTTGAATTATAGTGTTTTGCCCCTTAATAA

Sequence 36 

CNCGCGTCCGCGGACGCGTGGGGGCGAGGGCCGCTGGGGCCGCGAAGTGGGGCGGCCGGG 
TGQGGTACGAGCCGGGTCTGGGCTGAGGGGCGCGGCTTCGCGGTGGACCCCAGCCCGGCA 
ACGGGAAGGCGAGCTCTCCTCCACCGTCCAAAGTAAACTTTGCCGCTCCTTCCGCGGCGC 
TCCCGAGTCCTCGCCGCCGCCGGGCCGCCGCAGTCCGCGAAGAGCCGTCCTGCGTCAGGG 
CCTCCTTCCCTGCCCCGGCGCGGGGCCACTGCGCCATGGACGCCACAGCACTGGAGCGGG
ACGCTGTGCAGTTCGCCCGTCTGGCGGTTCAGCGCGACCACGAAGGCCGCTACTCCGAGG 
CGGTGTTTTATTACAAGGAAGCTGCACAAGCCTTAATTTATGCTGAGATGGCAGGATCAA 
GCCTAGAAAATATTCAAGAAAAA
Sequence 37 

GCGTCCGCGGACGCGTGGGCCGCCGCCGCCGCCGCCGCCGCGATGTGACCTTCAGGGCCG 

CCAGGACGGGATGACCNGGAGCCTCCGCCCCGCGGNGCCCGNGGCTCGCCTCGGCCTCCC 

GGGCGCTCTGACCGCGCGTTACCCGGCCCGCCATGGCCCCCTTCTCTCTNGCCCGGGCNC 

ACGCTCGACCCTGCGCNCGTATCGCCAGCAAGTCTGCTGCCTGCATGCTGCNTGGCTCGC 

GAGNNGTTGGCGCGCNTTGCNCTGNNGCCTGTTTGCACGAGCNGCCGCCGAGGCCCACTG 

CANGNTTANCTCGACGGTCTCCNAAGNCANACCGCNCCGCCGNCCCTTTCCAAGGNTCNT 

TTCNCGAGGGGANGGGCNCAAANGGCCAGGGGNCCCCACNCNTGGGGAAGAAGGGNGGAG 

TTNGNCNGATGNGNATGAGNAANCCTTGTAGNCAAGANCCCGTCCCNAAGNGGNAGNGNC 

AGCCCNGGGGAAGGGTTGGTTTTCNNANGNTAAACCGNAACCCCCCCGGAAGTNAACCGG 

NGATTTTMTTTATTTTTACCCCNCANANGNAAATAACCTTTNAANAACCATGGGNCATT 

NNAAANCAAAAGNTAAATTNGGGGGNAACTTAACCGNTAAACCANCCCAAAA 

Sequence 38 

CNCGCGTCCGGCGGGTCCCGCCGGCGGGTACCTGGGCACTGCGCCCCATCTGGACTGAAA 

TGGGGACACCCCTTCGGGGGTCCCAGGCTCCTGGCCGTATTGTTCTCCTTCTCCTCGTGA 

TAACTCCGCAGTGGAGGTGGATTCCGTCCAAGACGCCCAACGTGGCTCCGCGTAGCAATC 

AGCGCTGCAATCCTGGCGGTTACCTCAGCGGCGGCGTCTCTCTCTGCGCCTCACACTCGC 

AGCCCGCGGCCCTCCCCAACTTAGGGCGnTACAAAAGAAACTACTCCAGACGCGCTGCA 
AAGGGAGGCGCATGTGCCCGAAAGCTGGCGATCAGACGGGGGGGGGCATTCTGCATGTGT

GATGTTTCTGGGGGCGGTGGGGAGTGTGTGTCGGGGTCGGGGGGCGGG 

Sequence 39 

NTGGTCTTTCTTCCTAAGAAGGAGATGGAGCCAATTTCTCACAGCTCGTGCCTCAGTACT 

GAGGGTATGGAGGAAAAGGCAGTCAGTCAGTGTCTAAAAATGACGCACGCAAGAGACGCT 

CGGGGMGATGTAGCTGGACCTCTGAGATTTACTTTTCTCAAGGTGGACAAGCTGTAGCC 

ATCGGGCAATTTAAAGATCGAATTACAGGGTCCAACGATCCAGGTAATGCATCTATCACT 

ATCTCGCATATGCAGCCAGCAGACAGTGGAATTTACATCTGCGATGTTAACAACCCCCCA 

GACTTTCTCGGCCAAAACCAAGGCATCCTCAACGTCAGTGTGTTAGTGAAACCTTCTAAG 

CCCCTTTGTAGCGTTCAAGGAAGACCAGAAACTGGCCACACTATTTCCCTTTCCTGTCTC 

TCTGCGCTTGGAACACCTTCCCTGTGTACTACTGGCATAAACTTGAGGGAAGAGACATCG 

TGCCAGTGNAAGAAAACTTCAACCCAACCACCGGGAT 

Sequence 40 

CACCGNNCGCGCTCCTTCTGCCGCCAGGGCGAGGCTGGCACCCGGCCAGCGCGGGCAGGG 

CCACGGGTGCCCGGCTGTTTCCCGGTTGTGGAAGGCGCTCAAGGTGCGCGGCCCGGGGCG 

CGCTACTGGGGGCGCCCTCCGCGGTGGGCMGCGCGCCAGGGATCGGCCTGGGCANGCCG 

CGGGGCGCGCGAANGCTGCGCTTTCCCTACGCCCCCCCTCGCTTCCTCCGGCACGGCGGC 

AACGGAGATTTCCTCTCGGGGAAACTACGCGGATCCTTTTCGGGGATCCTCGCCCCGCCC 

CAATTTCTNCGCCCCCTCCCCT7TGCTGGGGCGCCTGGGCTGGCCCGCGCAGGGGA 

Sequence 41 

CNCGTCCGGTTCCTAACACAGACGAACTCAGCTTCCTTTGCCATGCCTCTGACTCGAGCC 

AGCCTTTCTTTTTATCCTCCG 1 1 1 1 I CTCAGATTCCCTCCACACAGTCTTTTCCCAGGTC 

TAGATCGCTTCCCTCGCCCCAATCTTTCCTTGAACCCCTTTTCCAGTGTCCCAAAGCTGT 

CCACTCTTACGCCTCTTCCAGAAACACAGGCTACCTCCCCCCAATTCCCCAGTGCCACTC 

TGGATTGTAATATCCCACTCAGGAGCTTCTTTCTTGTAATTTCCCTCCCCCCACCCCCCA 

GCGTCCCCGGGTCGTCTGTTTTCTTCTATGMGCAAATATTACTCATCAAATATAGGAAC 

AMGGCCTMGTCCTTTCTGNGCTTTATTTCTTNGGGTGACTGGATCTTAGATCCTATCA 

TTTAAGTAGATGATGGTT 

Sequence 42 

GGTGTCGACCNCGCGTCCGCGCTTCTCNCCTCGGCCCGTGGAGCCGGGGCGTCCGGGCGT 

AGCCCTCGCTCGCCTGGGTCAGGGGGTGCGCGTCGGGGGAGGCAGAAGCCATGGATCCCG 

GGCAGCAGCCGCCGCCTCAACCGGCCCCCCAGGGCCAAGGGCAGGCCGCCTTCGCAGCCC 

CCGCAGGGGCAGGGCCCGCCGTCCGGACCCGGGCAACCGGCACCCGCGGCGACCCAGGGG 

GCGCCGCAGGCACCCCCCGCCGGGCATCAGATCGTGCACGTCCGCGGGGACTCGGAGACC 

GACCTGGAGGCGCTCTTCAACCGCCGTCATGAACCCCAAGACGGCCAACGTGCCCCAGAC 

GGTGCCCATGAGGCTCCGGAAGCTGCCCGACTCCTTCTTCAAGCCGCCGGAGCCCAAATC 

CCACTCCCGACAGGCCAGTACTGATGCAGGCACTGNAGGAGCCCTGACTCACAAGCATGT 

TCGAGGCTCATTCCTCTNCAGCTTTCTTGCAGTTGGGAAGC 

Sequence 43 

GTCGACCACGCGTCCGGGAGCTGCGGCGCCCTCCCTTATCGCCTTGGCAACGACCCAGCC 

GCGCCGCGAGGAGAACCGGGMTGGAGGTCGTGGCGTGAGGGGCGCCGAGCGAGGGGAGG 

CGCGGGCCACGGGAGTTCCGGGAGTTCCGGCGTTGCCCGGCAGTCCGCAGTCCTCGGCGG 

GAAGGCTGTCCCGGCGCCTCAGGCAGCTCTGCGTGGGCCGGGGTGACTTCCTCGCGATCC 

CCTGCGCGAGGTGAAGGGCAGGGACCTTTGCCGCGCCTTCCACGCGCCGTGCCCCACCGG 

CGAAGTGGGCTCCATCTTCCTTCAAGACTTGTCGCTTCGCGGACAGGGCGCCCGTGGGTT 

CTTCCCGGCCCTTCCGTACCGTCCTCTTTCCAAGCGGGCTTCCGGAAAGCGGGTTCCTTG 

TCTTTCGGACGCATTTTCACCCCGCGCCGGGGAGGAGCTTNCCGGGNAAGGGTTCCACGG 

CGGCCGAGGGTTTTCC 

Sequence 44 

CGTCCGCGAGACTCCCGCGCCCACCACCCCCGGCGGAGCTGCTGCTGAGCCACTCAATCT 
GAGCCCTGGCTACTMTAMGTTCGTTTAAAAATCATAATCATTCTTAAGAGAGCGAAAG 
AGGGTGCGAACTAGCCGCTCGGCCCCGCANGGAGAGCTGGCGCGTCNGGAGGAGACAGCN

GCGGCAGCGGTTGCGCCGCGACCAGGAGGAGCCGGTGGCGCCGGGCGGCGGGTCCGCGGC 

CGGTGGGGGACNGTGAGTAGCGGCTCGCGCTGCGGTGCAGCGGCGCTGCACTCACTCGCC 

CTCTCCAGGGGCTGGGGGTTCGTCCGGCTCCACTGGGGAAGACTCAGCTTCTCCCCGGGG 

TCCCGGGTTGTGCCTTACTCTCCGGAGTGGGCAGGGGTATCGAGGGCAGGGGCCTCCCGG 

CCCGGCTCCCCCATCCCCCGGTTCGGACCGACGAGCCGCCGGCTTCTTCCCTTCCCTGAG 

CACCGATCCCAAGTTCCA 

Sequence 45 

TCACCACGCGTCCGGGCGTCCGGGTACCCGAGGGCTCTCCCGCGTTGCTGGCACCGCTGG 

CGCCGCGGTCTCGTAGCGCATGGGCCTGCTCCGAGGCGGGCTCCCATGCGCTCGGGCCAT 

GGCGCGCCTGGGCGCTGTGCGCTCCCACTACTGCGCCCTGCTGCTGGCCGCGGCGCTGGC 

CGTCTGCGCCTTCTACTACCTCGGCTCAGGCCGGGAGACCTTCTCCAGCGCCACCAAGAG 

GCTGAAGGAGGCCCGCGCCGGGGCTCCCGCCGCGCCCTCGCCGCCCGCGCTGGAGCTAGC 

GCGGGGCTCCGTGGCGCCAGCCCCCGGCGCGAAGGCCAAGAGCTTGGAGGGCGGCGGTGC 

CGGGCCGGTGGACTACCACCTGCTGATGATGTTCACCAAGGCGGAGCACAATGCCGCGCT 

GCAGGCCAAGGCCCGCGTCGCGCTTGCGCTCA 

Sequence 46 

CCACGCGTCCGTGTCGGGCCGGCGCTCCCTTCTCTGCCNGGTGGCGAGTACACCTGCTCA 

CGTAGGCGTCATGAGGTCTCCGGTTCGAGACCTGGCCCGGAACGATGGCGAGGAGAGCAC 

GGACCGCACGCCTCTTCTACCGGGCGCCCCACGGGCCGAAGCCGCTCCAGTGTGCTGCTC 

TGCTCGTTACAACTTAGCAATTTTGGCC I I I I I I GGTTTCTTCATTGTGTATGCATTACG 

TGTGAATCTGAGTGTTGCGTTAGTGGATATGGTAGATTCAAATACAACTTTAGAAGATAA 

TAGAACTTCCAAGGCGTGTCCAGAGCATTCTGCTCCCATAAAAGTTCATCATAATCAAAC 

GGGTAAGAAGTACCAATGGGATGCAGAAACTCAAGGATGGATTCTCGGTTC CI 1 1 1 1 1 1 A 

TGGCTACATCATCACACAGATTCCTGGAGGATATGTTGCCAGCAAAATAGGGGGGAAAAT 

GCTGCTAGGATTTTGGGA 

Sequence 47 

CGCGTCCGCGGACGCGTGGGCGGGGCCGCGGAGCCGGGCCGGGGCATGCGCCGTCTCCGN 

CTCGGGGCCGNCGGGGGCGCCCTGCTGAGCGCTACCCACGTGCGTCCGCGCCACCTCGCG 

GGCGACCCCGCGGCCAAGGCCCCCGGCGGAGCGGNTCCCGGGCGCCCCGAACTAGCCCCC 

MCTTTGGGCGAAGTTTGCCTGCGCCTCTCCCCGCCCCCACGCGGCGCGCCGGGGCCGCG 

GACGGNAGCGGCCCCCGGGGATGCGCCTTCCCGGGGTACCCCTGGCGCGCCCTGCGCTGC 

TGCTGNTGCTGCCGNTGCTCGCGCCGCTGATGGGAACGGGTGCGCCGGCCGAGCTGCGGG 

TCCGNGTGCGGCTGCCGGACGGCCAGGTGACCGANGAGAGCCTGCAGGCGGACAGCGACG 

CGGACAGCATCAAGNCTCGAGCTGCGCAAGCCNGACGGCACCCTCNTNTTCTTNACCGCC 

GACTTTAAGA 

Sequence 48 

GCGTCCGCTGCATTGCGCCCACCGACTCCACTATGTTGAAGAAATTCGACAAGAAGGATG 

AGGAGTCAGGTGGAGGCTCCAACCCATTCCAGCACCTTGAGAAGAGTGCGGTACTCCAGG 

AGGCCCGTGTATTTAATGAAACTCCCATCMCCCTCGGAAATGTGCCCACATCCTCACCA 

AGATTCTTTATCTCATAAACCAGGGGGAGCACCTGGGGACCACGGMGCGACCGAGGCCT 

TCTTTGCCATGACCAAGCTCTTTCAGTCCAATGATCCCACACTCCGTCGGATGTGCTACT 

TGACCATCAAGGAGATGTCTTGCATTGCAGAGGATGTCATCATTGTCACCAGCAGCCTAA 

CAAAAGACATGACTGGGAAAGAAGACAACTACCGGGGCCCGG 

Sequence 49 

ACGCGTCCGCAGAANGCTCTCAGATGGGACAGTCTTTTACTTTCATTCTCACCTCTGTAA 

ATAGCAGGACAGGTTGGGGTGGGCCTGACTTCTATTCTGCTTTCAGGGGGTACTTACTGG 

AAAATCAACTTAGGAAGTGAATTTGAGGGTTGGTGTAATTTTAAGCCCAGCCTCTGATCC 

TTGGTTGCACAAAGCCTAATTTCCAAATATTTCTAACAGATTCAAGACTGTATTGGCAAA 

GAGGTAGAGAGCTATGATAATGACATAAATTACATAAAAATCCAGTTGAATGAATAAGAA 

GGAATTTGGGCGTATAACCCATGGAACACCAATGGTGCTAAGAATTTGCCAAACCCTCAG 
CTTCGTATAAGTCCCAAAGMCTCAGGCTTATAGCACTAAGCAAATTACCAGTGGGAGGA
GGAGGACCTGCCACGGAGCTGGATMTTACATTTAMTATTTTTGCCANTCTGTTGGAGA 
C 

Sequence 50 

NCGCGTCCGGCAGGCGGGTCGTAGAGAGCGTTCANGCCGTCTGTATATCTCCCCAGATAC 

CTGAAACTGACXJACCTGAGTACGTTTTCCCATTGCTGAGCTGTTTCCCTGATATCTGGCC 

ATGCAACGGAGATCAAGAGGGATAAATACTGGACTTATTCTACTCCTTTCTCAAATCTTC 

CATGTTGGGATCAACAATATTCCACCTGTCACCCTAGCAACTTTGGCCCTNAACATCTGG 

TTCTTNTTGAACCCTCANAAGCCACTGTATAGCTCCTGCCTTAGTGTGGAGAAGTGTTAC 

CAGCAAAAAGACTGGCAGCGTTTACTGCTCTCTCCCCTTCACCATGCTGATGATTGGCAT 

TTGTATTTCAATATGGCATCCATGCTCTGGAAAGGAATAAATCTAGAAAGAAGACTGGGG 

AAGTAGAT 

Sequence 51 

GCGTCCGGAAGGTCCTTGAGCCATCTGGATGGCGGGCAGTCTGGCACACTAATGTGTTCA 

AGGTGCTGGTTGAGATCACAGATGTGGACTTTGCAGCCTTGAAGGCAGTGGTGAGGCTTG 

CTGAACCATACCTCTGTGACTCTCAAGTGAGCACTTTTACCATGGAGTGCATGAAGGAGC 

TCCTTGATCTGAAGGAGCATCGGTTGCCCCTGCAGGAGCTGTGGGTGGTGTTTGATGATT 

CAGGA 

Sequence 52 

GTCGACCNCGCGTCCGGAAAAAAAAAATGACCCAGAGATATTAACAACTTGACCTGGTTA 

TACAGTTAGATATTTGCACAGTCTGGACTCAAACTGGAGGCTTCTGACTCCTCATCTAGG 

CTCCTCTCACTCTGCCATTGCATGGGTTTTCTCATATACCTTCTCTCATAAGGTTTTCAC 

AAATTTGTCACCGTCAAATAATTATCAAAATTATTCACACTATTATAGATGAAAATAATG 

TGCTTATAAAGATTAAGTAACTTTCCTGAGGGCGCAGGTATCTGGTTCAGATAACAACTA 

GCCTGGCTTAGAATAAACACATATTTCCTGGTTCTGAAGTTGGTGTCCTTCCTACCACTT 

TCTGCTGTCTCCTAAAGATAAAGAATGTTATTGGCTCACTGAATTAATCCATTCTGTTCC 

TGGCTGAAATAAAAATTGGTATATTCCTTACGTGAAGTGTCAACAGGAAGGGGGCTTTTA 

CAACTTCCTTT 

Sequence 53 

GGAGTCNCCACGCGTCCGCGCGCTGGAGGAGTGGAGCAGCACCCGGCCGGCCCTGGGGGC 

TGACAGTCGGCAAAGTTTGGCCCGAAGAGGAAGTGGTCTCAAACCCCGGCAGGTGGCGAC 

CAGGCCAGACCAGGGGCGCTCGCTGCCTGCGGGCGGGCTGTAGGCGAGGGCGCGCCCCAG 

TGCCGAGACCCGGGGCTTCAGGAGCCGGCCCCGGGAGAGAAGAGTGCGGCGGCGGACGGA 

GAAAACAACTCCAAAGTTGGCGAAAGGCACCGCCCCTACTCCCGGGCTGCCGCCGCCTCC 

CCGCCCCCAGCCCTGGCATCCAGAGTACGGGTCGAGCCCGGGCCATGGAGCCCCCCTGGG 

GAGGCGGCACCAGGGAGCCTGGGCGCCCGGGGCTCCGCCGCGACCCCATCGGGTAGACCA 

CAGAAGCTCCGGGACCCTTCCGGCACCTCTGGACAGCCCAGGATGCTGTTGGCCACCCTC 

CTNCTCTTCTCTTGGAGGGCGCTCTGGCCCATCAGACCGGATT AI 1 1 1 1 I CAA 

Sequence 54 

CNCCCGTCCGGAATNCCCATAGTTAGCTGCTGTGCTTTCACAACTTCTTTCTCTGTAAAT 

TCCTCGCTTGGCNCTGAGAAGGAAAAAAGATGTTCGTAAAGGGCTCAGCGAGGAATTTAC 

AGAGAMGMGTTGTGAMCCACCATAGTTAGTTGCTGTGCTTTGMTTTCTTTTGCTCA 

MTGGCCTCAGCGAMTCTTATTTGCCTATAGCATATCTACAAAAAATTTTCCTAGACCG

TCTTTTCTACAACTGGATGGTAAAGTTGATTGAAGTGTGCCTCATGTAGCTTTATGTTTG 

GGGCATTTGAAGGGCTATGGCTGGACCAGAGTGTAATATAMTGCTTAATAGAGAGGGGA 

AAAGAAGAGTGTAAGAACCATTATAGGGCTGGGCTCACGCCTGTAATCCCAGCATTTTGG 

GAGGCTGAGGCAGGCGGATCACGAGGTCAGGAGTTCNAGACCAGCCTGACCAACATGGTG 

AAACCCCATCTCTCTAAAAATACAAAAAT 

Sequence 55 

GTCGACCNCGCGTCCGCAGCCTTCGGCTGCGGAGGGGGCTCGGCGGCGGCCGGCGGAGAA 
AGTTGCTCCGAGAAGAGGCTGGGTCGAGCTGGGCCGAGCCGGGCGCGCAGGGCGGGCGTC 
GCGGGCGTCCCGGGCGGACGCGGCGCGGAGACTGCCGGCGCGTCCCGGGGGTTCCGATTT

GAAGACCTTGCTTCTCATCACCCACTGGATTATGCCCCAGGCTTTCCTACCCAATGATCC 

TCTTGCAACACGCCGTGCTTCCTCCACCTAAGCAGCCCTCACCCTCGCCTCCTATGTCAG 

TGGCCACCAGGTCTACAGGAACCTTGCAGCTTCCACCACAGAAGCCTTTTGGGCAGGAGG 

CTTCCTTGCCTCTTGCAGGGGAAGAAGAGTTATCGAAGGGAGGGGAGCAAGGACTGTGCC 

CTGGAGGAGCTATGTAAGCCCCTGTACTGCAAACTCTGCAATGTCACCTTGAACTCTGCA 

CAGCAAGCCCAGGCTCATTATCANGGTAAAAATCATGGTAAGAAA 

Sequence 56 

ACCNCGCGTCCGGACCTGTTGGCGACATGGTGGCACCCGTGCTGGAGACTTCTCACGTGT 

TTTGCTGCCCAAACCGGGTGCGGGGAGTCCTGAACTGGAGCTCTGGGCCCAGAGGACTTC 

TGGCCTTTGGCACGTCCTGCTCCGTGGTGCTCTATGACCCCCTGAAAAGGGTTGTTGTTA 

CCAACTTGAATGGTCACACCGCCCGAGTCAATTGCATACAGTGGATTTGTAAACAGGATG 

GCTCCCCTTCTACTGAATTAGTTTCTGGAGGATCTGATAATCAAGTGATTCACTGGGAAA

TAGAGGATAATCAGCTTTTAAAAGCAGTGCATCTTCAAGGCCATGAAGGACCTGTTTATG 

CGGTGCATGCTGTTTACCAGAGGAGGACATCAAGATCCTGCATTATGTACACTGATCGTT 

TCTGCAGCTGCAGATTCTGCTGTTCGACTCTGGTCTAAAAAGGGTCCAGAAAGTAATGTG 

CCTTCAAACTTTAAACTTTGGAA 

Sequence 57 

GTCGACCACGCGTCCGTCTCATTTGTGAAGAGGTCTGTCTTCCTGAGGAAGCAGGGGACC 

CTCACCTGTGAACCAAGTGTGCCATGGGAGCTGCTCCATGTCCAGGTCCAGGTCTCCTGG 

TCTGCAGGGAACGGCACAAGAGGGCTGGCCTAGGCCAGGAGGATGTGATCTGTCCTAGAA 

GGGGGCTGACCTGCTTGCTGACCCCGCTTGCTGCTGCCTGGCTGACCTGACTCAGCCACG 

GCTGTTCCGAGGGCCCTTCTGAGTACGAACTTCCAGTTGGAGGATCTGGGTGAAGACCCA 

GCTGCTTGAGATAGCAGCCTCTGGCTAGGCCCTTGGCGTGGCCAAGCCAATCAGGCAGGT

TTAGAGCCTGGTGCCCCTAGACAGGTCCTGCAACCAAGAACAGGGGGTAGCCTTCAAAGG 

CCAGCCCTGCCTTCCAACCACCGCTCCACAGCGAGGGAAACCAAGGCTCTTAGGGCAGGA 

GGGTTGT 

Sequence 58 

CCNCGCGTCCGGGGAGCAGGGATCAACGGTGGTCCCCGTAAACCTGACAGTAAACCTGAC 

AGAGGCTGCAGGAGTGCATTTCCACCCAGGGTGCACTCAGCGAGTGGAACTCCACACCCG 

TTTCTTTGGAGTCAAGGCGCGACCTCTCAGGGAGGAGACTGCTCCTGGTTGCCCACTGCC 

GGGTCATCCCAGCTTGCAGTGGAACCCTCCGCAGCCTGGCCTCTTCCAGGGTAGCCGCTC 

ACTCCCCTCTCTCTTGTCCTAGGATAAGGCCGAGGAAGGCTGACGAGTTCCAGCTCTGGG 

GATGCCCTATCAGCTGTGTCACCTTGAACAAATCATTTCTCCTCTTGGGTCTCTGTTTCC 

TCCAGTGTGAAACGTGGTGAAGGCATGAGGGGCTATGGGAGCCCCAAGGCCTCTTTCAGA 

GATCTCCTCTGGGTCCCATGTGACCCCGTGGCTATCCCCAAAGGCAAGAGGGTCCCCAGC 

CCTGCACCAAGGCCCTGGG 

Sequence 59 

CCGAGGAAAGGAGTTGGTTCGCGCAGGTGCGGCGCCTGGGTCCCCATGGCGCTGTGGCGC 

GGCTCCGCGTACGCGGGCTTCCTGGCGCTGGCCGTGGGCTGCGTCTTCCTGCTGGAGCCA 

GAGCTGCCAGGCTCGGCGCTGCGCTCTCTCTGGAGCTCGCTGTGTCTGGGGCCCGCGCCT 

GCGCCCCCGGGACCCGTCTCCCCCGAGGGCCGGTTGGCGGCAGCCTGGGACGCGCTTATC 

GTGCGGCCAGTCCGGCGCTGGCGCCGCGTGGCAGTGGGGTGAGTGCCAACGGGGCCTGGG 

TCTNTGAGCCTCCGAGGTCGGCCTTGGAGGTCGGGCGGAGCCGCGCAGAAACAGGGCTTC 

TCAGAGGNCCCCGGGAGGCGCTTGCTGTCCGCGCTGGCCCC 

Sequence 60 

CGCGTCCGGTGGGAAGCCAGAAGATAAAACCAAATGGCTGGGCACGTCTTTAGGTTATTC 

CTAGCTMGAGTTAAGAGTTGTAAGCTCTCTCATTCTTTGTTCTTCAGCCTTAAACTATC 

TTTCCTTCTATTAACTTTATTTGTCTCAGTTACAATGATAGAGGTAACTTCACATACTAA 

AAGMATTAGGTTACCATGTGAMCATTCTTCTTGGCTTGTGCTAATGTTATCAGATCCA

AACAGCATCTGAMGAAAATTTTCCAAGTACGATGTTGTTCTCTTGTTTTCTGAAATACA 
TATCATATGTTMAGTGAGAGTTTTTATACATGTTGAAAGAAGTTGAATGACATAACAAA

TAGTTACTGAGGCCTCCATTTTCTTACTTCACAGTTAAMTTCCTGTTTCTCTTTGGGTA 

TAGGAGGGTAGAAAGAAGTGGGAGAGTAATAGCATTTTAAAACACAGAATCAAAAATCAT 

ATTAAAAGTAG 

Sequence 61 

CGTCCGGAGCATCGCGCACTGCGGCCGGGTCACCGACGTGGGCATCCGCTACGTGGCCAA 

GTACTGCAGCAATGCTGCGCTACCTCAACGCGAGGGGCTGCGAGGGCATCACGGACCACG 

GTGTGGAGTACCTCGCCAAGAACTGCACCAAACTCAAATCCCTGGATATCGGCAAATGCC 

CTTTGGTATCCGACACGGGCCTGGAGTGCCTGGCCCTGAACTGCTTCAACCTCAAGCGGC 

TCAGCCTCAAGTCCTGCGAGAGCATCACCGGCCAGGGCTTGCAGATCGTGGCCGCCAACT 

GCTTtGACCTCCAGACGCTGAATGTCCAGGACTGCGAGGTCTCCGTGGAGGCCCTGCGCT 

TTGTCAAACGCCACTGCAAGCGCTGCGTCATCGAGCACACCAACCCGGCTTTCTTCTGAA 

GGGACAGAGTTCATCCGGCGTTGTATTCACACAMCCTGMCMAGCAMTTTTTTTAAA 

AGCAGCGTATGTAAAGCACCGACACCCACTCAAAACAAGCTCTTTCTTTCNGGAAGGGTA 

TTAAGGAAT 

Sequence 62 

NCCACGCGTCCGCCAGNCTGTGAAGGATCCCAGACTGGCATATGCAGGAGGAAATGGGGC 

GGGCGAGGAGTAAGGACCCCAAAAAGCAGGGGTAGGGAAGGGCCCTCCCAGCGCCCCACT 

GTAATAGGGGCCTCATCAATGCCCCATGCTCACTGAATAAAGCACTGCCAGCGAAAGGTG 

AAAAGAGGAACAAAGAACATTCTCCTGGACGCCACCCACAGAAAGCCACGTGCAGGCTTG 

GCCCTCACCTTGGGGACCTTGGACACGGAGCTGGTTATGTCACATCTGGCTCTCAGAGCT 

GGGGCAGCGTCTAGGAGGCCTGATGTAGAAAGCACTCAGCTAAGCCCTAGTTACCGGCAC 

ACGGGCACCAGCGCCCCCTCTCAGCAAACTTNCACGTCTTATGAAATTAGCACTGGATTT 

CCACTTCAATTGGA 

Sequence 63 

CCCACTGTAATAGGGGCCTCATCAATGCCCCATGCTCACTGAATAAAGCACTGCCAGCGA 

AAGGTGAAAAGAGGAACAAAGAACATTNTCCTGGACGCCACCCACAGAAAGCCACNTGCA 

GGCTTGGCCCTCACCTTGGGGACCTTGGACACGGAGCTGGTTATGTCACATCTGGCTCTC 

AGAGCTGGGGCAGCTGTCTAGGAGGCCTGATGTAGAAAGCACTCAGCTAAGCCCTATTTA 

CCGGCACACGGGCACCAGCGCCCCCTNTCAGCAAACTCCACGTNTTATGAAATTAGCACT 

GGATTTCCACTTCAATTGGA 

Sequence 64 

NCGCGTCCGCTTCATCTTAGGATAAAGTCTAAATCTTTGTTTTTTGCTATTGTACTAAAC 

TCATAMTCCTAGGTTATAMGATAAAGCCTTAAACTTTATCTCATCATCCAGCCCAATT 

TCCAGCCACMTGMGTACTTAAMCTCTGTGTCTTTGTACTTGCTGTTCTCTTGGCCTC

CAATTCCTTTTCATCTTTTCCATTCGGTAAAGTTTGTTTATCCCACAGGCCCTATCT^ 

AAGCCTCCAGCAACTTCTCCAGACAGAGGTGTTAGCAGTGTAGGATCAGATTTCTCAACC 

ACGTCACTCCCATGTCTGGGTAGATATCTCTGCCCAAGTGTTCTCATAGCACTTGAGCAG 

TACTCTCTAAGCGCCCAGGATCTACCATGTTGC I I I I I I AAAATTTG ATTAATTTATTTT 

TTTATACTGCTCCTTGTGGAGCANGGAGTGTTCCCAGAGTAGCCCACCATGTTATATTGA 

ATGGATCTGTGTGCATMTGCAGCTGTCCATCTACATCGTATATTTTTGTCTCCTCAAGG 

GTAGGGA 

Sequence 65 

GTTTGTATTCTGGATACAGGGGATACTGGGGCTCGCTATGTGTGTGGAGCCATCCCTTCC 

TTGCCCCAGCCCCACCTCCCTCTCAMCCCTCTCTGGCTCTTTCTGAGCTTCCTTTCCTG 

CTCCCCAGCTTGCCCAGTGCTCAGTGCCCCACTTGGCTCTTTTGCTACTTCGGGTCAGGT 

GGAGCCTCTTGGGAATGTGAAGTGCCTTACAGAAAGATTGCACTTCAAGAGGAGAGGCTG 

CAGGGAGCCATCCTAMCCCAGAGGCCTGGAGCTTACCGTGTCACTTTACTTTTGTACAC 

AGGGGTCTGCTTAGTGCCCTCGAGAAGGATTCTTGGCCCTGAGCTTCTACTCCTGAGGCC 

ACCTCTGTGCAGCCCCAGCTCCCTCAACTCTAGGCTGTAGTCTCAGTGGGAAAGCCTGGC 

TTGGGGGTCTCCTAGGAATGTCCACCTGAAGGCACACTTGATAGGGGC7TGCACAACTTA 






TGTCTGCCMGGCCACCTGAGGMCTCCTGGTGCCTATMGTTCCACCTTCCCTTCCTTT 
NCT 

Sequence 66 

CGTTCGGCAGCAGCAGCTGGCGGCCAACCGGCACAGCCTGGTGGAGAAGCTGGGGGAGCT 

GGTGGCGGGTGCCCACGCACTGGGTGAGGGCCAGTTCCCCTTCCCCACTGCTCTGTCGGC 

CACCCCCACGGAGGAGACGCCCACCCCACCCCCAGCCGMCACCAGCGACCCCCCGGCCGA 

AGACATGCTGGTGGCCATCCGGCGTGGGGTCCGGCTCCGCAGGACCGTCACCAACGACAG 

GTCGGCGCCCCGCATCTTATGATGGCGCCACCCTCCCCATCCTCTCAGGCCCCAGTGCGA 

GCAGGTGGCCTGGTCTGTGAGCCGCAGGCACTCAGAGCAAAGGCCCAGCCAGGAGAGAGG 

ACAGAGCCAGGGCAGAGGCCATGCCACTTTATGGAAAGACACCTCACTTGGATTCCAGCA 

TTTAAACAGGAAGTGACTTCTTAGCAAGCCTGGCCAGGACGGAGCCTGCAGGCCTGGGCC 

TGGTTCGGGGTCTGTTTTATGCTCTTCGGTCCCTTCTCTTCCTTCTCTGGGGCCCTGCCT 

CTTCCTACCCATAAAGCACCAAAACCAGGGCCGCTGCCATGACAGAGGGGCCAGGCTGGC 

CTTCCTTTCACATCCCGGCCTNTTCCAAGGCTGGTCCTGCCTNACTTCTTCTGGAATGTG 

GGCCCCCTCTCCCTTGCTGACCCCTCTTCCTN^TGCTTI^^r^TTGC■^^TCCANGCCCT 

Sequence 67 

CGCGTCCGGCAACCTAAAAATAGGATGCACCAATAGCATGTGGTTCCAAGTAAGTTGTGA

I 1 1 I I ATTTTGAGACAGTGTTCCACTGGAAGGGAGGGAAGGGCTTACATTCACAGACAGT 

MAGCAGGGCTGTGTAAGGAGCTACATTTACTTAACAGTCTCCTTTCCAGCTAGGTTTGT 

TTTATTTATTTGCMGTCAGCTMGMTTMCCTTTTAAACCATCTAAAACAGGCAAGCA 

ATATAAAGATTTCTACTAGTGCAAGGTAAGTGGTTTGAATATACAAGTGCCCTTTCCTGC 

CACCCAGTCTCACTACCGTTTTAGTCCTGCAGCTGGGTAAAGCCACTATTGTGTGGAAAC 

TCTCCCATGTGCCCTGTCTCTACCTCTGGACACACCAGCTCCTTCTTCCACCTATTCTAT 

TCCTCAGTTAAGCCAAGTGATCAQAAGTAGTATTAAATGGGTAGATAATTTTA 

Sequence 68 

GCCACGCGTCCGGCGCCGGGTGCGCCAACTACGCAAAGACCAAGCGGGCTCCGCGCGGAC 

CGGCCGCGGGGCTAGGGACCCGGCTTTGGCCTTCAGGCTCCCTAGCAGCGGGGAAAAGGA 

ATTGCTGCCCGGAGTTTCTGCGGAGGTGGAGGGAGATCAGGAAACGGCTTCTTCCTCACT 

TCGCCGCCTGGTGAGTGTCGGGGAGATTGGCAAACGCCTAGGAAAGGACTGGGGAAAATA 

GCCCTGGGAAAGTGGAGAAGGTGATCAGGAGGCCGGTCCACTACGGCAGTTTATCTGTCT 

GATCAGAGCCAGACGCGACGCGTCCACTTCGCAGTTCTTTCCAGGTGTGGGGACCGCAGG 

ACAGACGGCCGATCCCGCCGCCTCCGTACCAGCACTCCCAGGGAGAGTCAGCCTCGCTCC 

CCAACGTCGAGGGCGCTCTGGCCACGAAAAGTTCCTGTCACTGTGATTCTCMTTCCTGC 

NTGGG I I I I I I I 

Sequence 69 

ACCCANACCTGGGAGGAATTAATGGAATGCTTGNCCCTGGGCAGCCTTAGAAACAGACCC 

NAGCTTATCTAANGCTGCTCCGAGGCAGTGACCCAACTANGGCTCAGGAAGTCAAGAANA 

TTGACCAAGCTTATAGTGATCACCTCTTGACCTTTGTGTCACAGTCNTTTTGCTTTTTAA

MCCCTTTTGTGMCCGNTTATGGCCTTTGATTCTGACAGGCATCNTAGTTGTGAAGGGG 

AACANGGGCAGGATATAATGTTTCGTTTACCAAATACAANAAAATCNGANGTACCCAGNT 

AGATCACAANATTTTTTGGGAGAAGGNCTNTTGGGTCTCTTCCAGGAGNTCACTTCANNN 

TTGGNAACTTGACAGGGGCTTGGGGAATTTANTATTCCCCTTGGGCGCAGGGNCNCAAAN 

GGGTGGCANTTTCCCTCCTTGGAGN I I 1 1 I I I I I CAAGAANTCCTTGCNTNGGGGAAAGA 

TGGTTACNANNATCCCGGAATTTCCMCCCCCTTCCCTATTTTTTTGGTTTAAGG 
Sequence 70 

CGCGTCCGGCACATTAAAAAAAAATACTTATTTTTTATTATGGAAAGGTCTTGGAACATT 

CTGATAGTGAGCTTCCGGCATTCATTTGCTGTATCTGGCTTAGGAGATGCTAGGGTGGCA 

AGAAGAGGCACAGGCTTAGATGCGCTGGGTGGAGAGTTGGCTTTAGTAATGATGGTTGAC 

TCTAACGACTTGATTATCAGCTGTGCCTTTTTTCTTCCTGCCTTCTGAGGTGTGTTGCCT

GCATCCTAATTCACGTACTGAGTAGCAAGCTAAGCAGGTTGTAGCTGGAGATTGTAAGAA 

ATCCTGAATGGAAACCAAAGAAAGACTGTCACATCACATGATGTGCCCTTTTCAATCCCA 
TGTCTCTTCCCAGTGGCATCCCAGTGCTGTCTCTGCCCCCTGCTGCTTCTGTAAAGATTT
TCTGACACAAGTMCTGCCTCATAGACCTTCCTTTTTATGAMTCCTGAGTTTTGGTTTG 
GGTACGTCCTTTTTAGAT 
Sequence 71 

GCGCGGCTGTCGCGAGGGCGGGGGTCGGGGCTGCAGGCGGGGCAGGGCTGGGTGGGGGCG 
CGCGACGCACCTGCCTGCTTCCTGCACGGGTGGNCCCCMGGACTGCGGGGCCCCAGCCC 
AAAGCGGACCTTGA 
Sequence 72 

CGCCCCGCGTCCGGGCGGCTGGTGGGCGACCGGGCGCATCCTCATTGCATGTGCGGCGGC 

CCTACCTCGGCCCTGGGCTGACCCCGGCGGCCCTGCCCGCCCCTCCCTCCAGCATCATGG 

CCAGCCCAAGAACCAGGAAGGTTCTTAAAGAAGTCAGGGTGCAGGATGAGMCAACGTTT 

GTTTTGAGTGTGGCGCGTTCAATCCTCAGTGGGTCAGTGTGACCTACGGCATCTGGATCT 

GCCTGGAGTGCTCGGGGAGACACCGCGGGCTTGGGGTTCACCTCAGCTTTGTGCGCTCTG 

TTACTATGGACAAGTGGAAGGACATTGAGCTTGAGAAGATGAAAGCTGGTGGGAATGCTA 

AGTTCCGAGAGTTCCTGGAGTCTCAGGAGGATTACCGATCCTTGCTGGTCCTTTGCAGGG 

AGAAGTA 

Sequence 73

GCCCCG CNTCCGGAAATGTCCGAI 1 1 1 I I I 1 1 AATTTAATGAAATTGTTAATGAGGAAAA 

ATTTTTAATATAGGTCTTATCTACCACACATCCCCATAGATTTAAGGATTTTAATAGAAA 

GTCATGATGTATGTATTTAAGCCACGTTAAAAGAAAAAATATAACTATGGACCGGTATTC 

AGTGMTACAGTTTCATGGTTTTTAATTCTTTCAAAGCACATTAAAAATGGTGTGCTGAT 

AMCCCCAAGTAAATTAACCCTTTTTCCGTATAAATCCATTTTTTGTTTTGAAGAGGGGA 

MTTATATTTATTGNTGTTTACTGAATCCTGGTGTGAAAGCATATCAGATATGTATGAAC 

TGCTACTGCTGTACTTCCGATTTACGGACATCATTTTATTGCTATTTGTAGACCGTGATA

ACATGAACATGAGTCCTATTTATGTGGGCCTTCAGTGGATGGGCAGTGCCACTCANGTCT 

CTGGGGGTTTCCTCTCTTAATTTTAAAGTAA 

Sequence 74 

AGTCGCCNCGCGTCCGTGTGTTTCTCCTCGGTCCCCAACTCTACCTTCCCCAACCACAGT 

TCCTGTCCCCAGATGTCCTGATGCCCACCATGGCAGGGGAGCCCAATAGACTCCCAGGAA 

CTTCAAGGAGTGTCCAGCAGTTTCTGGCTATGTGTGACAGGGGTGAAACTTCCCAAGGGG 

CCAAGTACACAGGAAGGACTTTGAACTACCAGAGCCTCCCCCATCGCTCCAGAACAGACA 

ACTCCTGGGCACCCTGGTCAGAGACCAACCAGCATATTGGGACCAGATTCCTGACTACTC

CAGGGTGCAATCCTCAACTAACCTACACTGCCACACTACCAGAAAGAAGCAAGGGCCTTC 

AGGTTCCTCACACTCAGTCCTGGAGTGGATCTTTTTCATTCACCCTNCCACCCTNCCATT 

GNTCATCCTGTGTACCCACCATCTAAGCAGTCTTCATGTACCCCTGAGGTCAAGCTTGGA 

A 

Sequence 75 

CCCGCGTCCGGGCTGGCATGGCTCTATATMGATTGTTGCANAAANTCCCTACTACTTTT 

GGTCTGTGATGAGCTTAATTATGCAATCTATATNGGCACAGGATGAAAACCTCTCAAAAA 

CAATGTTTCTGCCCCTTGCTGAGAGAATGGTCGAAAAAATGGTGAAAGAGGACAAGATAG 

AAGCTGAGGCTGAAGTTGAACTTTATTATATGATCCTGGAACGTTTGGGAAAGTACCAGG 

AGGCCTTGGATGTCATCAGAGGGAAATTAGGAGAGAAGTTGACAAGTGAGATTCACAGTC 

GGGAAAATAMTGCATGGCTATNTACAANAAGCTGAGCAGGTGGCCAGAGTGCAATGCCC 

TTTNCCGGCGCCTCTTACT 

Sequence 76 

GNTGGAGGGAGCTTTGCTACTCTGCTCTTGGCATGACTCCAGGATTTTTTTCTGGAATCC 

AACCTCTGTCCTCTTAGGAGAAGGAACCTGTCCTTGGTTCAGATGGGTGGGCATGAGGAG 

GAAAATTTCCATTAGTGTAGAAAAGTGCTGGACAGAATCCGGTTTGGAAAATTACAAATC 

CAGTTGGTCAAAATAGGCCATTTCCTATGTGTGACCTATTCGTGGTATGCCAACTGGACT
GCTTCCTAAACAGGACGAGGAAAGTGAGGAATATTTTTATATGAAAGCCTTAGCCTGTCT 
GGCACCCATGMAAAMCTATTTATGCACTCCTACTTTCACCCGTCTTTTTGCATTCTCT 
ATTTGTAGCACAACAGAGTTGAATGCCACAAAACACCCCGTTTATAGTGAGCTGTTTTCA
GTGACCAATATCAGAAGGAGGCTTGCTTCTGGACTAGCCTACTAATTGCCAGCAGCCACC 
ATTTTTCATG 
Sequence 77 

GGAGAGTGCCTTGCCGGACCCTCAGGACGGAGCTGCTGGGCTGCTACAGTGACCAGGACT 

TTCTGGCCAAGCTGCACTGtGTGCGGCAGGCCTTCGAGGGGCTTCTGGAAGACAAGAGTA 

ACCAGCTTTTCTTCGGGAAAGTGGGCCGACAGATGGTGACAGGCCTGATGACCAAGGCTG 

AGAAGAGCCCCAAAGGCTTCCTGGAGAGCTACGAGGAGATGCTGAGCTATGCCCTGCGGC 

CCGAGACCTGGGCCACAACACGGCTGGAGCTGGAGGGCCGAGGGGTGGTATGCATGAGCT 

TCTTCGACATCGTGCTGGACTTCATCCTCATGGACGCCTTCGAGGACCTGGAGAACCCTC 

CGGCCTCGGTGCTTGCCGTCCTGCGGAACCGCTGGCTGTCANACAGCTTCAAGGAGACGG 

CCTTGGCCACTGCTTGCTGGTCGGTCCTGAAAG 

Sequence 78 

CACGCGTCCGGAGAACGTGATTTCTCAGCCGAATGAGTTTGAACATACCCCACAGGAAGA 
TGACTTGGGGTTCAAGGAAGAAGATTTGGCTCCAGATCATGAAGTAGGAAATGCCTCTCT
CAAACCTGAAGGCATCCAGAACTGGGATGACTTATGGGTCCAGAGAGAGGGTCTAGGAAA 
GCCTCAGCCTCGGGACAGAGGCCCCCGGCTCCTGGGTGAACCACGCTGGGGCCAGGCTAG 
TAGTGATCGGGCCGCTGTGTGTGGTGAGTGTGGCAAAAGCTTCAGGCAGATGTCAGATCT 
GGTGAAACACCAGCGGACCCACACAGGGGAGAAACCCTACAAGTGTGGGGTCTGTGGCAA 
GGGCTTTGGGGATAGCTCTGC 
Sequence 79 

CGCGTCCGCAAGMGATMCCCCAAACTCTTTTCTCAGAGAGTTTGTAGCCTAGTTTGGG 

ATAGATAAGATCCACATATTTAGTCATATAAGACTACAGGAGAGTAGAATAGATGCACCA 

GATGGTGTCGAATGAAAGTGGTACTTTGTAGACTATAAGTGCTGTAAATTCTAAAGGACA

GGTTACnTTTGCCTGGAGTGGTCMGAAAGATTTTATTTAAAATAAGGATTTGACGGGCA 

GACTTAGCAGTCAAAAGGGAGAAAGCGGGTAAAACAAATGTAAGCCATCATAAGAGTGCA 

TGTGGTTTGGAAGCATCAGGGAAAAGACTAGCCAACCTGAAGTAAAAGGTTCGTGCAAAT 

TGGGCAATCAATAGCATTAAGTTGGAAACAGCTTGGGGACAGACACATAAGAGGGCCAGA 

GTGTGMTMTTTCATCTMTACTTTATAGCACTTGGACATTTACAGAGCACTTTTCTC 

Sequence 80 

TNCTCTCTGCCCCCCCACATTCGTCCTCTTGATTCTCTGCTTCTCTAGCTCAGCCGCTGA 

GCTTCGTGCCTAGCCGCCACTAGTCCTTGACCAGCGTTCTGGCAACTCTTGCCTCCAAGT 

TCTTCCAGCTCCAGGCTGAGCCGATGGGGATTCAGTTTTCTGACATCACAGCTNAGTTCT 

TGATTTCTGCAGCAAAACCTTCAAGGCTTCCATCACCTCGGCTCTAGAGTCCATCATGCT 

CCTCTCCTATGACCTGCTAGGGCTGATCAAACGTTCTGTCTCCTGCCGTGCCCTGCCCTG 

CCCTGCCCTGAGCTTCGCTTAGCCTGTTGCAGGCTTTGTGTTTTCCTTCTTGTGCTGTTG 

GACCACGCAGCTCCTTCTACCCATAAAACCCCCTTCTCTAGGTCGGTGGAAATCTTGGTC 

ATCCTTCCGTGTCTAGNTAAACTGGNCACCTCCTCCACGAAAGCCTTCTAAAAACTCCTT 

CTCAGGGGAACTGGTTTTCTTTCT 

Sequence 81 

CACGCGTCCGCAAAATAGCCCCACATCCNGGCAAAAGGGGCCTTTCCCTTGGCCCAGAAG 

AAAAAGGAACAAGTGGAGTGCAGAAGAAAATCTGTACTGAGAGACTTGGGCCTAGCTTGT 

CTTCCAGTGAGCCAACCAAGGCTGGTGCTGTCCCATCCAGTCCCTCGACGCCAGCACCAC 

CCAGCGCCAAACTTGCCGAGGACTCAGCTCTGCAGGGTGTGCCCTCTCTGGTGGCAGGTG 

GAAGTCCACAGACTCTTCAGCCGGTATCCAGCAGTCACGTGGCTAAAGCTCCCAGTCTGA 

CCTTCGCTTCCCCCGCCAGTCCTGTCTGCGCATCAGACAGCACTCTCCATGGGTTAGAGA 

GCAACTCTCCCCTTTCACCACTGTCCGCTAATTATAGCTCACCTTTATGGGCTGCAGAGC 

ACCTCTGCCGCAGCCCAGATATCTTTTCAGAGCAGCGGCAGAGCAAACATAGGCGCTTTC 

AGAATACCCTAGTAGTCCTACATAAAATCTGGGTTGCTGGAGATCACTTTTGAAAACCAA 

G 

Sequence 82 



ACGCGTCCGCACCCCTGTGTCCAATGACATGTGCACCCAGGTCGCCAAGCGGCCTGTGGA

CACCCAGGCCTGTAACCAGCAGCTGTGTGTGGAGTGGGCCTTCTCCAGCTGGGGCCAGTG 

CAATGGGCCTTGCATCGGGCCTCACCTAGCTGTGCAACACAGACAAGTCTTCTGCCAGAC 

ACGGGATGGCATCACCTTACCATCAGAGCAGTGCAGTGCTCTTCCGAGGCCTGTGAGCAC 

CCAGMCTGCTGGTCAGAGGCCTGCAGTGTACACTGGAGAGTCAGCCTGTGGACCCTGTG 

CACAGCTACCTGTGGCAACTACGGCTTCCAGTCCCGGCGTGTGGAGTGTGTGCATGCCCG 

CACCAACAAGGCAGTGCCTGAGCACCTGTGCTCCTGGGGGCCCCGGCCT 

Sequence 83 

CCCCGCGTCCGCTCTTACGCATTACTCTATGTCTACTGTTATGGGTGTGTAATTTTATAC 

CATAGATGTTTACTCTTTAAACAGACACTTCTAGTCTGTTTTATTTCATGTGTCTGGGAG 

CGGATAAAGTGTGAGGTTCAGGGAGAAAGAGAGGTCTGTCTCAATGCCTTGGCACGGCAT 

GAAGACAATCTCCCCTCCTTGTCCCCTTTCCCTGCTAGCTCCTGATGACTGACAGATTCA 

CAGCAGAACAGAAAGGACTGGGAAGGGATGGAGGTGGGACATCTGGCACTGACCTTCAGG 

GGCTGACCCTGTGGGGGAACATCTGCCCTGAAGAGTTGGAGCCTTCATGTGATGACACAG 

AGCTGMGTGTGATATTCGGGAGGGGATAGAGAGTGCTTGGAGGTTTTCTGATTTTGAAG 

AATCCCAAGTCAGTC 

Sequence 84 

GTCCGGCCGCTTCCGGTCTCCCTCCCGGGCCGGCGCTGGCCTGACTGCGGCCCCGGTCCG 

TAGCACTCCGCCCTCCGCTTCTCCCGCCCTGTAGCCGCGAAGACTGCTTCAGCCTTTCCC 

TGTGCTGCCCCTGCCGCGCGATGGAGACGAGCTCGAGCTGCGAGAGTCTTGGCTCCCAGC 

CGGCGGCGGCTCGGCCGCCCAGCGTGGACTCCTTGTCCAGTTAATGTGTTAAGAGCCATT 

GACATTTGAAGATCATCAGAAGTGAAGATAAAACATCTCAAAAATTATAATTGCCTCCAC 

TTCTCATTCAGAGAATTCAGTGCATACAAAATCAGCTTCTGTTGTATCATCAGATTCCAT 

TTCMCTTCTGCCGACMCTTTTCTCCTGATTTGAGGCCCATGCAGTCCAGTTCGGGAGC 
TAAGT 

Sequence 85 

CCGCGTCCGCGTGAGGTGTGGGTGTTCGTTTCTCAGGTAAAACATGGCTAAAAGCTTACG 

GAGTAAGGTGGAAAAGAAAGATGCGTGCTGAAAAGAGAAAAAAGAATGCCCCAAAGGAGG 

CCAGCAGGCTTAAAAGTATTCTCAAACTAGACGGTGATGTTTTAATGAAAGATGTTCAAG 

AGATAGCAACTGTGGTGGTACCCAAACCCAAACATTGCCAAGAGAAAATGCAATGTGAGG 

TAAAAGATGAAAAAGATGACATGAAAATGGAGACTGATATTAAGAGAACAAAAAGACTCT 

TNTAGACCAGCATGGACNGTCCCAATTTGGNTGAACCCAAAGGCAAANAAAAANGNTTTG 

ANGGCAAACCGANNGAAAAAAAAANGGGGAAAACCAACCNAAANCCCTTAAAANGGGCCA 

ANGGGGTTTGGCCNCTGNNAATNNTTTNAACCCNTTTGAAAACCCCCCCTGGNNGANACC 

NCCCGTAAAAAATNTTCCCCCCNN I I I I I I I I 1 1 

Sequence 86 

CCACCGCCGTCCGAGGAGGGATCACCAAGCCGTGGGCCATGAAAGTCGGGGGGGGGCACC 
GCAAGCTTGAAAGCTTCATCATTGACCTTTCNCAAGAAATTACCGGGGCCAAGGCCGCTT 
GTTTCNAAAACCCCCATGGAAACNAAAACGANGGGGAGGCANGGGAGAACCACCCCCTTG
GACI I I 1 1 I I INATTTCTTCTTGGACCTACCGAAGGAGGGCACAAATTGCCGGGAAGATT 
GCTTGCCTTTCCACCTTGGGACAAGGGGATCCTGGGACTTTTCCGGCCCGGGGTTCCCNT 
TCCCGTTGGCCCGGCAAGGGATTGGGTCNAAACAATTGACCCAAGGGAAGAATCCCCGGG 
GACGGTCACMCCGGGGNACCAAAGGNAAAGCTTTTTTGGGANGGGAACCC I I 1 1 1 I I I A 
ATT 

Sequence 87 

CCGGGTCCCTCCCTGCGGAGCCGCTGGTCCGGCTGGCGGAGATGTGACCGCGGGCCCGGC 

CGGCCTGCCTCAGGCGTCGCGTCAGCTCCCGTGTCCGTGCCCTTAACCCACACCGATGGC 

GGGATCCGGCTGCGCCTGGGGCGCGGAGCCGCCGCGTTTTCTGGAGGCCTTCGGGCGGCT 

GTGGCAGGTACAGAACCCGCTGGGTAGCGGCTCCTTCCNCTTCGGGGTATTCGGGTTTCG 

CTTGTTGGGGMCCCTTGGTTGGCCCCCGGGGCCCTTAAANCAGTTNTTGCCGCCAGAAA 

CCACCGGGGTTGCGGCCTTTGCCGCCNAGTATGNTTTCCGAAAAAAAAGGGCCGGGCTTG 
NACAAGTTGCANGGGTCACAAAAACATCGTGAATTTTGATNGGAGTGGTTACAATCCAC
Sequence 88 

CGTCCGTTTAATTATAACCTAGATTGTCTGGGCAACGGCAGGAACGGAGTGCCACTGTGG 

AGCAGATAACTGCAGTGGTTTTCTAGGAGTGCGGCCAAAGTCGGCATGTGCGTCAACAAA 

TGAAGAGAAGGCAAAAAATGCTAAG7TAAAACAGAAGAGACGAAAGATCAAAACAGAACC 

AAAGCAGATGCATGAAGATTACTGTTTTCAATGTGGAGATGGTGGAGAGCTGGTCATGTG 

TGACAMAAAGACTGTCCCAAAGCATACCACCTCCTATGCCTTAACCTGACTCAGCCACC 

ATATGGAAAGTGGGAGTGTCCGTGGCATCAGTGCGATGAGTGCAGCAGTGCAGCTGTTTC 

CTTCTGTGAATTCTGTCCACATTCATTTTGTAAAGATCATGAAAAGGGGGCCCTGGTTCC 

CTCTGCACTGGAAGGCCCGCCTCTGCTGCTCGGAACATGACCCCATGGCTCCTGTGTCAC 

CAGAATACTGGAGCAAGATAAAATGTAA 

Sequence 89 

NGTCGCCCCGCGTCCGTAAMTGTTAAGTCCCCTAAAAGTGAATAAATTTTAAATACCTA 

CTTTTAAAMTACTGTCTTCTAMTTGACATMTTGCTTTTCTTACCAAAAGAAGGAGAG

GTTCCCCTMTTCCTTTTGGGCCATAGATCCGCTTTTAGGATCTGATTAAAGATGTGGAC 

TTTCACCGCAGGAAGATACCCATGGGCACACCATTCAAAATGTATCAGATTTCAACGGTT 

TTACAAACTCCCCCGGGATTAAGTCAAATGGGTGGATATTACAGCTGTTTTCATCAGATA 

TGGTTATTTGTTGACAACCATCAGGACAACAATGTTTATAGATGGAAGGATAACTTCCTG

GTTTTTTTCCACATTGAATGTGGCTAGTTACATATCTCAATTTAAAATAAATTGTGGAAA 

GCCAAAAAAGTATGGTCAAGCTAACCTTGGGGTGGCTTTTACCTGATGCCTACAAGCACA 

GAAAAATAG 1 1 I I I AA 

Sequence 90 

CCGCGTCCGATTATGCCAAGAGAAGGTATTACTTTAAGATGTGAAAAATGTAAAATGGAA 

AATTACATTACTAAGAAAAACAAAAAAACACAAACTGNNAAATTAGAANTGAAAAAACAT 

TGCCACAAATGCAACGCACATACAAGTCATAAAGAAAAAAAATAATTTAAAATGAATAGA 

MATATTTAAGACTTMGCTTTGAAGAACACAATTTAGATGCAATMTTTCATTTTCTCA 

ACAAACAAGATTATGAGTTTCTAATCTTCAAACAAGTGATGGAATAGTGTTTCTTGAAAA 

AGAAGAAATCCATTTATTTGTTGATTCAAGATATATAGAAGCTGCTCAAAAAGATGCAAA 

AAATGTACAAGTTCATTTATTGACAGCAGCTMTTTAAAAGATTTTGTAAGTAGTAAAAA 

TTACTTAAAAATTGGTGTTGAAAAAGMTACTTAACTTTAGCTGATTTTAAAAAACTTCA 

AGCTTGATTTCCAAGTGCAGAATTTGTCAAATCAATGCGCAAAAATTAAGACTTATTA 

Sequence 91 

CCGTTGTCCCATATATCTTGTTCCAGCAGCCATATATCTTGNGGTCTACACGCCTAAAGC 

ATGATTTCCCTTGAAGTCTTGGGGTTGNTTAAAGGAGAGTCCCTTCAATATAAAACCTCT 

GMATATTAGTGAGMTGGCTCACTMTGTGMCMTGTTTAAATTATTTATTTATATAT 

AGAATTACTGAATATTAGTACTGGGAAAATTTATAGAAATCATCTAGTCTTACCCTTCAT

CTTACATATMGAAAAATGGTCTTTTCTTCTAATCACATTTACAAAATATGATATAAACC 

TTGAGCATGMTGTATGAGCCTMTTAGAGMACAGAAMTCAGCATGTCAGTTTTCCTT 

CATTCAAAATAACATAGTCTTTCTAAGCAGTCATTCTGGGAG 

Sequence 92 

ACCACGCGTCCGCAAGGCCCGCCCCTTACGTACTCGGAGCTCGGATCCCAGTGTGGACCT 

GGACTCGAATCCCGTTGCCGACTCGCGCTCTCGGCTTCTGCTCCGGGGCTTCTTCCCTGC 

CCGCCCGGGGCCCTGACCGTGGCTTCTTCCCCGGCCTGATCTGCGCAGCCCGGCGGGCGC 

CCAGAAGGAGGAGGCGGCGCGGGGGCGCGCTGGGCGGGGGAGGCGTGGCCGGAGCTGCGG 

CGGCAAGCGGGCTGGGACTGCTCGGCCGCCTCCTGCCCGGCGAGCAGCTCAGACCATGTC 

GCCTGAAGAATGGACGTATCTAGTGGTTCTTCTTATCTCCATCCCCATCGGCTTCCTCTT 

TAAGAAAGCCGGTCCTGGGCTGAAGAGATGGGGAGCAGCCCGCTGTGGGCCTGGGGCTCA 

CCCTGTTCACCTGTGGCCCCCACACTTTGCATTCTCTGGTCACCATCCTCGGGACCTGGG 

Sequence 93 

NCGCGTCCGCCAAGATGGCGTCCNTCATGGAAGGGCCGCTGAGCAAATGGACTAACGTGA 
TGAAGGGCTGGCAGTACCGTTGGTTCGTGCTGGACTACAATGCAGGACTGCTCTCCTACT

ACACGTCCAAGGACAAAATGATGAGAGGCTCTCGCAGAGGATGTGTTAGACTCAGAGGAG 

CTGTGATTGGTATAGACGATGAGGACGACAGCACCTTCACAATAACTGTTGATCAGAAAA 

CCTTCCATTTCCAGGCCCGTGATGCTGNTGAGCGAGAGAAGNGGA 

Sequence 94 

ACGCGTCCGCGGACGCGTGGGTGCGGGCCGGCCNCCCTGGACGAAAGAAGAGGGCCCCTC 

CAGGCCAGTCTGGGCACCCTGGGATAGCGGCTGCAGCCAGGCATGGCCGACTCTGCACAG 

GCCCAGAAGCTGGTGTACCTGGTCACAGGGGGCTGTGGCTTCCTGGGAGAGCACGTGGTG 

CGAATGCTGCTGCAGCGGGAGCCCCGGCTCGGGGAGCTGCGGGTCTTTGACCAACACCTG 

GGTCCCTGGCTGGAGGAGCTGAAGACAGGGCCTGTGAGGGTGACTGCCATCCAGGGGGAC 

GTGACCCAGGCCCATGAGGTGGCAGCAGCTGTGGCCGGAGCC 

Sequence 95 

CCCCGCGTCCGAGGTGACCTCCTTGGCCCAGATCATCTTAGAGCCAAGAAGCAGGACCAT

TCGTGGTTTTGAGGCCCTGATTGAAAGAGAGTGGCTGCAGGCTGGTCACCCATTCCAGCA 

GCGCTGTGCACAGTCAGCCTACTGTAACACCAAGCAGAAGTGGGAGGCTCCTGTATTTCT 

TCTCTTCTTGGACTGCGTGTGGCAGATCCTTCGTCAGTTTCCCTGTTCTTTTGAGTTTAA 

TGAGAATTTCCTCATCATGCTCTTTGAGCATGCTTATGCCTCACAGTTTGGAACATTTCT 

GGGCAACAATGAAAGTGAAAGATGTAAGTTGAAGCTACAGCAGAAGACGATGTCTTTGTG 

GTCCTGGGTTAATCAGCCCAGTGAGCTGAGTAAATTCACCAATCCCCTCTTTGAAGCCAA 

CMCCTTGTCATCTGGCCTTCAGTTGCTCCGCAGAGTCTTCCACTGTGGGAAGGTATTTT 

CCTACGTTGGAATAGATCCTCTAAGTATTTGGATGAAGCATATGAAGAAATGGTTAACAT 

CATTGAATATAATAAAGAATT 

Sequence 96 

CCGCGTCCGGTTTNCCTGTTGGTTAGGCTGGTCTTGAACTCCTGACCTCACGATCTACCC 

ACCTTGGCCTCCCAAAGTGCTGGGATTACAGGCCTGAGCCACTGCACCAGGCCCACCCTG

TCTCTATTTTCTAAAATAATAAATCTGATTTTAATGTGGCTGGATATAAATCATATCACA

GTTGGATTTGGAAGTTTGGGTTTTATTCCTAACTTTGATGGGAAGCCATTTTAAGCAGAA

AGATGATTTTMMGACCACTATATTTTCTGTGTGAAGAATGAACTGGGAGATTTCATAG 

TATTATTAACAAAAATAGAATAGTTGGGGATCTGGTTTGGCTTGGGAAATGGAGGAAGTT 

CMCTTTGGGCATGCTCCATTTGCATTGCCAAGACATTGCAGCAATTGGAAGTGCAGTCA 

GAGAGCTTAGGAGAAACACTTGGCAGATGGACATAGAGAAGTAGTACTCAAAGCTTGTGG 

ACATTGATTAAATAATCATACAGGAGTATGGGCTGACAAAAGATTNCAAAGAGAAAACCT 

Sequence 97 

GTCNCCACGCGTCCGGGACTCTCGGCCCTGGAGAAGGAGGTGGACTTTGACTCCGACCCC 

ATGGAGGAGTGCCTGCGGATCTTCAACGAGTCCACCAGCGTCAAGACGGAGGACAGAGGC 

CGGCTGGCCCGGCAGCCCCCCAAGGAAAAGAGTGAGGAGAAGGGGCTTTCGGGTCTGACC 

ACTCTGTTCCCCGGGCAGAAGAGGAGGATCTCCCACCTTTCCAAGCAAGGCCAGGAGGTG 

GAGCCCCCGAGGAGGGGTCCCGCGGTGCCCCCGGCCCGGCCCCCGACGGCGCAGGAGGTG 

TGCTACCTGCGGGCCCAGCAGGCGCAGAGGGCATCGGCGAGCTTGCTGCAGGCCCCCGCC 

AGGCTGGCAGAGAAGTCGCCCTNCGTCCACATTTCCCGCCCCTGGCQAGAA 

Sequence 98 

CGCCNCGCGTCCGGCAAAGCAAAAGGGAAATTATTTGGTGGATGGTAGCTCAAAATTGGA 

ACTCTTGTTCTMTTCAGTTACATTGGCTTTACCCTCCTTAGATTTTTCATCAAAGGGCT

GTCCCATTGCMTCTTACTAAMCATTTTGTTAAMTAMCTCTTTTCCTTTTTATATTA

ATMTTAGGCTTTTAAATAMGATGTTATTCCTTTAAAATGGTGGGCTTACCATCATTGA 

AGATGTCACTCAGGTGGCCTTGCTTGATCAMACGCCTTTTTTAAAAACCAAGCTTTAAA 

AACATGTTTATAATTTCATGAAGTACATATATATTGTTCCCATAGTCTTCAGCTTTAAAA 

CTATA AATATGCCCAMTTTTGTTATTTGCCCTACT7TAAGTAGGTTTATTGNGTTTGTT 

CAAGTACTTGT 1 1 1 1 CTCTG ATAAG ACTCAG G AATTCTG AAATGTG AAAATG NCT 

CAATT 
Sequence 99 
CNCGCGTCCGAMTCGTTGCTACCAANTATTCAAAACCCTTTGAGTTTACATACTAGTTA

CCTTAMMTTANTNCC TGACNC TCI^fTGANTTTGGGNGGAAAGCCCTTGTI^TCNNCTCTC 

TNATGNACTCTCATGGGTTTTTTTGTATGATTTGMTATNMTGTGCCTAMGMTTTTT 

GCTCTCTTAATCTATGNATACATACTTGAACAAATCATTCTTGCTTAACTGCTGATCTTT 
TGTAAAACTATTG 

Sequence 100 

GCCCCGCGTCCGGCTGGAAGCAACAGTTTGGCAGCCTGGGGTACACTCAGGTTATTCGTT

ACAACTATTATTATTTGATGTCTTTTTTTAAACTCAGGTCATCCACTTTTGACTGTCATC 

CATGGAAGAGCTCTTATTAAAAGCCTCAGACTTTCGGGACCTATGATTCTTTGGCACAAC

CTTTTGGAAAATTCTTAAGCAGGGATGAAGCAAACTTGATTGGAGTTGGGGAAAAAGAAG

ACAGATTAGTA 1 1 1 I ICATGCTGACAAAAAATAGCTGCTATGACTTTTCCCGCAACGTGG 

ACAGGGGCCAAGTGAAGCTGAACTGGTCATGGTCTGTCGCCCAGTGTTCCCTTGTCGTCG

GCGATTTTGCCCCCGACCCTTCTTGGTGGGCTTAGTGGTGGCAATCTGTCTCTTCTACCA 

GACTCTGACCCTCCGAGGGTCGAGGAAGCTCACAGCCGCTGCCCCTGGGGCTGTCCCACA 
CACATCCACTGAAA 

Sequence 101 

CCACGCGTCCGGGCGTCTCGGCTCTTCTGTATCTCCCTGGCCTGGTCGCTCTCGGCTTCT 

GGGCCTCGCCCTTCCTGTCTGTGAAATGGACTCTGGGTGAATCCAAATGGGATCGTCTCG 

GGCTACGTCCTGTCCCTCCGGGACTACAAGTCCCAAGGTGCTCGAGGCGACCTTGGCTCC 

CCCTCCCCACCGGGACCCGCCTCCCTCCAGCCCAAGTCACGTCGTCTAACCTGTTCCCAG 

CTCCTGCCCCGCCCCGTTCTCCGCTCCCCAAGCCGGAGCCCX3AGCTGGAGGAAGCCCCCA 

GGTGCCAGGATCTGCTCGGATCCGNGCCCGCTCCGGCCGGCACCATGGACAGTGAGGCAT 

TCCAGAGCGCGCGGGACTTTNTGGACATGAACTTTCAGTCGCTGGCCATGAAACACATGG 

ATCTGAAGCAGATG 

Sequence 102 

CCACGCGTCCGGTCCGGGGTGGAATCACGTCGCTGCGGCTGCCGACGACCCACACCCGGC 

CGGCCGCCTCCGCAGACCCACCTTGGCCGCGCGGCAGGGGGCGCGCAGAGCCCCGAGGGA 

GCGAGTCCCCGCGCGTGGCAGCTCGGCGGCTTCTCCCTTCGGGAGGTCCGGCTCCCGGCT 

CTCCGGACCCGCCTGGCGTCCTCGCCTGCGGCGGGGCGGACGACAGCGGCGCCCAGGAAT 

GGCTTCGGCGGGCAGCGGCATGGAGGAGGTGCGCGTGTCGGTGCTGACCCCCTTGAAGCT 

GGTCGGGCTGGTGTGCATCTTCCTGGCGCTGTGTCTGGACCTGGGGGCGGTGCTGAGCCC 

GGCCTGGGTCACAGCTGACCACCAGTACTACCTTGTCGTTGTGGGGAGTCCTGCCGGAAA 

CCCGCCAGCTTGGACATCTGGCACTGTCGAGTCCACGCTCANCAANCGATTGGCAGATTG 

C 

Sequence 103 

NCGCGTCCGAGAAATTGCAATTTTTTAATTTTTMTTTTAAGAGGAATTCGTGCCAGAGA 

GAACTATTAAGAAAGGGGTATATCCAGTCTAAGGATTATTAGGCTCAAGTCCATGAATAG 

GCTCTGGGAAGTTTGTAAACACTTGGAAATTATTTGCAAAATGTGTGTGTGAATGTGCTT 

TACCTTANAGAGTTCATGAATTTCATTAGATTGTTGAMGAGTTTTAGTATTAACAAAGG 

MAMCAAACCACCACCATCACATAACAMACCACAACAGTGATTTAATCTTTTCACCTA 

ACAATAAGTAAATTGAGGCTCTGATGGCTAAATTAATAGCCTGAGGCTACACAGTCAGTG 

GCAGAGCCCAGGGTANAGAGAGAACCAGCACAAGCCATTGTGGGAGCCGAGGGTAAAGAG 

AGAGCtAGGTGTTGTACCTTAGTAAATAAATCAGAA 

Sequence 104 

GNGTCGCCCCGCGTCCGGAAGGTGGAGACCGCTTACCCTGATCNGGGATGTATCGGCTGC 

GGGTGCGCAAGGCAGTCCAGGAGTGACCTGGGGCTGTGGAGAGCGACCCGTGGCCTTGTG 

TTTCAGAGTTTACCACCTAGGATGACTTCAGTGACTAGATCAGAGATCATAGATGAAAAA 

GGACCAGTGATGTCTAAGACTCATGATCATCAATTGGAATCAAGTCTCAGTCCTGTGGAA 

GTGTTTGCTAAAACATCTGCCTCCCTGGAGATGAATCAAGGCGTTTCAGAGGAAAGAATT 

CACCTTGGCTCTAGCCCTAAAAAAGGGGGAAATTGTGATCTCAGCCACCAGGAAAGACTT 

CAGTCCGAAGTCCCTTCATTTGTCTCCTCAAGAACAATCTGCCAGTTATCAAGACAGGAG 
GCAATCCTGGCGGCGAGCAAGTATGAAAGAAACGAACCGGCGGAAGTCGCTGCATCCCAT
TCA 

Sequence 105 

CGTCCGCGCAGCGCTTGAATCCCGTGGCCTAACCGTCCCTCGGMGACCGGTCCCGCTCG 

GGAGGCTCTGCAGTCGCGCCTGGGGTCAGGGCCGGGGGCGAATGTGGCTCGCGTTCTAGG 

CCTCCCTGGGTTGGAAAAAGACTATGTTAGCAANGTGTCACGCCATGCTTTTGCCAACTT 

TGCAATTAAAGGTTGACATTCCTGCATAAGCATTTCTCTGTGAAAATGTCCTTGCCTCTT 

ACAGAGGAGCAGAGGAAAAAGATTGAAGAGAATCGACAAAAGGCTCTGGCCCGCAGAGCT 

GAGAAGTTATTGGCAGAACAGCATCAGAGGACTAGCTCGGGCACCTCCATTGCTGGCAAC 

CCATTCCAGGCCAAGCAAGGCCCATCCCAAAATTTCCCAAGGGGAGTCTTGTAAGGCCAA 

GTGAGCCATTGGTGTCATTTTCAAGCAACAGAATCTCAGTAGCTCATCTAATGCTGACCA 

AAGACCTCATGATTCCACAGTTTTCANGCAANGGGAATATGGAAA 

Sequence 106 

CCGGCCCTATCCCTATATTGTTTGCTTGTTTGGGATMCCTAAMTTTTTTATCCAGTTT

ACTACTAATTTGTTTTACCTGATGTATCTTCTCTTTCAATAATTTTATGTTACCTTCTGT 

TTAGAATAATATTTGCCACAGATATTTAGGTTTAATTCTGTGTTTGAATGATTCCAATGC 

CTTTCTCTACCCACTTTGAACACTTCATCCTGGAATGGTTGGCTGATGTATGTCTCTAAA

CAAI I 1 1 I I I I I I AGGAGAAGGTATGTGGGTAATGTAATTCCTAAACCTTTGCTTTTCTG 

AAAMTCTTTCATTTGCCTTTATACATGACCAGATTTACTGGGTATATAGATTTGTTGAT

GAAAAAAGGTAAAAAGAGCMCTTTTGACATCCAGAGGTTTGTCTGGCACTCACAGCTAG 

CCCCGTGTTATTCTCCCTATT 

Sequence 107 

GCGTCCGTCTNAACCCTAAAGCTAAAAAGTCATTGTGAACCTTTNGGTCTGATGCTAAAG 

AAGGGAAAACAGGTACAGGAAATCCCATGTGGATGCTTGCTTNCAGGATTTCCTGCCATG 

ATTCCAGAATCCCACAGCTNCAACATGATTGCAAAAAGACTCCCTGCTCATTTTNCCTCA 

GCATGCACAGCGCTGTCCTGTCTCAGTTGCAACTCGACAGAGCCGCATTTACTCCAGAAC 

CCAATCCACACACCTGCTCATCCTGCCCCGAGAGGAGTGCCTGAAGCCAATAGCAGGGAA 

CTAGAGCAGACTTGGGTGGATCTTCATTGGATATTAGGTATCTTGCCCTAGATAGGCAAG 

CAGTGGCCTTACAGATGCTGACAGATGATCTGATTAGATGCACAGNTGCTGGGTGGCGTC 

TGGGGCCAGTCTATTGGNCAGTTCTGGGAGNGGGAACTATTTGGGCTCTGCAAAGATG 

Sequence 108 

CGTCCGCTCCCTGGCCCTGCTCCGGGAGCTGTCGTTGTCTCCGCCAGCAGCCCTGTGGCT 

GCAGGAGCGCCAGGCCCAGCTTCGCCACTCGCTGCCCCTGCAGAGCTTCCTGCTGAAACC 

TGTCCAGCGCATTCTCAAGTACCATCTGCTGCTGCAGGAACTAGGGAAGCACTGGGCGGA 

GGGCCCAGGCACTGGGGGTCGCGAGATGGTGGAGGAAGCTATTGTGTCCATGACAGCGGT 

TGCCTGGTACATCAACGACATGAAGCGCAAGCAGGAGCATGCAGCGCGCCTCCAGGAAGT 

GCAGCGGCGGCTGGGTGGCTGGACCGGACCAGAGCTCAGTGCTTTTGGGGAACTGGTGTT 

GGAGGGCCGCGTTCCGAGGAGGCGGANGGGNGGGTTGGCCCCCGGCTACAAGGGGGT 

Sequence 109 

AGAATTGTGTATGCCTTGCCTATCACGGTACAGCACGAAGCCAGGCTCCTTTCTCCACCA 

AAGAAGATGGAACCAGACTGGAATTCTGTCTCCAGAGAGAAACCCAGCTGTTTGGGTCAA 

AGACAGATGCTTCAGACTTGGGTGGGAAGGTGAAAGATGGCTATTTAGAAAGCTGGTGGC

ACGTTTTACATAAGGGAATGTCAGATGGGAGATGCTAGTTGCCATTTTAACAAAGCAGGT 

AAATCGGTAAATTTTAAACTCTGTCCATGTTCTGTTAGAACTCAGGGACAAGGGATCCAT 

GAAAAAG 

Sequence 110 

ACGCGTCCGCACGGAGAGAACTGGNCCTGGAGCGGGGGCGCGGGGAGGGGGGCGTCGTCN 

TGGGTACAATTGCGCANGGGCAAAGGTCGAGAGGTCGCGCTGNCGCCGTTTTATTTGAAG 

ACATCGTCCAGTTCTGACCATGGACTCNCAGCCATCGGCCCTTAGTTTCCATTCCCTCTA 

GNGGGCCTTCNGAGGGNTCTACTGACGTACCTCCTTCCCTTGGTACCGGACCGGGGAAGT

GTTTTCGGGCGCGGGAGGTTCCGCATGCCCAGGCCTGGCCAGGGGA 
Sequence 111

CGGGCCCTTAGTCCAAGCCTTGATCGGCGACTAAGTGACGGCAGTGACTGCCGCCATGCC 

GAGCTGGACGGAAGNCACTTCTGAGAAGGGCGGAAGTGTCTCGGGCTCCTTAGAGGGAGG 

ACACCATATTAGTGCCAGTGGGGAAGTCACCGGGTGGAATTACTTCTTTGTGGAGTTTGT 

GCTGTAGCGACAATGAAAAACGAAGAGTCAACTTTTATAAAACAAAATAAAAATTAAGTC 

AAATCATGCCAACCTTTATTAGATCGGCTAGCAGGGTTAAACTTAATTCAAAGCCCCTGA 

TGAATCGGGCCTTCATTGCACCCCCAAAGGCTCCGCCACCCTGATT 

Sequence 112 

CGCGTCCGGGCGCCGGTACGCCTGGTCCCCGCGTGGAGTCTTTACTCAAAACAGCTCCCG 

CCTCAGGCCGAGATGAGGAGCCCTTCANAATAGCTGCTGTCTCTGGGNGGACCCGGGCGT 

CCTTGGCAGCCCAGCTGNTCTGGACAAAGCCCTGCCAGTCAGGCCTCGGCTGGCAGGAAC 

CATGGCAGAGGCTGGGGATGCTGCGCTATCGGTGGCCGAGTGGCTGCGGGCATTGCACCT 

GGAGCAGTACACGGGGCTCTTTGAGCAGCATGGCCTGGTGTGGGCCACTGAGTGCCAAGG 

CCTCAGCGACACCCGCCTGATGGACATGGGCATGCTACTCCCT 

Sequence 113 

TGTCGACCCCGCGTCCGCGGGANGTTCATGGAAACGCAGGACACGACAGAATTGTGTNTG 

CCTTGCCTATCACGGTACAGCACGAAGCCAGGCTCCTTTCTCCACCAAAGAAGATGGAAC 

CAGACTGGAATTCTGNCTCCAGAGAGAAACCCAGCTGTTTGGGTCAAAGACAGATGCTTC 

AGACTTGGGTGGGAAGGTGAAAGATGGNTATTTAGAAAGCTGGTGGCACGTTTTACATAN 

GGGAATGTCAGATGGGAGATGCTNGTTGCCATTTTAACAAAGCAGGTNAATCGGTNAATT 

TTAAACTCTGTCCATGTTCTGTTAGAACTCATGGACAAGGATCCATGAAAAAGACCTGTG 

ATGTTTCNTCTGGCGCTTTACTGGCCTGGGCACACCTACCAATCTTTTAGGATTTGACTG 

GTTCCATTACATTTCCT 

Sequence 114 

GTCGACCCCGCGTCCGTATCACTGTAATTTAAGGAAAGAAAACTTCAGTTCTGCCTCTGG 

ATACCAAGATGCCCATTGCTCAGTTCAGACAACTGATATTAAAATAAAGCTATGCTCCTT 

ACTTACTTCTTTTATTATAMCAAATTCCTTTGCTTTGGCTGATACTAGCTGAGTCATTG 

ATCATCATTGGTACCATGATATTGTAATCTATGCTGCTATTTGGCACAAGACTGAAGTTC 

ACACTACAGTAGAGAATACTATAAGATAATTTGCAATAAATACTGATAATAATAATACCA

GATATTTTMCTAACTTTTTCTACCTTTATTAATAGCAATCAGCACACTTGAATGTGTAA 

ATTTCACAGTAACTTTAGGCAGAACTTAAGCTCCAGGCCACATTTGTATAAGAACACCAA 

GTATTCAAGGCATAAAGTCTGTTGTAAGCCAAAAAAA 

Sequence 115 

AGTTCAGTCTGCAGCAGTCCCTGCACCCACTTCCCAGTTGCTTTCATCTNTGGAAAAAGA 

TGAGCCCCGTAAAAGTTTTGGCATCAAGGTCCAGAATCTTCCAGTACGCTCTACAGATAC 

MGCCTTAMGATGGCCTTTTCCATGAATTTAAGAAATTTGGAAAAGTAACTTCAGTGCA 

GATACATGGAACTTCAGAAGAGAGGTATGGTCTGGTATTCTTTCGGCAGCAAGAGGACCA 

AGAMMGCCTTGACTGCATCAAMGGAAMCTTTTCTTTGGCATGCAGATTGAAGTAAC 

AGCATGGATAGGTCCAGAAACAGGAAAGTGAAAATGAATTTCGCCCCTTGGATGAAAGGA 

TAGATGAATTTCACCCCAAAGCAACAAGAACTCTNTTTATTGGCAACCTTGAAAAAACC 

Sequence 116 

CCCGCGTCCGCACCAGGCCCGAGTCTTCCCTTCATGGAGGGTGACGTGAGCAGCAAGGAT 

AAGATGGGCAAAGGAGTGGATGGGACATATAAAAAAGCTCTTCAGGAAGCTGCAGCAAGG 

TTTGAGGAATTAAAGGCCCAAAAAGAGCTAAGACAGCTGCAGGAAGACCGAAAGAATGAC 

AAGAAGCCACCACCTTATAAACATATAAAGGGTCTCCCTCTGTGACCCAGGCTAGAGTGC 

ATTGCTGCAATTTTGGCTCACTGCAACCTCCGCTTCGTGGGCGCAAGTGATTCTCCTGCC 

TCCTGCCTCAGTCTCCTAAGTAGCTGGGATTACAGACATGAGCCACCAACGCCTGGCTAA

TTTTGTGATTGGCAAAAAAGAGATTTTGTGACACATAAAGATGATATGAAATTCAACTTT 

CAATCAAGTATCCAGAAAATTTTA 

Sequence 117 

CCACGCGTCCGGCCCTTGCCCCTGTCNCACANGMTGGACCCACGGCCCCACCCAGCGCC 
GTCAGCGCCCGGCACTGCCACCCGGGTCCGGGCCGCTGCCTGCACGTGGGATCCGTCGGG

CAGCCGGGGACAGAAGAGACCCCGCCGTTGGGACGCAGGGCAGAGCCGGCCACCTAGTCC 

CTTCCAGCCAGCAGAGGCGAGGGAAGGCGTCACTGCCCCGGCGGGGAGACGGGCAGGACG 

CCCTGCCCCGCACCAGCAGCCTCCGCCGGGGCGCCCTCAGCTCCCTGCTTGGCTCTGTCT 

CTCCACACCCGGCAGGGCCCGCGGGCTGCCCCAGCCTGGGGGGGTCGTGGGCAGCTGCTA 

CTCAGTGCCAACCCCGTGGGGCACAGA 

Sequence 118 

NAGGGAGTCGACCACGCGTCCGGTGCGGAGCAAGCATCACACCATGGCGTATGAGTGTTC 

CTCTGTGTAGACTCAACCTGCGCCTCGCCGTCCCCCATTCGCACACCCGATGCCCGGGGG 

TCGCTACGGACTTAAAATCTCCGCACCGCACCCTCCACCTCAGAAACGTTCCTGGATCCG 

AACACTGCCCCCTGACGACCTAGAGAGATCCCGGCTCCCAGCCCACTGAGTGGCTTCAGC 

CTCGCTGGTAGGTCTCTCCTCCAAAGCTCTGGAAACAGACTCCTGGGAGTGANGGTAGNG 

GGGGAGCNGCAGGCACCGCCCCCTTTCCCCAAGTCCNCCGCCCACTTCATCCCTCAGGCA 

CCTNCCAACTCCTGGCCTTNTCTGCACGAGGCGCCTGCCCCGGGCCCCGCCTACAGGGGA 

CCCAGCTCTTCTTGCAGCCATTGGAAGNTGATCACCTGGGAGGTGA 

Sequence 119 

CACGCGTCCGGTTTTACTGCTCTTTGCCATGTGGTAAAAAGAGGCTGAGACATATTTAAG 

AATTCCAAGAGGATATTATGTGTCAGAATTTCAGACACTGATGAGAAG I I 1 1 1 AATTGTT 

CTTTTTTATTTGATTTTGGAATTCAGGTGCACTCTATTCAAGTGCAAGGATATCAGAAGT 

I I I I I I I I ATTTAAAAAA I I I I I I I I I CGAGATGGAGTTTCACTCTGTTGCCCAGGCTGG 

AGTGCAATGGCAGCTTACTGCAACCTCCACCTCCTGGTTCAAGCGATTCTCCTGCCTCAG 

CCTCCCCAAGTAGCTGGGGATTACAGGCACCGCGCCAACACACCTGGGCTTATTCTAATT 

TAAGTAAGAAAATGGGAAGTCTTACCCATNTTTGGTCAAGGCTTGGGTCTTCGAACCTNC 

fGACCTTAANGGTGATNCCACCCCANCTTTGGCCTCCCAAGCCGTGCTNGGGATTATAGG 

GCATGAAGCCCACCCANGCCCGGNCCCAGGA I I I I I ATATTTAAGCCCTTCTTGCTCTTN 

AAAAAAAAAAAAAGGT 

Sequence 120 

NGTCGCCNCGCGTCCGGGAACCTACCGGTACCGGCCGCGCGCTGGTAAGTCGCCGGTGTG 

GCTGCACCTCACCAATCCCGTGCGCCGCGGCTGGGCCGTCGGAGAGTGCGTGTGCTTCTC 

TCCTGCACGCGGTGCTTGGGCTCGGCCAGGCGGGGTCCGCCGCCAGGGTTTGAGGATGGG 

GGAGTAGCTACAGGAAGCGACCCCGCGATGGCAAGGTATATTTTTGTGGAATGAAAAGGA 

AGTATTAGAMTGAGCTGAAGACCATTCACAGATTMTATTTTTGGGGACAGATTTGTGA 

TGCTTGATTCACCCTTGAAGTAATGTAGACAGAAGTTCTCAAATTTGCATATTACATCAA 

CTGGAACCAGCCAGTGAATCTTAAATGNTCACTTAAATCAGAACTTTGCCNTTAANAAAG 

AAAATTGGGGNGTCCTGGGTTTA 

Sequence 121 

CCNCCCCGCGTCCGATCAATTCCTGGAATTTATGGTTATAACTTCGAAACAGAAGATGGA 

CTAATTTTATTTTTATCTAATTTTATTGTTGGAAATTTAGGMTTTACGGAAATACTAAT 

TTAMTTATTTAAAAAGATCATTAGAATCAAGTAAACCAATTTTGATGGCTATGATTGAC 

TCMCMGAGCTMTTATCCAGGTAAAACMTAGATAAAATTTTTGCTAAAAAGTTTTTA 

GAAAAMCATTTTTAACAACAAAATCCACTTCAAGAATAATTGTCGGAGCATACGATGM 

GAGATGCTTTCAATTCAAGAAATCCTTGATTTAGCTTACAAATCAAACGTAAAGGTTGCT 

GTATATGGNAGAAATTATGACAATCTTTTAGAAATGAATCAACGATTAGCACAAAAACAA 

MTCTTGAMTACATTATCCAGMTTTTTTGATTTCAGGCAAGCTAATAAAATCGATMT 

TTTGTMTCTTAATTACATCAACACCTGGAGCGAATTTACCAMGATTTTTTAGAAT 

Sequence 122 

CGCGTCCGCGAAACTGAGAACCAGTTCTCCCGAAGCCGCGGGTCTCCGGCCGGCGGCGGC 

GGCGGCGGCGCAGGTGAGCAGGGCAGGGGGCAGCCGAGGGAGCGCGGGGAGCGGGGGCCG 

GGGGGCCACGTACGAGGGGCTGCAGGCCCAGCCGGGGCGGGACTCGCCAATCCTGCGTCC 

CCAGCTCAGGACGCGGACGCTGATCCGMGCCCCTGGCCCCGGCTGGGTCAGCACTGGGA 

GAGCAGGCCCAGGTCCGCAGCCCGGGTGTGGGGCCCTCCCCAAATCCAGGGAAAGGATCG 
TGGAGCGGGGTGGGGACTGAMGCCATTTCCTTTCCCGTGMGMTTTTTATCAGTGCAA

GTMCAAAATATT 

Sequence 123 

CGCGTCCGCTNAAAAAMTMTACCAAAAMAGTTTTTGTAAAGACAACGCTCTCGCTGT 

GTTGCCCCGCCACTGTGGCCTCCTTAGCTTCTTCCCTGGGGCCTGCTGGACCTTTCCATA

CTCCAGAMCTAAAGGGGGTCCAGGACCCTGCTTNMCCCTAGGATCCCGCATCTTTTTT 

ill I U I I I I I GGACGCAGGGTCTTGCTGTGTCCCTCAGGCTGGAGTGCAGTGATTCACT 

GCAGCCTCMACTCGTGGGCTNAAGTGATTTCTTTAGCCTCAGCCTTNTAAGTAGCTGGG

GACTACAGTCATACACCAACATGCCCAGCTAANTTTCCTTTT1TTMTTCTTGTAGAGNA 

TGTTTGAGACGGCTTGGGCTNTGTTGCC 

Sequence 124 

CCNCGCGTCCGTGCTGATAAAACTCCTTTGACCTGACGATTGCTCTAAGTCCTAATTGCC

ATATTTATATTCCCATAGTAAGAGTGTTTGGAGATAGTGTTTGAGCTTTTTTGCTGGTGT 

TAAAAATGCATAATGAAAGATGGCACNAGAGAGGCATATTATATCCAATTCATGAAGTTG 

TTTGTGTTAACAGAMGCTTATTrTAATCACTTMCATTGTTGATTTGTCTAATCACAGT 

AGCGCTATTGATTAGGAGCCTGACCTTTANATGGTTGACTTGTGAGTGTATTCAATATGG 

TGAMTMNGGTGTTTGATATATGGCTGCAGATTTTAGAAGGTGTCATTAGCAAAGGTAT 

ACGGAATAMATANGGGTTATAGTATTCCTTACTCAAATTCTGTATGTGCTAGAGCTGGC 

TGGAGTCTGTTGGCATGCTCATTTGGTGTAAGGNCCGNTAAGGACTATGCT 

Sequence 125 

GCCCCGCGTCCGCACTTTGTATTGATAACTTAAAATGGCATCAGTTTATCTTAGACATCA 

GCTTGCTTTTTATCTCCTTTTTTAGTGAGTGAAATAGAGCMCTAGCATGCCTGTC 

CAGCTACTTGGGAGGCTAAGGTGGGAAGATCAATTGAACCTAGGAGGTTGAGGCTATAGT 

GAGCTGTGATTGCACGACTGCACTCCAGCCTGGGCAATGGAGTGAGACTCCTGTCTCTAA 

AACAGCAACAACAAAAATAAAGCAACCATAGTGCATAAGGGAAATTAAATGTTCCCTATA

GAAATATGTGTATGTCTGTGATMGTGGTATGCAMTGCTMTTATTTTATAAAATAAAA 

GTTCAGMCTATTCTTATCATTGCCACTTGAACAATTAMGGGTTTGCTTTATTTCCTAA 

TGTTTAATAGGAACCCTTTGCTTCAAACAGCCTTTGTTGAAATCATGTAAAAATTTGTTA 

ATAG 

Sequence 126 

CNCCACGCGTCCGGCGGCCCAGCCGCCGCCTCCCGTTCTCCCCTCCGCAGCGGCGGCGGT 

GGCGGAGAAGGAACTCGACACGCACCGACCGCCCTCCCGCCCCAGCCGAAGCGGAAGCTG 

TAGCCCGCTCTGGGCCGGGGCCATGGGCGCCCCGCGCCGCCCGGGTCATGAGGACGGAGG 

CGGAGGCAGCGGGGCCGCCGCTCGAGCCCGGGGACTTTGTGCAACTGCCTGTGCCCGTCA 

TCCAGCAGCTCTACCACTGGGACTGTGGCCTGGCCTGCTCCAGGATGGTGCTGCGGTACC 

TGGGCCAGCTGGACGACAGTGAGTTTGAGAGAGCCCTGCAGAAGCTGCAGCTGACCAGGA 

GCATCTGGACCATCGACCTGGCCTACCTGATGCACCACTTTTGGCGTGAGGCACCGCTTC 

TGTCCCAGACCTGGGGTGTNGACAAGGGCTACAAGAACCAGTCCTTCTACAGGAAGCACT 

TT 

Sequence 127 

CNCGCGTCCGCGGTCGCGTGGGCGGACGCGNGGGTTCGTCCTGGACAAGTCTGGGAGTGT 

GGCMATAACTGGATTGAMTTTATAATTTCGGNNAGCANCNGGGCGGAGAGATTCNGTG

AGCCCTGAMTGAGATTATCTTTCATTGTGTTTTCTTCTCMGCMCTATTATTTTGCCA 

TTAACTGGAGACAGAGGCAAAATCAGTCAAGGCTTGGAGGATTTAAAACGTGTTANTCCA 

GTAGGAGAGACATATATCCATGAAGGACTAAAGCTAGCGAATGAACAAATTCAGAAAGCA 

GGAGGCTTGAAMCCTCCAGTATCATMTTGCTCTGNCAGATTGGCAAGTTTGGACGGTC 

T 

Sequence 128 

GGAGTCACCNCGCGTCCGCCCCGCGTCCGCCCCGCGTCCGGTTAATCTTAGGCCTGAGGT 

TTGGGGCCGGGTGACMGGAAGTTAMCTCGTCCTCCCTGCCAGATTCTACCCCCTTTCG

GAGCTGAGCTCCAGCCAAACCTGTGGAGTTTTCTTTGACCATTTTAGGACATGTTACTGC 
TTCTGAGTTGGCTGCCCCAGCTGCTCAAACAAGACCTTTCTCCTGGGTTCGTAGTAGTGA

AAAGGAGCAGCAGAGCAACTGAGGAGGAGGGCGGGTGGGAGGCATGGGACTGGGGC7TGG 

GGAGGTCAGGCGAGACCGGGGTGAGAGCTCAGAGAAGCTCCTGTGACTTCCATGCTAAGA 

TCTTGCCAGAGAACTCTGGTCAGTCCTCGGGTGTCTGGATGAAGTAAAGGAGTTAGGCAT 

TTCTTCCTTTGATTCTCTGGCTTACCT 

Sequence 129 

CGTCCGGCCCGCTCCGGGCGTGGGCGTGTTCTCGGCGGGCGTGCCTGGAGGAGGAGCTGG 

GTCCTTGTGCGGCTGCAGAGTCAGATGGGGCGGGGATTTCGGGGCACCGGGTCCTCACCT 

TCACGAGAAAGGCCCCACAGCACGTCCCCACTACCCGACGACTCACTCTTCGTGGCTTCT 

CTCTCCTCCCCAAGAGCAGGGGTGGGCCTGTCTCGCGTTCCCCTGCGGGAGTCAGGAAGC 

GTCCTTCCTACCTACCAGTCCTCCCCTCTGGTGTCTGGGGACACTTCCTGGGGGCCTTTC 

AGGTGGTTGGCGCCGGTGCAGGGCCTGAGAGCCTGGG 

Sequence 130 

GCGTCCGGTGGCATCATGACTTCTGGGGCAGTAGACTGAGCAGCAACACCAGCCACAAGT 

CCTACCGGCCTCTCACCGTCCTGACTTTCAGGATTAACTACTACCTCTCGGGAGGCTTCC 

ACCCCGTGGGCTTTCACGTGGTCAACATCCTCCTGCACAGTGGCATCTCTGTCCTCATGG 

TGGACGTCTTCTCGGTTCTGTTTGGCGGCCTGCAGTACACCAGTAAAGGCCGGAGGCTGC 

ACCTCGCCCCCAGGGCGTCCCTGCTGGCCGCGCTGCTGTTTGCTGTCCATCCTGTGCACA 

CCGAGTGTGTTGCTGGTGTTGTCGGCCGTGCAGACCTNCTGTGTGCCCTGTTCTTCTTGT 

TATCTTTCCTTGGCTACTGNAMGCATTTAGAGAAAGTAACAAGGAGGGAGCGCATTCTT 

CCACCTTCTTGGGTGCTGCTGAGTATCTTTCTGGGAGCAGTGGNCATGCTTGTGCAAAAG 

AGCAAGGGATCAC1TGTGCTGGGTTTAAMTGCCGGMTTTGACMTCTTTGGGTGATAG 

GC 

Sequence 131 

GTCCGCTGGGGGCCCTGGGGCTCTCTGCGTCGAGAGCGCTCGAAGACCCGGGATTCCTGG 

CCCGATCGCGGGCGGGGGGAGACCCCAGCTCCACCCCAGCTCCCGCCGGCTCGGGGAAGG 

GGCGGCCCCTTTAAGAGCGCGCGGCCCCGCCCGCCCCCTCCGGGCAGGATCCGAATTCCA 

GGGAGGCGGGGCGGAGACGGCGGCGAGGAGGAGGCCGCGGCGCGGGACGCATAGAGCTGC 

GGCTCGGGCGGCGCCTCCTGCGGCGGCCCGGCCCGGCTCCGGCCCCCGCTGGGGCAATGC 
TCCCCGGG 

Sequence 132 

TCGCCNCGCGTCCGGGCACACACATGCCAGGCTATTTTAAGAACTACTACAACTATGATA 

MGCTGTGMTATGTAGCCATGMCCAAAACAMGTCTCTGTCCTTGTGGAACATTTGTT 

CTGTCAGAGAAGACAGTGTGTTGGCTCACATTGTGGTCAGTGCTGTTGAGCAAAATAGGT 

CAGAGTAAGGGGGATGGAGACTGGTGGGAGGAATGCTGCTTTATCCAGGATGGGCAGGGA 

GGACTCGATGGTGTCAGCACTGAAGGATGTAAGATCTGCTGCTCTGGGGAGAGGAGCAGC 

ATGGAAGGAGTAGAGTGCAGAGGCCATGAGGAAGGATCAGGCTTGACTCCTTTGAGCAAG 

GGGGATGGGAAGAGTGACGGNAGAAAGAGGGACAGGCCACATGGCCTGGTGGCCTGTGCT 

GAGGCCTTGGGCTTTTCTTCMGTGAGATGAGATGCCATTGGCCAGTTTGGGCAGTGATT 

TNATCAGACTTGGTTCAGCAGGACCATNCTGCTTGGCAATGTGGAGAGCANGCTGAAG 

Sequence 133 

CGCCNCGCGTCCGAACAGGCCGGGCACCAAGGCGCAGGATTTCTATAATTGGCCTGATGA 

ATCCTTTGATGAAATGGACAGTACACTAGCTGTTCAACAGTATATTCAACAGAACATAAG 

AGCAGATTGCTCCAATATTGACAAAATTCTTGAACCACCTGAAGGCCAAGATGAAGGTGT 

GTGGAAGTATGMCATTTAAGGCAGTTCTGCCTTGAGCTAAATGGACTTGCTGTCAAACT 

TCAGAGTGMTGCCATCCAGATACTTGCACTCAAATGACAGCAACTGAACAATGGATTTT 

TCTTTGTGCAGCTCATAAAACTCCAAAAGAGTGTCCTGCTATAGACTATACTAGACACAC 

ACTTGATGGTGCTGCATGTCTTCTGMTAGCAATAAATATTTTCCCAGCAGGGTTAGCAT 

AMGGMTCATCTGTAGCGMCTAGGATCAGTATGCCGTAGGATTTACAGMTATTTTTC

ACATGCTTATTTTCATCATCGGCAGATATTTGGATGAATATGAAAATGAAAC 

Sequence 134 
GCGTCCGCGAAAGCTGGGAAGCCAGGTCTACCTGCCCCAGACGAATTGGTGTACCAGGTG

CCACAGAGCACACAAGAAGTATCAGGAGCAGGAAGGGATGGGGAATGTGATGTTTTTAAA 

GMATCCTTTGAAGATGATGCTGCTTTTTACAMGCATCGTTTTAAAGCACATGGCCTTT

I I M I I I I AATTATTAGTGGTAGTAATATATAGAATGTATTACATAACTGTCACTGAAGT 

GGTTGGGGAAAATGTGGTGACTGAGGTACAGGAAACTACTAATCTTGCCATCTTGCTTTA 

AGGTGTTATGGTGGCACAGTTACTGCTCGCCTGTTAAATTTCAAATGTCCTGTTTGATAC

TACTGGAGAACACTATTTTTAATACAGAAAAAGGTCCCTATAATGCACTTCAGAGAAATT 

AA 

Sequence 135 

TCGACCCACGCGTCCGGGAGTCCCCCCTGCCCCCCATCAAATGCTTCCTGCAATACTTTG 

CACACCAGAGACTGGGCCTCCCCAGATCCAGGGGGACAGGGGTCCCTGGGGGAGTCCCCA 

GGGCCAGCCCCTCCAGGCCAGCTGCACACACTTGACACTGATTTGCACAGTCTTGCACAA 

ATAGGGGGTAAGAGCCCAGTGGCTGGGGTGGGCAATGGGGGTAGCCTCTGGCCTAGGGAG 

TCCCCTGGCACTGCCAATGGGCACAGTCCCGAGCACACACCCCCTGGCCCTGGACCCCCA 

GGCCCCTGCCCCACCAAGCGAAGGCTGCTTCCTGCTGGAGAAGCCCCAGATGTCAGCTCT 

GAGGAAGAGGGGCCAGCCCCTCGGAGGCGCCGGGGATCCCTGGGCCACCCTACTGCTGCC 

AACAGTTCTGATGCCAAAGCCACACCCTTCTGGAGCCACCTGCTGCCTGGGCCCAAAGAG

CCTGTTTTGGACCCAACAGACTGCGGTCCCATGGGGCGGAGGCTGAAAGGAGCCCGTCGC 

GTGAAAGCTTGAGCCCCCTTCGAAAGCCTNCGGAAGGGGCCAGGCCTGCTGAGCCCCCCC 
AGT 

Sequence 136 

CGACCCCGCGTCCGTGAGMTTCAGCTTTGGAGTCCCGGGTGAGGGGTTTTAGATAAACC

CATCAATATCACCCACATTCTGTGACTCTTTGCATCACTCGTGTTATTTATn"ATTTATT 

TATATTCTGCCTTGTTCCAGAAAAGTGTTTAAGGCAACAACGCTTGTTTTTTGGTGTTTT

GTTTTGACATTTGAAMTTTAGTACATTGTTMAATGTACTTGTTAAACAGGTMTTTTA 

AAGAGAAGGAACAATTGTTTTTAGTMGTTTTCTTTTTCCTTTTTCAATGMTTGATTCT 

TGAAATTAAAAGTTCTTGAGAGAAGGAGAGGAAGATACAGCAGACATAGGACTGAGCCAA 

GGAAGAGTCTGCCTGAGAGAGACGCTTGGCCTGTGCTTTGCTGCCATCCGTGCGGCCTTG 
GCCACA 

Sequence 137 

TCCGATTTTTAAATCTATTGGCCGTGTTGTCCTACCTGAAGTTCTTCAACTGCCAAAAGC

ACAGCCCTTTTTCTCTGAGCTGGTGGTTCTGGCTAACACTGACAGGGGTCGCTTGTTCCT 

GTGCAGTGGGCATCAAGTACATGGGTGTGTTCACGTACGTGCTCGTGCTGGGTGTTGCAG 

CTGTCCATGCCTGGCACCTGCTTGGAGACCAGACTTTGTCCAATGTAGGTGCTGATGTCC 

AGTGCTGCATGAGGCCGGCCTGTATGGGGCAGATGCGGATGTCACAGGGGGTCTGTGTGT 

TCTGTCACTTGCTCGCCCGAGCAGTGGCTTTGCTGGTCATCCCGGTCGTCCTGTACTTAC 

TGTTCTTCTACGTCCACTTGATTCTAGTC7TCCGCTCTGGGCCCCACGACCAA 

Sequence 138 

CGACCNCGCGTCCGGAAGGACCCTCTGAGCTATTTTGCGGCATACGGGAGCAGCAGCTCA 

GGCTCCTCGGACGAGGAGGATAACATCGAGCCGGAGGAGACGAGTCGCAGAACCCCGGAT 

CCGGCGAAGTCGGCGGGCGGCTGTAGGAACAAGGCGGAGAAGCGGCTCCCGGGACCTGAC 

GAGCTGTTTAGGAGCGTGACTCGCCCGGCCTTTCTCTACAATCCGCTCAACAAACAGATA 

GACTGGGAGAGGCACGTCGTCAAGGCGCCTGAGGAGCCTCCAAAGGAATTCAAAATATGG 

AAGTCAAATTATGTACCACCTCCTGAGACCTACACCACTGAGAAGAAGCCTCCGCCTCCA 

GAGCTTTGACATGGCAATAAAATGGTCTAACATATATTGAGGACMTGGTGATGATGCTC 

CACAGAATGCTAAAGAAAGCTAAGGCTTNTACCA 

Sequence 139 

CGACCACGCGTCCGGGCTGGCGAGCCCGGCTGAGGAGCCTCTTGGGTCGCACTTACCGCC 
GCGTCCGCTCCCGGTCCCTGGCCCCTCAGCGGCATGGCGTGCGGGGCGACGCTGAAGCGG 
CCCATGGAGTTCGAGGCGGCGCTGCTGAGCCCC 
Sequence 140 
CGTGTCCGGTGAATGGGAGCGGAACTCACAGGACACAATGAGCCGGGTCACTGATGGCCT

TGCTTTCTAAGAATCTCACAGTGAGCCCTAGAACTCTCTACGTGGTAACACTGTGTGCCT

TTTTCAGAGMGAGCCTATCTTAGATCTTAGCCTAACGTTGGGTCTATTGTGTTGCTGGA 

GAGACCAGCACTGACATTCATCTCAAAGCACATGGTATGTTTGACTCCTATGTTGACTCA 

ACTACCCATCTTGTACTGGGACACTCGC I I I I I I 1 1 1 I I 1 1 1 1 I I GAGACGGAGTCTTGC 

TCTGTCACCGGGCTGGAGTGCAGTGGCACGATCTCGGCTCGCGGCAGCCTCCGCCTCCCG 

GGTTCAAGTGATTCTTCTGCCTCAGCCTCCTGAGCATGTGGAGCTCAGGCTGAAGGTGAT 

GTGGCCGCCC 

Sequence 141 

GTCCGATTGATTCTTCTATGATGCGTGTTCATTATACAATACACATTCTCGGAAAGGCAG
AATTATTTGCTTTTATGATTGTTTTTTGTGACC^ 

TATCCCAGGCCAMAMTATTCTCTCACGTTTTTCTGTMGAGCTTTATMGTTCAGCTT 

TTATATTGAGGTCTGTGATTCATCTTGGATTATTATGTGTAGGTTGTAAAATAGGAATCT

AGGTCCAGTTTTTCCATGTGGATATTCAGTTATTGTGGGGCCAATTGTTGAAAAGTCTCC 

GCAMGMCTGCTTTCTTATCAGAAAGATAAGATATATTAAAAGTGTATATCTAAGTCTG

GGTTCTCTTTTCTGTTCTMTGGTTGATATTTTTATCCTTATGCCAGAACCACACTGTCA 

TGATTGCTGTAGCTTTATMTAGTCTTGMTCMGTTGCTTTTCAGTm 

AAATTGCTTTCGGTATTCTAAGTGCTTTGCATTTCTGCTAAMTTTAGAAGCAGCATGTC 

TACCAAAAGGAAAAAAAAAGCC 

Sequence 142 

TCCGGCGGAAGAAGGTGCGTCCGCGGCTGATCGCGGAGCTGGCCCGCCGCGTGCGCGCCC 
TGCGGGAGCAACTGAACAGGCCGCGCGACTCCCAGCTCTACGCGGTGGACTACGAGACCT 
TGACGCGGCCGTTCTCTGGACGCCGGCTGCCGGTCCGGGCCTGGGCCGACGTGCGCCGCG 
AGAGCCGCCTCTTGCAGCTGCTCGGCCGCCTCCCGCTCTTCGGCCTGGGCCGCCTGGTCA 
CGCGCAAGTCCTGGCTGTGGCAGCACGACGAGCCGTGCTACTGGCGCCTCACGCGGGTGC
GGCCCGACTACACGGCGCAGGTGCGTGCACCCCGTCCGCACCCCGCCCCTGCAGCCGCCT 
GGTCTCCCCGCCTCCCCTCCTCCCTGCAGGTTTGCGTGGCTGAGGCTCCCACCTCCTGAC 
CTCGGGGGCCGAGAGCTTTGCGAGCTGACCCCGCTTCTTCTGGCTTTGCAGAACTTGGAC 
CACGGGAAGGCCTGGGGCATCCTGACCTTCA 
Sequence 143 

CCCGCGTCCGAGATCCTGTAGGTGAAGTTCTCCTGTGCTCCACAGCCACCCAGAGGAATT 

eCAAAACCAGCAGTGGAGGACTTGGGGAGGACAGGAGGGAAAACATGGCGAGTTCATCAG 

CTCTGTTTCCTTTATTAAAATATTTCTGTAATTGGTGGTGGGAMTTGAAGAAATCAAGT 

GATTGCATCAGCGCTGGAAAAAGCTGCCAGCACTTGGCAGTGGAAGAGAATATATGCTTT

ATACTGGACTTTTTGAAAAAGAGGCTGAGTTTGGCCAGATTGCCGACCAGCAATGGAAAA 

ACTAATTAGGTGCCTTGCCTGTGAGCCAGACGCCCAGCAGGGCTGTGGCGCATGGCTCCC 

GCCGCCTCTGAAGAGGACACTTTCTAGTGAATTCAGTTCGTGCTACCCTTGAGCAGCCTG 

TGCTACAGCAGGCACATTTGTGAATCTCCCAGCCTGTGCCTGGCGTCNGAACTGTAGCTT

CCCAAAGAC 

Sequence 144 

CGCGTCCGAGTAMTCTGTTCTGCACTAMTTATATCAAAATAAATATAAATAATTATTT 

CGTATACCATTCGTTATATACATTTCAGTGTTTTTMTMTGTCMTTTCTT^ 

AGGAGMCTCTAGTTATTTTTATAMTCTAATTGACTTAATTTTGTGGGAATATAAAAGA 

AGTGATTAAAMCCTTTGGTTTAAAGTAGTTAATCCTGAAATCAAGCTCTGTAAATATTG 

TGTAGGGATATGGAGAAATCCTCAAAAAGAAAGAGCTAAAGAAAATGGCAGGGATGGCAT 

CTTGGGAGTATMCTGAAAGTAGGMGATGTGGATAGMGAGTCTTATTTTTAATCACAG 

GGCATATGTGCTATTTGMTTATTTTGACAAAAGTATAAAMTATGGMTTATGCATTGT 

GTGTGTGTGTGGTTTCATCTGTTTAAGAAATATGTGATGGAGACTGTCCTATCATCAGGA 

AATTATTCCAGT 

Sequence 145 

GTCGACCNCGCGTCCGCGGAGACACCGACCGCGGCGGCAGCAGCAGCAGCAGCAGCGAGA 
GGCAGAGGCGGCGGCGGCGGGGAGGACAGCACGGCCGAGGCTGCCCAGAGGCGCCTCCTC

CACACCCCCCGCCGCAGCAGCACCGGCGACAGATTTTTTAAAAAATGGATTTGGCCAACC 

ATGGACTTATTCTACTGCAACAGTTAAACGCTCAGCGAGAGTTTGGTTTCCTGTGTGACT 

GCACGGTTGCAATCGGCGATGTATACTTCAAGGCACACAAATCAGTTCTTGCTTCATTCT 

CCAATTACTTTAA 

Sequence 146 

CCACGCGTCCGATCCTCCCCAAGGCAGAGGTGTGCGTGCGGAACCATGTCCAGCCCTACA 

TCCCATCCATCCTGGAGGCCCTGATGGTCCCCACCAGCCAGGGCTTCACTGAGGTGCGAG 

ATGTCTTCTTCAAGGAGGTCACGGACATGAACCTGAACGTCATCAACGAGGGCGGCATTG 

ACAAGCTGGGCGAGTACATGGAGAAGCTGTCCCGGCTGGCGTACCACCCCCTGAAGATGC 

AGAGCTGCTATGAGAAGATGGAGTCGCTGCGACTGGACGGGCTGCAGCAGCGATTTGATG

TGTCCAGCACGTCCGTGTTCAAGCAGCGAGCCCAGATCCACATGCGGGAGCAAATGGACA

ATGCCGTGTATACGTTCGAGACCCTCCTGCACCAGGAGCTGGGGAAGGGGCCCACCAAGG 

AGGAGCTGTGCAAGTCCATCCAGCGGGTCCTGGAGCGGGTGCTGAAGAAA 

Sequence 147 

NACCACGCGTCCGCCNCGCGTCCGCTTGCACCCGGTGAAGAGCGTGCGTGTGCTGAGGCC 

GGAGCCGCAGACGGCTGTGGGGCCCTCGCACCCCGCCTGGGTGCCCGCGCCTGCCCCGGC 

CCCCGCCCNCGNCCNCGCCCCCGNCCNNGCTGCGGAGGGCTTGGACGCCAAGGAGGANCA 

TGCCCTGGCGCTGGNCGGCACAGGCGCCTTCCCGNTGGACGTGGAGTAC 

Sequence 148 

TCCCAAGAGCTGCANGNNNCAGCCGCGACAGCAAGAACCGGNAGAGCCGGCAGACCGCGG 

CGGCGGCGGCGNCGGAGGCAGGAGCAGCCTGGGCGGGACGCAGGGNCTCCGCGGGCGCAG 

GAAGGCGAGCAAGAGATATNCTCTGAGAGCCAAGCAAAAGAACATTAANGGAAAGGGAAG 

GAGGAAANGAAGGCTGGATACCGGNGCAGTGAAAAAAGGCACTTCCAAGAGNTGGGGGCA 

CTCAACTACGCCACNAGACTCTGACCGGGTGGCCCAATCAAGCCAATGAAGAAACCTATA 

ACCCGGNTTAACCTNTNATTGGCCTGCCTTCTNNNTNGGGGGGTGGGGGCCCAAGCCCNC 

TTAACCCCCCAACCTTCNNTTTCAAACCTATCCCAACCTTATTCNAAAANGAAGGGGANC 

CTNAAGGATGGGGNTTTCCCCCAAGGCNAAAAAGGAAAAAAAGGGGGCCCCCNNGGGAAG 

CCTTCNTTCTTGGGGAAAAACAAGGCAAAAAAAATTGGAAGCCTGAAAACCCGGCTTCAA 

AAAAAAGGGG 

Sequence 149 

GGCCGAAAGGGGGGCAGGTGGTCGGGCCGCGCAAGCGGAGATGGAATGGGGCCCGGGCTC 

AGACTGGTCACGGGGGGAGGCTGCCGGCGTGGACCGCGGGAAGGCGGGGCTGGGGCTCGG 

CGGGAGGCCACCCCCACAGCCGCCCCGGGAGGAGCGCGCCCAGCAGCTGCTGGACGCGGT 

GGAGCAGCGGCAGCGGCAGCTCCTGGACACCATCGCAGCCTGCGAGGAGATGTTACGGCA 

GCTGGGCCGCCGGCGCCCGGAGCCGGCTGGTGGCGGGAACGTCTCANCCAAACCTGGAGC 

G 

Sequence 150

CACGCGTCCGGCCTGCTGTTNACCTGCGGGACCCCAGGAACCTGGACTTGTTTCTCAAAG 

TGGTTCATGGAGATGTCACCCCCTACGACCTGGTGCGGATGAGCTCGATGCAGCTGGCCC 

CCCAGGAGCTGGCCCGCTGGCGGGACCAGGAGGAGAAAAGGGGCCTGAATATCATTGAGC 

AGCAACAGAAGGAGCCGTGCAGACTTCCAGCCTNCAAAATGACCCACAAGGGCGAAGTGG 

AGATTCAGCGGGACATGGACCAGACACTGACCCTGGAGGATCTGGTGGGACCGCAGATGT 

TCATGGACTGCAGCCCACAGGCCCTGCCCATCGCATCAGAGGACACCACGGGGCAAGCAT 

GACCACCACTTCTTAGACCCCAACTGCCACATCTGCAAGGACTGG 

Sequence 151 

TTTTTCCTTAGAATCTTCGAGAAAAAGATGAAGGTATTATTCTCAGTTTCGAGATCAGGA 

CTCCTCACCACCAGGCGGGGGCTTTAAGGTAGACACTACAGGGAATCTGATCTCAGGGTG 

ATCCTCTCCCTTCCACTTGCAAAAAGAGAGGAGCAGGTGGGCCACTGCTCTCTGAGATGT 

TAACACCCCTCACACTCCACGGGCATGCTTTGTCATTCTGCACACCGGTGTAGCTGCAGC 

TCTGTGTGAATTCAGATCTCAAGAGAAATGTAAATCAAAAGTATGAGTTTCTTTCTTTCT 
TGGGTGCCACAGTAGGAATGAAATGATGGGGACTTTTGGAAGCCCCTGGACTTGTGGCCC
CTGTAGAAGAGCAGCTTGGGCAGGGTGTGATGGCCATCTCTGTCTCTAGGGGCCCTGTGG 
A 

Sequence 152 

TGGCGAGAGCGCTGGTGCCGAGTGAAAGATAACAAGCTCATTTTCCACAAGGACAGGACC 

GACCTGAAGACCCATATTGTGTCTATTCCGCTCCGTGGCTGCGAGGTGATCCCGGGTTTG 

GATTGTAAACATCCTCTGACGTTCCGGCTGCTGCGCAACGGCCAGGAGGTTGCAGTATTG 

GAGGCATCTTCTTCTGMGACATGGGCAGGTGGATTGGGGATTTTACTCGCAGAGACGGG 

GATCGTCCACAGACCCGGAGGCTCTGCACTATGACTACATTGATGTGGAAGATGTCTGCA 

ANGTGTCATTCAGACAGGCCAMCAGACCTTTCTGTTTTTATGAACAGGGCGTGTTATAT 

GTGCTAACCCATATCTAGGGGGCACCTTCAACGGGTTATTGCCCACCCAGCGGGACGGCA 

CTTTAATATGACGATGTTTCCGTNCCATAAACCGGNTTGNTTAAAGGGTAAAA 

Sequence 153 

GTTCGACCCACGGGCGTCCGCGGGACCGCCGTGGGNNNNACATACTATGCGNACAGGCGC 

GTTGNACACAAANGGCCCATTCCTGTAGCCTCACACTTGACTACACATGGGGGANTCACT 

CGGATTCGGNTCTCCACGTGGNNGNTCTTTGTTCTGTACTCTACGTAGCTTTGGCTTTTG 

TTTTCTCGTCGCMCAGGGCATGAGACTTCGTCGACCTTNGGGGTCTGTATAGTCTTTGA 

CTTACTACGTGTAGGTCTCAATACAAAGTGGGANATANTCATATCCGTCCGCGAAAAGTA 

ATTCTTGGAAAAATTTACCCTTGTCTCCCGCNTTATGAAACGTGAACTAAGTAACTCACT 

TTGCCCCTGGGGCGCCTCTNTTTMCANTGTTCTTTTGNCGAAATCATCATAACCTTCAA 

CTGAAAACAATGTGGTCAACAAACTGACTATGGAGGTCTTAGGCTCNGTCTCTAAGATCT 

TTAACCTTGTTTATCGGCGCGTGCGGCGTNGTCCGAACGAAGAGACTATAACCCGCACTA 

TMCNAAAACTCTTTTTTAATCCCACCACCTCGTGAGGGANGGGCCCTAAGACTGAAACT 

GTAGTAAGTCCTATTGATTTGCGTAGGAGGANTTAGGAAA 

Sequence 154 

NCGCGTCCGATAGTCTACCAGCCTTACCTGGTTGATTACACTTGTAAAAGAAAGATTAAA 

AGCAGGCCAGTGACTCTGGTCTGCTTGAACATGTGAATGTAGTGGTTTGAGCAATCTGGA 

GTTTGCCCTAGTGTCAAATTCCAGACTGTCCATAGTGTCCAAAACCTGAGGCAGACACTA 

ATGTTAACCCCCAGCACCCCGTGATTGGAAACAAACCTAAATACGTATTGGGAACTTAAT

AGCAATTTTMGCATTCTGATAGATTTTTTGTAGGGATGGGGTCATGCCATGTGGCCCAG 

GCTGGTCTGAAAACTCTGGCCTCAAGTGATCTCAAGCTTTGGCCTTCTAAAGTGTTGGGA 

TTACMGGTGTGAGGCATTGCACCTGGCTTAGCCGTCTTGATTTGACATTGTAATGAAAA 

AGTGTGAGTCTTATTCTACCAGGGGCCCTTTTTTGTCCTCTTGAAAATNGAATAACCANG 

GGAAGGGGGAA 

Sequence 155 

CCNCGCGTCCGTCCATCACAGCCTCCGAAGGTGCTGGGATTACACGGCATAAGCCACTGT 

GCCCAGCCTGTTTrT MTMTGATATT MGTGGGTTTGGTTCATGTGTTATTMTCAGTG 

TTAATAATCGTAC I I I I I I I I I I I I I I I AAAAGAAACCATGGGTATTCTAAAATCAGGAG 

TCCAAATAAAAGAAAGTTCTCGGCTGTGCGTGGTGGCTAACACCTTGTAGTCCCGGCACT 

TTGGGG 

Sequence 156 

CGCGTCCGAAAGGAGTCGCGCCGCCGCCGCCGCCCCCTCCCTCCGGTGGGCCCGGGAGGT 

AGAGAAAGTCAGTGCCACAGCCCGACCGCGCTGCTCTGAGCCCTGGGCACGCGGAACGGG 

AGGGAGTCTGAGGGTTGGGGACCGTCTGTGAGGGAGGGGAACAGCCGCTCGAGCCTGGGG 

CGGGCGGACCGGACTGGGGCCGGGGTAGGCTCTGGAAAGGGCCCGGGAGAGAGGTGGCGT 

TGGTCAGMCCTGAGAAACAGCCGAGAGGTT7TCCACCGAGGCCCGCGCTTGAGGGATCT 

GMGAGGTTCCTAGMGAGGGTGTTCCCTCTTTCGGGGGTCCTCACCAGAAGAGGTTCTT 

GGGGGTCGCCCTTCTGAGGAGGCTGCGGCTAACAGGGCCCAGAACTGCCATTGGATGTCC 

AGAATCCCCTGTAGTTGATAATGTTGGGAATAMGCTCTGCACTTTCTTTTGGCATTTCA 

AGTTGTTAAAAACAAATAGGA 

Sequence 157 
CGCGTCCGGGTGTCNAGGCCATGGGGCAGCCCTGGGCGGCTGGGAAGCACGGACGGGGCG

CCCGCGCAGCTGCCTCTCGTGCTCACCGCGCTGTGGGCCGCGGCCGTGGGCCTGGAGCTG 

GCTTACGTGCTGGTGCTCGGTCCCGGGCCGCCGCCGCTGGGACCCCTGGCCCGGGCCTTG 

CAAGCTGGCGCTGGCCGCCTTCCAGCTGCTCAACCTGCTGGGCAACGTGGGGCTCTTCCT 

GCGCTCGGATCCCAGCATCCGTGGCGTGATGCTGGCCGGCCGCGGTCTGGGCCAGGGCTG 

GGCTTACTGCTACCAATGCCAAAGCCAGGTGCCGCCACGCAGCGGACACTGCTCTGCCTG 

CCGCGTCTGCATCCTGCGTCGGGACCACCACTGCCGCCTGCTGGGCCGNTGCGTGGGCTT 

NGGCAACTACCGGCCCTTC 

Sequence 158 

CGAGCACGCGTCCGGGGACTCAGGCATGCACCACCACGCCCAGCTMTTTrTGTATTTTT 

AGTAGAGACAGGGTTTCTCCATGTTGGCCAGGCTGGTCTCGAACTCCTGACATTCGGTGA 

TCCACCCGCCTCGGCCTTCCAAAGTGCTGGGATTACAGGTGTGAGCCACTGTGCCCAGCC 

CCTTCCTGTTGAGTAAAAGGAAGAACTTCAGGGTAAGACACTGTACAGTGCCCAGCATCT 

GGAGAGCCGCCAGCATTACCCCTGCCTTAGGAGGTAGTCGTCTCCTCATCACTACAAGGT 

ATTGAAGCCTGAGGGCCCCTGGGCAGGACGATAGAGTGAGATTGCCCTGGGGACTCAGGA 

AAGGAAACATGCCGTATTTNTAGGGAAGGAGCTGCTGCTGCCTCTCAGTGACTCTGGTTC 

CAGGAGGGAAGAGCCGAGAGCTAGGGTTCCCTTTCATAGGGAGAAACCCAGCAGGGTTTG 

GGGTGTTCT 

Sequence 159 

ACCACGCGTCCGAAAGGAGTCGCGCCGCCGCCGCCGCCCCCTCCCTCCGGTGGGCCCGGG 

AGGTAGAGAAAGTCAGTGCCACAGCCCGACCGCGCTGCTCTGAGCCCTGGGCACGCGGAA 

CGGGAGGGAGTCTGAGGGTTGGGGACGTCTGTGAGGGAGGGGAACAGCCCGCTCGAGCCT 

GGGGCGGCGGACCGGACTGGGGCCGGGGTAGGCTCTGGAAAGGGCCCGGGAGAGAGGTGG 

CGTTGGTCAGAACCTGAGAAACAGCCGAGAGGTTTTCCACCCGAGGCCCGCGCTTGAGGG 

ATCTGAAAGAGGTTTCCTAGAAAMGGGGTGTTTCCTCTTTTCGGGGGGTCCCTCACCAA 

GMGMGGTTCTTTGGGGGGTCGCCCTTNTTGAGGGAGGCTTGCGGNTTAACAGGGCCAA 

AAAANTTGCCATGGGATGTCCAAGAATCCCCTGTAATTTGATTAAATGGTGGGGAAATAA

AGCTTTGCAACT I I I ! I I I GCN ATTTAATTTGGTTAAAAACA 

Sequence 160 

TCCGCTCCCCTGTTTTCTTCCTTTTTCTTTTTG 

TCGTMGAMCAAAATGCCAGTATTTTCTTAAGCCATGATGTGAAACCAATGACCCTGTG 

ACCACATGGCACAGMCACTMATTTTGGTCCCATGGCTGAAACTTGAGGGTGACTAAAA 

GTAATGCCTGTGAMCATGATATCTATCTGGGATGGCCATTTGATCTCTAAAAGGAATTT 

TGTNCACTCCACAGMCTCCTATCTATAGTMAATTTGATTTTTC 

Sequence 161 

CGTCCGGAAAMTATTAAACMCTCATTTTMGATTCAAATTAACTAATTCCTGCATATA 

TGACATTCCTTACATMGCGAACACTAMCAMMTGGCTAGAAATGTCTTTTTCTTTCT 

TTTCTCTCTTTGTTGTTTAAGGTATTAAGCACCGAATTATTACATGAGACTGGCAGATAG 

CTATTAATCCTCTTACAGATTTGAGAMGTTGATTCTCAAATATTTATGCACCTTCTCCT

TCATTGTTTTCTTTAMTCTGTCCTCTTMAAAGCTTCTTAAGAGCTCAGTTMTGCTTT 

TGACTTAACTAGGAGAAAAAGGCATGATAATACAGGCAAGATGGCATTGTTAGCAATTCT 

GGTAGGTGGTTTTGGAATGAATCCTAAGAGGCAAGGGGATCTTAAGGACAAGGAAGAGAA 

GAGAGAGGGGGNGGGATCCCTTTGATCTCTTTCTCTGGNAATCTTAMTGCNTMTTTTA 

CTAAAACATGTTCTCAATTCATTCATAT 

Sequence 162 

CCCCGCGTCCGGATTAATGAGTGTATGCCTAGCTCTTTCTCCAGTTTACTTTTAGACCAT 

ATTGTTGTTTGTTTTGAATATCATTCCTTAGGCTATGTTGAGAGTAGAGTGGCTTCCCAT 

TAGGAGMCTMTTTAGG GCAT GTCTmGCTGAATCCCGTCAGCATATTTAACAAATTC 

CCAATTCTAGATMTTTCCTTTTATTTCTCTAGTACCCTTTGCCAGGGGCTCTACACATC 

AAAGGTGTTCATGAAGTATTTGTCAAAGGAAAGAACAGTAATGACACCTAACACATAATG 

AGTGATTAGTATGTTCCAGGCATTGCGTGAGCTATTTACTGTGAGTGATTTAATGTTATC 
TTCCCAGCAGACCTCTGAGGTAGGGTACTAGTATGATCCCCATTTCGTACATGAGGAAAC
TGACACTAAGGGACATAAAATAAGTTTTTGAAGTCACAAAGTGAATAAAAGGAAGACCAG 
GGTTTTAATTGGAAGCCCATA 
Sequence 163 

TAGACTTTTGCAGTGTTAAACACAGCTTCCTTAACTCTTAGMCTGGAAGTTGTAGAGCT

CTCCTTTTGGTGCCTTTCCAGCCTTTATACACACTATTGTAGCTTTCTTAGGTTTGATAG 

GTAGCGTTTCAAGTAGTTTAGCTGAGACAGNGAATGTATTAGGTTCAACATGACCTTGTG

TTTTATTrGTGTTTGC CAACAG GATGCCTTATTTGTTTGAGAAAAAGATGTACTAGTGTC 

ATTCTAAACTATCTCCTTTTTTAGGATTCTAMGMGTTMTCATCATGCTTTTGTTTAT 

TTTACCACCATTTAGTGCCTTAAATCCTATCAAGAAAGCAGTGTTACTGCTCAATGCCCA 

MTAAGACACGCGGATATTGCTATTGCTTGCTTTTGAGTTAACAGGCCNACTTTTTATAC 

TTAAAACCTCA 

Sequence 164 

GCCCCGAAGTCCCACTGTCCCTGCCGAGGCGCCGGCGCCGTCCCCTGTGCCCTTGACCAC 

GCCAGCCTCCGCCGAGGAGGCGATACCCCTCCCCGCGTCCTCCGACAGCGAGCGGTCGGC 

GTCCAGCGTGGAGGGGCCCGGAGGGGCTCTGTACGCGCGCGTGGCCCGACGCGAGGCCCG 

GCCGGCCCGGGCCCGGGGCGAGATTGGGGGCCTGTCGCTGTCGCCATCGCCCGAGCGCAG 

GAAACCGCCGCCACCTGACCCCGCCACCAAGCCTAAGGTGTCCTGGATCCACGGCAAGCA 

CAGCGCCGCTGCAGCTGGCCGTGCGCCCTCACCACCGCCGCCAGGCTCCGAGGCCGCGCC 

CAGCCCCAGCAAGAGGAAACGGACGCCCAGCGAACAAATCGGCGCATACGGTCGAACACN 

GGAAGCCCCCGGACCCGGACCCAACGCCGGGGCCCCCCG 

Sequence 165 

AGTCCGCCCCACGCGTCCGGTGAGTTTAGCGCTGCTGTCCGGATGGGTTGGTAGCAGACA 

GGGTGGAGTAGGGTTAAGCACACTGGTCACCTTAGGATTGGTTTCCTGGTGCTGGAGAAT 

GGTTAGGACACAGGCCTTGGMGGTTTTTTGAGTGTGAAATATTACTCAGCGTTTTCTGC 

AGACCTCGCGGGCAATGCCGCTTCTAATTTTATCCAGGCCTTCTTCTGTAGGGAGGGCCT 

GTTAAGAGTTGAGCAGCCCGATTTCTGAACCCCTCTAAAAAGCTGTGGCTGATTGGTGGC

1 1 I I I I I 1 1 ICTTGGAGAGGGGGGTGTCAAAGATTTCTTTAAAATCGTTAGTGATGTGGT 

CTCGCTTTA 

Sequence 166 

ACGCGTCCNAAMTGTGTGGTACATGGMTATTTTTTATTATGCTTATTTCGTATTGCCA 
GGTAGATGGCCAGCCTGACATTCAAAATTATTTTATCAGCCCCTAAATGTTAATATTTCC 
CMACTATTTAMTCAGTAGMGACATTTTTACTATTAAGAATAAAAAGTTATAATATAA 
MTGGATTAMTGCCAGATTATATGCTAMCMGTCCTTTAMTTTTTAACTTAATATTT
TTAACAGA I 1 1 1 1 1 1 1 1 IGAGATGCAGTTTTACTCTTGTTGCCCAGGCTGGAGTGCAATG 

GCACAATCTTGGCTCACTGCAACCACCACCTCCCGGGTTCATGCGATTCTCCTGC 
Sequence 167 

CCGTCCGCGAGGTTAGGAGATCGAGACCATCCTGGCTAACACGGTGAAACCCCATCTCTA 

CTAAAAATACAAAAAAAATTAGCCAGACATGGTGGCAGCCTCCTGTAGTCCCAGCTACTC 

GGGAGGCTGAGGCAGGAGAAGGGCATGAACCTGGCAGGCGGAGCTTGCAGTGAGCCCAAG 

ATGGCGTCACTGCACTCCAGCCTGAGGGACAGAGCAAGACTCTGTCTCAAAAAAAAAAGA 

AAAAAAAGTGGCACAGATCAATTATAAATCACTGCTTCAAGGCCAGTGCTCTCACTTTGT 

ACATTAATAATCTCAGGCCCAAATAAGATAAGTGATATGTCAACGTATGTTCACTTTGGT 

CTTTACATGGCAGCTATAGTATACCGGAATATTATAAGCTCAGATCGTCATAGCTACATA 

ACTCCTTTAGTTGGGAAGANACGCCGTAAATGCCCATCAAGANTAGCAAGTCTTGCATTT 
GACT 

Sequence 168 

CGCGTCCNGGTAACTTGAATAAGGATTATGTGCCCCACCCTTACTCTCATTCCTGCTTCC 
TCTTGGGCTCAAACAGGGTATGAGTATGAAGATTTTGCCTTTAGTTCCTGAACTGAACCT 
GCTTGCTATCCCTTTCCTCCCCACCACTACCTTATTCCTTCTCTGCCTCCAAATTGCCAC
TTTGTTTTGAGGCTTCCTTCCCTACCTTATTATTCTGAAGGAAGTAGAGATCTTGCTTCT 
GAAACCCCTCCTAAGAAACTGCCCAGGGACAAGATAAATTACAAACAATTCATGGGAGTT
TACTACCTAAGTTGCTTCTAGGGCATATGTATACCATACTAGTAGTCTAGATTTCTGG 
Sequence 169 

AGGGAGTCGCCCCCGCGTCCGGACAGGATCTATGGATGGAAGAAGATGAAAACAAACACA 

CAGCCATAAAGGTAATTGTTTCTGGCCMCATCTTTCTACTGATGCTTTGTTTTGATTGT 

ATGTTGCTGTTTATATTTTCTCAMCTTGAGGCTCTATTTTATGAAATGTTGAATATAAA 

TACATTGTATTTAACTTGAAAAATTCCTGGAAATATACCTGATAATTACCACCTGAGGAA

TCGhnTTATTTTATGAAAGTAACAGCGTGATGAATACTGTAATTACAAAAGAAAATTAGT 

ACTCACTGACTTATACCCTTG 1 1 I 1 1 I 1 1 1 1 1 1 1 ) I I GGTTTGG 1 1 1 1 1 1 1 I GGTTTTGN 

TTTAGGNGACA 

Sequence 170 

GGAGTCGNCCCACGCGTCCGGCTGGCCAGCAGACCCTGCAGCAGTCATTGGTGGTTCTGT 
CTTTCCCTAACCCTTATGNGCTGGAGGACAGAGGAGCCACCTCTTCTCACTGGTTGATAC
TCTGTTCCTAAGACCTCAGGTTACCCAGTGTAGCCTGCTGTGACGCTTCTGCTTCATTTT 
CTGCCAAGTGATACAGAACCTTCGTTCTGCTGCTTTGGGGACAGGTGATCCTAGCCCCAG 
CTTAGGGCCAGTGCTCTGTGCCACTCTGGAGAAATAGGGAAAAGATAGGGGTGGCTCAGT 
ACAGCAGCCCTGTGAAAGTCAAGGCCAGAGCTTTT CI I I I I I A A 1 1 1 1 1 I I ATT ATT ATT 
ATTATTATTT 
Sequence 171 

TTTAGGGAGCCGACCCACGCGTCCGCTTGGCAAACCTCCGGGGACTGTCAGAGGAGGAGA 

GGAGCGAGAAGGCTATGCTTCGCTCCCGCATTGAAGAGCAGTCCCAGCTCATCTGCATCC 

TGAAGCGGAGGTCAGATGAGGCCCTGGAGCGCTGCCAGATCCTAGAGCTGCTCAATGCAG 

AGCTGGAGGAGAAGATGATGCAGGAGGCTGAGAAGCTCAAGGCCCAGGGTGAGTACAGTC 

GGAAACTAGAGGAACGCTTTATGACCCTAGCAGCCAACCACGAGTTGATGCTCCGCTTCA 

AGGATGAATACAAGAGTGAGAACATCAAGCTGAGGGAGGAGAATGAGAAGCTGAGGCTGG 

AGAATAGCAGCCTCT 

Sequence 172 

CCACGCGTCCGCTTAGCCGCTGCCAGAGTTCCATATGTTCTGGAACCCTTGACTCCTAGA 

GTTCAGMCCCAGCCMCTTGCAGTTTTCAGAATGTTCAAGAAACTTCTGACACTCAGAG 

TTGCAGAACCTCCTGGTCCCTGCAGATTCCTGGAAATCAGAATATGGTGGTTGNAAAGAA 

TCTTGTGGCTGGGCGTGGTGGCTCACGCCTGTAATCCCAGCACTCTGGGAGGCCGAGGCG 

GGCAGATCGCCTGAGGTCAGGAGTTTGAGACCAGCCTGGCCAACATGGCGAAATCCCGTC 

TCTACTGAAGATAACAAAAATTAGCCGGTCATGGTGGCGCCCGTGCCTGTAATCCCAGCT 

CGGCAGGGCCGAGGCAGGAGAATCGCTTGAACCCGGGAGGCAAGAGGTTTGCAAGTGAGC 

CNAAAGATCGAGCCACTGCACTCCACCTTGGGTGACCAGAGTCTTAAAAAAAAAAAAAAA 

AAGG 

Sequence 173 

CGTNCGGTGAATATTCAGTMGCTMTTAAAATTTGTAAGTAAGATTTCCTTATTAAAAA 
AACTATTAAAATAAGANGAAGCCCGTTNAAATAATNATTAGAGNNGGAAAAAAGAAGATG 
ACTATCNAATTACAGCACT1TTTTTCAGCTATACATAAAGGCCTTTTCANT 
Sequence 174 

CGTCCGGTGACCCATTAAGTATATTTCGTGACCCANAGTTTGAAAGAGATTATAGGTGTA 

GCTTTGCTAGTTTGTAATTGATATAGAACAGTGACTATCAGGGAAGNTGAAGAACGGCNA 

ATTGAATGTAAATCATGTCTGGATGGTGAAGATTCTAAGAATGCANCTAGGGAAAGGGCT 

GCAAAAAGAAGGTGGCAGACTAATGTAGAATGGTGCAACCAGATGAAGACATGGGTGGCT 

TTAGGAATTCCAAAGTGGCCGTGAAGGCCAGGCACGGTGGCCCACGCCTGTAATNCTAGC 

ACTTTGGGAGGCCAAGGTGGGTGGATTGCTGAGCTCGGGAGTTCGAGACCAGCCTGACCA 

ACCAGGTGAAACACCATCCCTACAAAACAT 

Sequence 175 

GGCGGCGGCGGCGGCGGCGGCCGGGACCCAGCGGGCCAGGTGGGGACGGCGCGTNGCGGG 
TGCGGGAGATGCCGTGCGGGACTGGGGCCACNTTGAGCCGCCCGNCTCGTCCCCGCCTTC 
TGTGGGAAGGATGTGCGCGCGGATGGCCGGTCGCACAACAGCGGCCCCTCGGGGGCCCTA
CGGCCCCTGGCTCTGCCTCCTGGTGGCCCTCGCCCTGGACGTCGTGAGAGTGGACTGTGG 
CCAGGCTCCCCTGGACCCTGTC 
Sequence 176 

GAGCTGGCTGGTGTTTGAGCTGTGGCAGAAGCACCTGGGGCTCCAGGGAAGCANGCTGGG 

AACTGCAGGACCTTGCTCAGCCAGGAGCACTTCCCCCTCCTTGAGGCAGGAATACTGAGG 

TGCCTCCCCACAGATGGAGAANGTGGAGAGGAGGATGGGCCTCAGGAGCATCTCAAGCCC 

CAGTAGCAGGANAAAGAAAGAAAGAGATGCCTGGTTTTCACAGACTGGTTCCTGTGGCTG

GGATGACTGCATCC I 1 1 1 1 1 I 1 1 1 I I I 1 1 1 GAGACGGAGTTTTTGCCTTTGTCGCCCAGG 

CTGGAGTGCAATGGGGTGATCTCGGATCACCGGAACCTCCGCCTCCNGGATTCAAGCAAT 

TNTCCCGCCTCAGCCTCCCGAGTAGCTGGGATTACAGGCACGCACCTCCACCGTNCGGCT 

AATTTTGTA 

Sequence 177 

CCTTGTNAGGGGACACAAAGAAAAATTGAATAAACTGTATGATTTAAAAGATTATCGGGA 

GAGTTACCTCCCGATATAAAAGGAAGGATTTACAGAATGTGACCTAAGGTCTGGCGTAAA 

TGTGCACCGGAACCGAGAAGGCCCGGATTGTCATGGACGATGAGATACACCGGAATATCA 

TGGACATATTCTTTAAAGCGCCCTTTATCTTCAAATGCGGCACGGAAACCGGAGGCTTTG 

AAGAACTCAAGGAAGCGCGGCACGATACCGCCCGCAATAAACACGCXJGCCAAATGTCCCG 

AGATTGAGCGCCAGATTGCCGCCAAAACGGCCCATAATGACGCAAAACAGCGACAATGCG 

CGGCGGCAATCGGTGCAAGCTGTCAGCCAGCGCCGCGTTCGGTAATATCTT 

Sequence 178 

CACGCGTCCGACCGGAAATGCTGACCTGACCTTTGACCAAGTACGNGCGGTGGGGGGGGG 

GGACAAGTGGGGTGGTGGTATTAACTGGCTCCGGTGGGTCTTCAAGCCCCAGGAACCCTC 

CAAGGGGGAAACAAATGGAGGGCCCTAACGCAAGAAGCTTCAATCGGTCCCTTGGACTGG 

GCTTCTTTCGGTGGGTTGGTGGGTATGGCCCTGGGCTGGCCTTTNCCTNAATCTTTCCTT 

CCCTNCCTGGGGGCAATCTGGCTGGGTGGCCAAGTGGCTTGCCCCGGCAACAACTTTGGC 

TGGCTGGCTTACGGTCAANGGGTGGCCCCTGGCTTGCCCCAAGAACAAAGGTGGCNTTGC 

TTTGCCCCCGGAAGGGCCCCTNGTAATTGCCCGGCCCCGGGCCAAAAAAGCCAAGCCACC 

TTCMGGGTGGTTTCCCAAGCATTTTTAATTGCCCCCCCAAGCAACCCTTAATTGCCCAA 

CCCTTGTTCTTTCCCCGGCCCAAAAGAACCCCCNAACCCCCCCAACCCAAGCTTATTGGA

ATTTTCCCAAATGGGGGCCCCCCTTGGCCCTTACAAAACCGGGGGTTTACCCCTTTGGGG 

AGGGGAATTAACCCCCTGGGGAAGGAAACCGGTTTTGGACCANGGGGAAGGTAAGGCTTT 

AAANCCTTGGGGTNGGGGCCCAAAAGGGCTTCCCCTAAATTGTTAACCCCCCTTGGCNTT 

TT 

Sequence 179 

CGTCCGGAAAACTGTTCATCTACTCACTGTAGTGCCCTCCTTGAAATGTGTGTTTGTTCA 

TTCAACTAACAATATTTGGGGATCCCTGTAGTAAACACTGTATGAATTTTACACAGTCTG 

GCCATCAAGAAAGATCACGGAGTATATTCTAGATGGGGAGGCTACTAAGTGAATAGGAAT 

CACCACGCTGGGCTGTTTATTAGGTACAGTAATAAACATAAGTACTGGTTGCAAAANAAA 

ANAANAAAANAAAAAA 

Sequence 180 

CCNCCGCGTCCGAAMGACAAGACAGCATACTGTATTTTTCCTCTTAAAATTCAATGTTA 

CMTTAMTGATTGTTITrCTGAGMTMGTTAGCTTCAGCTTTCTAATCGATGTGTTCCC 

ACATCTACAAATTGATATGAAAAATTATTTTGAAATGCACACTGCAAAATGGTGAGAATA 

TGAAAGTTACCTGGGAATTAAATCAGAACTGTCTCCATATGACTATTTCCAAGTCACAAT 

CATMCTTTCTTAATAGCAATGGTTATATATGTGGCCAGATAGTATTCAGTTTCACAGTA 

ATGTCTCGGTCACATAAAGATAGCANAGCATAGACATAGTACAACAATTTATTATTTCTG 

CTGATTGCCAAATGTGCATAAAACTATAAAGATATATTTTCCAGCCCAGGTGACAGAGAC 

CCTGTCTCCCNTTTNAAAANCTTCATGNTAAAGGTGCGGCCGCTAGACTAG 
Sequence 181 

CGCGTCCGCTAATCMCTTTTAAAAATAATGTTTTACGGCCGGGCGCGGTGGCTCACGCC 
TGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGCGGATCACGAGGTCAGGGAGATCGAGA

CCATCCTGGCTAACACGGTGAAACCCCGTCTCTACTAAAAATACAAAAAATTAGCCCGGC 

GTGGTGGCGGGCGCCTGTAGTCCCAGCTACTCGGGAGGCTGATTTTTTATTTTTTGAACT

TTTTTACAGAGAAGGGGGGTCTCCCTATGTTGCCCAGGTTGGTCTCAAACGTTTGGGCTC 

AAGTGATCCTTCTCACCAAGAGTGCTGGGGATTATAGGCATGAGCTGCTGTGACCGGCCC 

AAAATCAAATTTCAAAACTAAAAAAATTCTCAGATAATTMTGAACCTACGTGAATTAAT 

CTACAAATTCCAGTTTGAAAAACACTAAAGATAAACAACTGTCAATGTGGGAAACTTAAA 

AAGTANGTTGGTATTTAGGTTATtGGTTAAAATGGGGGACCGACTGGCATACACAGTCCT 

AAATATTTAAGTCTTAAG 

Sequence 182 

CCCTATCTTNCGGTACTGGTGGGCCAAATTCCTGGGACCAGGTCAAGGTGGGCTGCCTCA 

GTAAGAAGGAAGGACTGGAGAGTGCCCATTTCAGAGGAGCAGGCTGGTGGGGGCCAGCCA 

GAMGTAGTTCCCTTTGGGGGGAAGATGTTGGACCTTTATTATTTGTGGTAACCAGCCGA 

GGCTGGTTGTCAGGACAGCAGGTGAGCCACTTTAGGGAAGAAAGTGCAGGGGTGGGTGGA 

TGCCCAGATTACCAAGGCCAGCCACCCTGATGGGGTAGGGTCTGGTTATCTGTGTTCAAG 

AAGCAAATCCCACCCCAGCCCCAGCACTAGCTCTCTATGTATGTATTTTCCCTGTACAAT 

GTTTTATAAAAGAGATCATTAATTTAAAAAAAAAAANAAAAANA 

Sequence 183

TCGTCCATTTACCTCACTTATGGGGTAAAAGGTCACTTCAAGTAAGGTTAAAGGTTTTCC 

CTGGCAAAGGACCTAACCAGAGCCCCNAAGGGGGGGAAAAAAGGAGGTCACCTTTGGGGA

GGTACCATGGCCCrmTTGGTCTGGCCCNTTGGGCNTCTTCAACAACAAAGGAATATTTT 

ACCAGGCCCTTTTGGAGGCCTTTGGATMTTTCTTAAGAAATTTGGTTACCAGGAAGAAT 

TTAGGCTTCNTNGGNAAAAAGGAAAAATTAGGACCTAGGAAAGGGAATTAAAGGGGGNAA 

GGGGAAATCAATTAAGCNTTAATGGAAAGGGGGTTTTACTTCTGGCAATCAAGAACCGGC

I I I I ICNTAAGTTTCNTAATGGAACTTTAAACCGGTNCCTAATANGGGCTNGTAAANGGG 

GTTGTCNTGGCGGTGGNAAACCACCTTTCTTTTTCCTNGGGCCCTTCCCTTTTCTGGNCC 

CGCAATTTNCCCTTCTTTNAMCCCTTCAAAGTTTGCCCTTGGAGGGTTTTTAATTAATT 

CCCCCTTGGTGGCCANTTCCCTTGGGGGGCCMTTGGGTTTCAATTTCCANCAATTANTG 

GNAAAMCCAAANTCCAAGGGGAAGGGACCCCCTTNGGGCTTAANTTTTTCCTTTTAAAT

CTTCCTGGGAATTTTNGGATNGGGGGAAAAAAAAATAMTTAAAATTCCTTTTTGGGGCG 

CCCTTGGCCMGTNGGGGGGGAAAMTMTTTNGGNTTTTNGGGGGGGGGGNTTTGGGGA 

ATANCC7TACCAAGGMCCCCTTCCTTGGATNCCTTTTGGGGGGTCTTTTCAAAAAAT 

Sequence 184 

GCGTCCGGTTGTAGTTTCCCTTCCCATTCTCTTGGTGGCCCTGGAAGCTTCTAGGCACAA 

GTGTGCCACCCTGATTATTCCNACCACTCCATCCAACTTTTCTCTCTCTGTGGGTGTCTG 

CACCACAAGCTGCCTACCCTCCAGGTGCCTCAATGGTCCGGCCACCAGTTGTGCCTCGGC 

GCCCCCCGGCCCACATCAGCAAGTGTCAGGCAGGCCTCCACCCAGGTGCCACGCACGGTG 

CCTCATACCCAGAGAGTAGCCAACATTGGTACTCAGACCACAGGACCCAGTGGGGTAGGA 

TGCTGTACACCAGGCCGGCCGCTCCTGCCGTGCAAATGTTCCTCAAGCAGCACATAAGCA 

CCTATCGGGTCCAGGAGCCGGCTTGTGCACATNCCAGGACAGGAGCCCCTGACCGCGTCC 

ATGCTGGCTGCGGCGCCCCTGCATGAGCAAAAGCAGATGATTGGGGAGCGTCTCTACCCC 

CTTATCCATGATGTCCACA 

Sequence 185 

GTCGCCCCGCGTCCGGGCATTTGTATTTTCAACAATTGTTCTCAAATTTAGAAMGAGAC 

ATCGCAAGATGGTGAAATAGGAAGCCCTGGGCCCTCCTTCCCTCCACAAACACACTGATT 

TGACAACAGTTCATGGACAGATTCCCTTTATAAGAAACCAAGAAACTGTTAAGAGGCTCT 

TATACCCCAGGTGAGTGCAAAATCATCCACATCAAAGATAGCTGGGAAGTTCAAGACACC 

TTCTTTCCGTAATTTCTAAACTGGCACAGTACTATATGATTGAGATGAATCTCCCCACAT 

CCAAGCTTCCTGCTGGGGAGGAGAGGGGAGGGTATACCATTATGTCCAATGTTCCAACTC 

CTCTAGGAGCTACCCAGGTAGGAGGTGGGTCCAGCTCTGTCAGGCTTGTCTTAAGAGCAC 

TGATTGAGGGTCTGGTATTCTCTAAGTGGCCCAGGACCATAAGAGCAGTGGATGGTGCTG 
GGGNTNNGNNGTGGGTTACCCATAACCCCCTGGTTTTTGG
Sequence 186 

TCCGAAGCAGTGAAAATAGTGTTTGACTCCATTGACAACCTAGAGGCGGCTCCTCATGAT 

ATCGGCTACGTCAAACAGGCCATGTTCCACTATTTCCAGGTGCCAGATCGGCTAGGGATA 

CTCACTCACCTGTATAGGGACTTTGATAMTGCACGTTTGCTGGGTTTTGCCGGAAAATT 

GCAGAATGTGCTCANCAGGGAGACCCCCTTTCCCGCTATATCTTCAGGAAGGCTGGGGAG 

ATGCTGGGCAGACACATCGTAGCAGTGTTGCCCGAGATTGACCCGGTCTTGTTCCAGGGC 

AAGATTGGACTCCCCATCCTGTGCGTGGGCTCTGTGTGGGAAGAGCTGGGAGCTGCTGAA

GGAAGGGTTTTCTTCTGGCGCTGACCCAGGGCAGAGAGATCCAGGCTCAAAACTTCTTCT 

CCAGCTTCACCCTGATGAAGCTGAGGCACTCCTCCGCTCTGGGTGGGGCCAGGCCTAAGG 

GGCCAG 

Sequence 187 

CGTCCGCGTCGCTCCCCTGCTCGGGGTCAGCTAGTGTCCGCTCTGCTCGGCCGCGGGCTC 

CCGGAGGACTGCAGGCAGGATGACCGCAAAAACACGGGTGATTGGTGAATGGAAGTTGCT 

ATGGGCCTCTAAGGGCCATCCCAAGCCCAACCCAACGTTAAACGGTCCACAATCCACCAA

GGAAGTCAAGCTTTTGACACAAACTGCTGAAAAGCTGGGAGGGTTTCTTCTGAAAGAAAG

TTTCI I I I I ICAACCCTGGGGGACACTGGTGCCCTTTCCACAAGCCAGGGAATTGGGTTT 

ATGAAGCAAGCTTGGCTCTAAGGGGGGTGACCTCAAGATATTTGCTGGGGGGTTGTGAGG 

TTTGGTGGTTCTTGGGAAGTGTGTCTCAAGCTTTGGGGGCCCTGGAACTGTGCTTGAAGT 

GCCCTCAGGCCTGTGCCCTTCTGGGGCCGGGGGTCTTGTGGGTGNATNCGCAAGCANGGA 

AGCCTGGGGGCCATTGGTCCATTNAAGAAGGCACCCCGGGGCCAAACCTTGCTTGGCTAT 

ANTATTCCAAGCCTGCTTCAACCCNTGGGCAGGCTTT 

Sequence 188 

TCGACCNCGCGTCCGGCTTCGACGCCTTCCCTAACATCGAGAAGGTGTCCAAGATCACGT 

CTCCCGTGCTCATCATCCACGGCACGGAGGACGAGGTGATCGACTTCTCGCACGGGCTGG 

CGCTCTACGAGCGCTGCCCCAAGGCGGTGGAGCCGCTGTGGGTGGAGGGCGCCGGGCACA 

ACGACATCGAGGTCTACAGCCAGTACCTGGAGCGCCTGCGTCGCTTCATCTCCCAGGAGC 

TGCCCAGCCAGCGCGCCTAGCGGCGGCCCCAACCGGCCGGACCTCAGCAATAAGGCGGCC 

CCCGGACCTCACCCCGCGCCGGCCCCCACCCAGGGGCTGCATGTGGACCCCCCGGGCGGC 

CCAGGGGACCCCGCCCCGACCCAGGGGCTGTGGACGATGTACAGGCAACAGAGCTACCGC

ACTCCTTTCCTTTTGGAAGCAAGAAGAAAATACGTGAAAACGGGAAATTAAAGATTTAAA

AATTTTTTNNNIMNTNNANAAAAANNAAAGTGCGGC 

Sequence 189 

CGCGTCCGAAGCCTTTTGTCTCAGAGAATTTATTGTCCTGACAGCAGAGGCCGATGGTGG 

GATCGACTGGCCCTTAATTTACACCAGCACTTGAAGCCGCCTGGAACCCGACTATCAAGT 

GCATCACAGAGGGGCTGGCGGATCCCGGAAGTCAGAACGGGACACCGNCTTTCACTGTAT 

CAGCGAGCCGTGCGCCTGCGAGAGTCTCCGAGCTGTAAAAAGTTCAAGCACCTCTTCCAG 

CAGCTCCCAGAAATGGGCTGTGCAAGATGTGAAACACGTGACCATCACAGGCAGGCTGTG 

CCCACAGCGTGGGGATGTGCAAGTCTGTGTTTGTGATGGAGGCCCGGGGGAGGCCCGCTG 

ACCCCACCACGGTCCTGTGCTCTGTGGAGGAGCTGGCACTGGCCCATTACAGACGCAAGC

GGTTTTGACCAGGGGATTCATGGCGAAAGGGTCCACCTTCAGCACCCTTGTATGGCCTTC 

TTNCTGTGGGGACATCATCTTCATNGGATGGGATTCCCGGATGTCTTCAGAAACGCCTGT 

CANGCATTNCCCCTGG 

Sequence 190 

CNACACATGCGGAATCATAGGCCTGCAAAGCTCCTGCTATTCACTATAACTCTGCCATGC 

CTTAGGCACTTCCTAACCTAGAATTCTGAGTGAAGGACAACAATAACTAATACTTTTGAT 

TCAGGTATTACAAAGAAGTTAAGAGTTCATAAGGCACCTAAGTAAAGTCACATTGGTTAA 

GAGTACATGTCTCCAGATACTCTTACATTTGCAAAGNAATTGCATTTCTGNATCTATGGT 

CTGTAMTAMATTGMGAGTTGNGAGMTAAMGCATGTTGTCTTTGATAMTTGTTTT 

TACAAAACAGGCACMGAGAGGCTTGMGGGTCCTTGCTATCTTTTAACCTATTTTATM 

TCTTTGCTGCATMGAAACAAATATGCTTATTTACATTCTATACTTAAACATATTATCAA 
ACTTTTATTCTGMGATMTACACATGMGGCTTTAMCTTTMTCTTCMTATATTTTT

TAATCTCCCTGATGAATTAGGGAAATAAATATTGGG 

Sequence 191 

TCTCCCCCTTCACCCTCCI I I I I I I I 1 1 1 1 1 1 I IG I I 1 1 1 I CTTAAAAGCATTTAGCTGG 

GTGCAGTGACACATGCTTGTAATCCCAGCTATTGAGGAGGCTGAGGTAGGAGGATTGCTT 

GAGTCCAGGAATTTGAGGCCAGCCTGGGGAACATAGCAAGAGCCTATCTTAAAAAAAAAA 

AAAAAAGCATTTCAATTATTTTAAATTTTNTMTTACAAAACAATTCTTTCCTGTCTTAG 

TTTAGTTTACTTTTTACTCACAAAGTTCCTGGAAAGTAGTTATTACTCAATMTTGAATG 

CATGAGTGTCAACTGCAAAATCTATGCATTATGTAGNGATTTGAATCAATTAGTCTTTNT

TGATACTCCMAATTACCCTTTTTCGAGNGTCTTTATCANAAATTTGATAAATCGGAACT 

T 

Sequence 192 

TCCGCCCCGTGAGGGGCAGCTGTGGTGTGGTGTGATGATGCTGGGATAGGCACTCAGATG 

GTTGAGTCTGAGTTTGGCTCTGATATTATATCAACTGCATGGGCAGCTGTGGTGTGGTGT 

GATGATGCTGGGATAGGCACTCAGATGGTTGAGTCTGAGTTTGGCTCTGATATTATATCA 

ACTGCATGGGCAGCTGTGGTGTGGTGTGATGATGCTGGGATAGGCACTCAGATGGTTGAG 

TCTGAGTTTGGCTCTGATATTATATCAACTGCATGGGCAGCTGTGGTGTGGTGTGATGAT 

GCTGGGATAGGCACTCAGATGGTTGAGTCTGAGTTTGGCTCTGATATTATATCAACCGCA

TCATTTTGGGGMGACACMTTTCTCAGAATTTTATTTAAGTTGTAAAAATAA 

Sequence 193 

ACACATTGCGGCACCGGGCTGGGCCTGGCCATCGTCAAGCATGTACTGCTGCGCCACCGC 

GCGCGCTTGGAAATCAGCATGTGTGCTGGGCCATGGCAGTACGTTCACCTGCCATTTTCC

GCCAGCTCAGGTGACGCGCACACGGCTGGTCGGGAATGATGAATAACCTGAAGTGACGAC 

GACCCAATGTGGGAGCTGCTGATACTTTGCTGTCTCCGCCCAGCAGTGGGCACTAGGCAA 

GGGCCCCATCAGCCGCTACATTGGCCGACTTGCGCCTGCCTTTCAGGCCCTCTCGTCACC 

CCTTATTGAACACACGGAACCTGCAAAACCCCATCATGGACCCTTCCCCTGGTATTACCC 

TCGCTACACTCTTCGCCGATTTCGGCATGATTCTTTTTGCACTGATCCTGGTACTGCTCA 

ACGGTTTCTTCGTTGCGGCGGAATTTGCCATGGTCAAACTGCGCTCCACCCGGGTCGAGG 

CCATTGCCCACACCAACGGGCTGGCGCGGGCAGATCCTGCGCACCGTACACAGCGAGCTC 

GACGCCTACCTGTCGGGCTGGCAGCTGGGGTATTACCCTCGCCTTCCTGGGTTTGGGTTG 

Sequence 194 

TGAMTCTCCTMTACACTGNGTTTTTATTGTTATGTATTCTATGTTTTAAAGCTCCTCA

AGGTATTGTTATTCTTTTGTATAATCAGTGTTTGTTTGGAGCTGTCTGTAAATTTTCCTC

TTTmGCTCTTTATTTTTTCTTTCMCTCAGACCTTCCAAAATGGGATTATTm 

TTTCTTTAGAGACATTTGCTTGATMCAGMTTTTMTTTGGCGGCTATTTATTTTTTTC 

AGCACACTGCAGGTATTAMCTACTTGGCTTCATTGTTGGTTTTGAAAAATCAGCTGTTG 

TAATGTTGCTCTTTGAATATMTCTGCCTTTTTCTGTACTTGCTTGNCTTTMGTTTTTT 

GTT 

Sequence 195 

CGCTCCCTGGTTTCTTGTCTCATGAANAAGAAAAAATCCTAACTGTTCTTGATGATCTTT 

AAGGCTCANAATGATCTGGACAGAGGTATTTACCTTGAAGCTCATAAAGCATAGGCCTTT 

CTCACTTGCAGAGTATCTTTCTGAAGCTGGACTAAATTGGTTAAGGCCACTGTACTTTTC 

CACTTTGTCTTCTTTCCTGTCACACACCCTATCCTTTCAGGCTGTGTTGGGATGGACAAA 

AGCAGTTCTGGAGTCCTAAGGAGAAGATGAGTGGGATATGTTTCTGTGACCTGCAGTCAT 

TTTAAAGTTTAGCTGTTGCTAGCTGACTCCATGTAAGAATACCTTCCAGGAATTTGATGG 

CTGTGCACTCTGGCAGTGCAACTGGCATGGT 

Sequence 196 

CNCGCGTCCGGCCCTGATTGATGAAGCACAGTCAGTAAATCATCTCTTCATTCCCCAGTT 
CTTAAGCCAACATCAGCAACACTGAGAGAACATTAGATTAAAGGCAGGTATAGAAGAGAG 
ACTTAGGGTAACAAGTTAGTGGGTGCCTGAAGGCATGTGGGAAGAGATGTGGTAAAGGTG 
TGCACACTATTCCTTACGTGACTTAAAATGTCAAGGGGCTTGGTGTGGTGGCTCATGACT
GTAAAACTAGAGCACTTTGGAAGGCCAAAGCTGGAGGATAGCTTGAGGTCAGAAGTTCAA 
GGCCAACTTGGGCAATATGGCAAGACTGTGTCTCTACCAAAAAAAAAAAAAAAAAAA 
Sequence 197 

CCGTCCGGAGAGGCTGTTNTTGATNGATACTCCAGAATAGACACGGGCCCTTGAAAAGCT 

TGTACCTTTGGGGAAAAGTAGACCAGGAAAATCTCAAGCCAGAGAAGGACTATGAAGCTT

CTTTTTCCAGTTACACATGGAAATAAGAGGTCTTGCATGAGAAACCTAAATCAACCCCTT

TNTNTTTMGCAGGTTTTGGAGTCTAAATTTATACTAATTGCATTGTGGAGCAGGTCTAG 

CCAAGAAATTAACATGACAATCAGTCTCTAATCTGTCTGTCTATCTACTTACTTACCTAC 

GAGTCAGGCCAAAGAGATTCCCACAAGTAGGTTAAAAAAAGTAAAGTCTGATAACAAATA 

ATTAGTAAGAATGAATGAATAAATAAATAGCACAAAGGAACACTCTACCACAAGGAGGAG 

TACTCATCACATCAAATGAGAGAATACCCTAAGAAAGTGAGATAATGCCAGCAATCTGAA

AGATATTTTACCNAMCGTGTTTAAAATGNTTAGGTTATTAAAAAAAAAAAAA 

Sequence 198 

CTAAGCTGTTATTTTCCCTAAAAATGCTTCCCTTGCATTTATACTTATTAAGAATAGATA 

ATAGCTAACATCTATCCACTGCCTTTGATTCATGAGGCCCTCACGAATTGCCTCATTAGG 

TCTCCGACAGTATGGTACAACTACTCTTTATATTTTACAGATGAAGAAACTGAGGCTGGA 

CTGNTACCTTTTTGCTACACATCCTAATG 

Sequence 199 

MCCTGTTTTGTTAGATGTGMTCTAGGAMTACAATATATTTTTAATGTAAAAGNACTC

TTGCTTTACTTGTAMCTGATTTTCGTTTTTTTCCCCTCAGGCCATCAAGCCCTGTCGTC 

CTATGACCAACAATGCTGGCAGACTTTTCCACTACCGGATCACAGTCTCCCCGCCTACGA

ACTTTTTAACTGACAGGCCAACTGTTATAGAATACGATGATCACGAAGTATATCTTTGAA 

GGATTTTCTATGTTTGCACATGCCCCCCTGACCAATATTCCACTGTGTAAAGTAATTAGA 

TTCAACATAGACTACACCGATTCATT 

Sequence 200 

MTGCAAGMCATCTGGATNAAATGGCTTrCTAGATGAGAATGGTTGCATTTTTTAATGG 

CTATTCTGGTAGAAAGGACAACATGTGATATTCATCGACCCTCTTTCTCAGACCCTCTTT 

ATMCAGCTAGTATGGAAAMTCTGTCTTTCCTATAAATATTTTCCCTGGGGAATGG^f^" 

CCCATTMTGTGTNGGATGCTTTGNTGTTTCTCTCTTGCACAGCGTGTATGTTGTCATCT 

GCACTTAAGAAGTGGAATGGAATTGGGAATCCTGGTTTCTTGCTCTGGCTGGAGGTCCCA

AGTCCGCATTGTGATTTTGGGAAGTCACCGTTGTTCTTCTAGATCTAATTTAACTCATCT 

GTAMANMNNNGGNTNGMTTCCGACMTTTCATGGGTTTCATTAGATGCCAATATTTT 

ATGACTCATGATCCCAGCNAGACTACACCTATTTATMGGCTGGTTTTGCTTGTTTTTAC 

TAAGAGCAACAATNACTACATATTTTCAGGTTACTCAATCATCAAAAAAATTATAAAATC 

CATAAACACTTTGGATTTGAAACATTGCAACTTTG 

Sequence 201 

CGTCCGAAAAGCAGTCTTTCTTGCTCAAAGTATTAANGGTGAACMTTGAATAGAGTACT 

GTGGTCGGGAGACTGATTTGAGACTGCAGAGCTGATGCTGGGTAGAGGGTCTGGACTTGT 

ATTCATGTTCTGTCTCAGGGCAGCCCCTGGAGCAGGAGATGGCAGAGGCATTTACAGCTG 

CAGAAAACAGGGAGGAATGGAATCTGAGGTAGCCCTGGCCTCAAAATTCAGGCCTGGCTG

TATCATTTACAGAGATTTTTCTGGAGGGAAAAAGTCTCATTTCTGAGGAAGGCAAGGNGG 

GCTAATCATTATTA A I I I I II I I AAACTTTTTG 

Sequence 202 

GCGTCCGGTTGMGAGGGCAGGGAATAGGGGTGGGTGAGCGTGAACAGAGTCAGGCTGAT 

TGCTGCAGGGTCCCTTGCATTAGTTCAGGTGAGAAGAGACCCGAGTAGGCCAGTGAGCCT 

GGAGGAGAGGCTCTCTGTGTGTTTMTTGGTTTCCAGCTTTTTTTCTCTATTCATGTAGG 

TTATACACGTTTCTTCTGTGAATTTTTATTTAMTGATTTTTTGTTGTTACTGGATCTAC 

AAACAGCCCAACTCCAAGGAATCTGGCATCTCTCAGTGGAGCATACAGGTGACTTCATAA 

TCTAACCGCATTAGTAACTGCCAAAATCGGAAGTAATTTCTCTCTGTTTAAAAGGCAGTG

AAACAAATTTTCAGAGCAGGTTTCTTCMCTGAACAAAATATTTTGGACCTTAAAGGTGG 







TATGGCTCTCCTGATCAGGGAGGGACAGTGAAAGGTTTGAGCTCTGACACTGNCCAGCTC 
TCTGGATACAACCAAGTGA C 1 1 1 I 1 1 1 I G 
Sequence 203

CGCGTCCGAGTATATAGAAAAACCATTAATTTAGACTCTGTGAGATTAGGTTGCATGAAG

AAGGTTTTCTGAATATTTGAAGAGTGGATAAATAAATGTCCCCCAAAGCAATAAAATCAT 

AATCCTTTAAAATATAGGAAAAATAACTAATGGGAACTAGGCTTAATACTCGGGATGAAA 

TAATCTGTACAACAAACTCCCATGACACATGTTTACCTATGTAGCAAACCTGCACATGTA 

CCCCTGAACTTAAMTAAAATTTAAAGTAATAATAATAAAAAAAAAAAAAAAAAAAAAAA 

Sequence 204 

CGCGTCCGCTGGGAGGCTGTGGGGTCTGCCACCCAGCAGATCTGTGTCACGGGAGTGGCG 

CTGTCACTCGTTGAGGTGGTGGCCTGGTTCCTTTGGCCTTAGGGAAGGACAAACTTCAAC 

TCTGAGCCTTGATTGAGTGACCTTGGCCAAGTTACCTAGCTTTTCTGAGCCTCACTTTTT 

TGGCNATTAGATGAACCAGAGGTTTATTTCACTCAGAATCCTGTTCACGATGCTGGTATT 

TGGACCAGCCTGCGGGTTTATCCTGGGCTCTTTCTG 

Sequence 205 

CGCGTCCGAAAAAGGATGAGAAGAGAGGTGCATTCCAGAAGACAAAAGGTGTGTAGTATC 

AGGATAAGGGGCTTTAAATATCAGATCCAGAGAACACTGCACATGTAGAAATGGGCTTGG 

CCTGGGTCAGGGCATTGAGATTGGTTACATAATCTTTTCAAGGATTGGTGAATGAGTTGG 

AGTATGTGTAGAMCCTACAAAGATGACAGTTTAATCTCATGTCATAATTTTTAGACAAA

TAATGTATTTTAAAACTGGGTGCAGTTCCTAAAGCTGTTCTAAAAGTCAATGCAACTGAA 

TTTGGAATGTAAGCATAGGACAAACAGATGGGAAATAAGTCATGACCTCTGTGGGATAAA 

GTGAGAGTTATCMAGMTGTCAGTGTTTATAACAAGGAACAAGCTTGTTTTGGAGAATT

ACTAGATATTATGGAAAATTTTTTTCTTTTCTACATTTGGGTAACTATAGCTGAACTATA 

GCAGATCATATTGACTTGGCAAAAAA 

Sequence 206 

CCNCGCGTCCGGTTATGTACTTTTGGAGACTTCCATTAGAAATATTGGCAAGTCCCTGCT 

TCGTGGCCATAGATTTAAMGGCCTATCMTTTTAAATGTTTCGGTCATTGAGAGCTAAA 

ACATGTAACATATCACAGTGTTATTCACCAGAAATAAAAMTCAAGAGTCTGCTCAGAGT 

AGGTTMTATGAGTTCCTTTCTTCAGTCCAGCTGATGGTTTTTAGTAAGATGAACTGCCA 

AGGAGACAATGAGCACTGACTTCTCGATGCATGACTTCATCTTGTTAGAAGGTGGGTTGC 

CGGGCCGCGGTGGCTCACGCCCGTAATCCCAGCACTTTGGGAGGCCGAGGCGGCCGGATC 

ACCGAGGTNGGGGGGATCGAGACCATTCTGGCTACACGGTGACACCCCGTCTCTACTAAA 

TATACAAAAAATTGCCCGGGCGTGGTGG 

Sequence 207 

CCNCGCGTCCGATGAATAGTTAGCCCATGATAAAGGAATAAAAGGATGAAGAATATTTGA 

AGAGAAATAAATCTTCCTCACTCCTCAGGTTCCCTTCCATGTGCAGGAGCCTCAACCTAC 

AACTAGCAACCTTATCTCCTGACTCATTCCTCTCTAGAGGAGGAGTAAATTAGTCAACTG 

ATATGCTCTGGAAGAAAAACCCA 

Sequence 208 

CGTCCGGTCTTTCCTCCCCCTAAATAATGCATTACAAAGTGGAAATGCAAATTTCCTGTG 

CAAGCTCTMGTAGCAGGTGGTATTTCCTTMTATATTGTTTTTGACCTTTGGGGAAATT 

GGTATTACGAGCTGACTTTGGAAAATTAAAATAGCATCAAGGTCCTACATTTTAAATAAA

ACAATCGATATCTTAATTTTTAAAATCAGACTAGATTACGATACCAGGAAAAGACATACA

TATTTTGCTTTTATGTGTTAAAGTTTTGTAATTCAGGGAGGACAAGAAAAGGGATATGGT 

GCAGCTGAACTTTCTAATTCACTAAGACAGGAAAAAAAAAAAAAAAAAAAN 

Sequence 209 

CGTCCGGGAAAAACAAGGGTTTCCGCCAACAGGCTGAGAGCAAAGGAGGACGCAGGAAAA
CTATTTTAAAAATTGACCCAAGAGTTCAAAAGGCATATGGAAGCATTTAATGGGGGTGGG 
AGGTATCCTTGTAATAAGAATACCATGCATGTATTCCCACACTGCTCTTGGTGGTCTGCA
AAGTGATTTCATATGTATTTTATGTCAACACCAGCACAATGAGGTAAGTAGGACTGTATA
CCTCAGAGGCATTTGGTGATTTGTCAGAGTGGAGTGTAGTGTGTTGGTGCCCAGATTTGA

ATAGGATCATTTGAGTCTGATATCATCATGTTGCCCACCGCCTACTCAGCCTCTACACCC 

GATGAGGCCAATCTGCAGCTCACTACAGTCAATAGAGAACAGGCAATTAACCCTTAAGTT

ATATTTTAGAAAGATTTCTGTCTAAAATAGATAAACTTGAAAGTATAGCTCTTCAAAATA

ACGTATTCCTGTGTTGGCAAATATTTTCCAAACTCACAATCAACACATAGGTGTATTTCT 

TAGACTACTAGAAGTGGGGACTTACCCCAA 

Sequence 210 

CGCCNCGCGTCCGGGAAAAACAAGGGTTTCCGCCAACAGGCTGAGAGCAAAGGAGGACGC

AGGAAAACTATTTTAAAAATTGACCCAAGAGTTCAAAAGGCATATGGAAGCATTTAATGG 

GGGTGGGAGGTATCCTTGTAATAAGAATACCATGCATGTATTCCCACACTGCTCTTGGTG 

GTCTGCAAAGTGATTTCATATGTATTTTATGTCAACACCAGCACAATGAGGTAAGTAGGA

CTGTATACCTCAGAGGCATTTGGTGATTTGTCAGAGTGGAGTGTAGTGTGTTGGTGCCCA

GATTTGAATAGGATCATTTGAGTCTGATATCATCATGTTGCCCACCGCCTACTCAGCCTC 

TACACCCGATGAGGCCAATCTGCAGCTCACTACAGTCAATAGAGAACAGGCAATTAACCC

TTAAGTTATATTTTAGAAAGATTTCTGTCTAAAATAGATAAACTTGAAAGTATAGCTCTT

CAAAATAACGTATTCCTGTGTTGGCAAATATTTTCCAAACTCACAATCAACACATANGTG

TATTTTCTTAGACTACTAGAAGTGGGGAC 

Sequence 211

NCGCGTCCGGTTTCNTTGGGATAGATTTTACCTATGAATTCCTCCTTAGAATTCTGAAAT 

TGCTCAGATTTACCCAAATGACAGCCAGTTTCTCATTTCACATTTGGGGGCTGTAGAATC 

TTCCAACATTGAGAACCTGTTTTAATCAAAGGATGCTTTGTGGAATCCTGAATGAGGAAC 

AGCATGTTGCAGGAAGAAGAGAAGGATCCTGATGCCCTAATGGGACTGATTTCCTTTTGG 

GGGGCAGGAAGATATATATTCGTTGGGTGCTTATAAAAGGTTAATTCCAAAGATTGTGTA 

TGGTTAAAGGACTGAAAGTCACACTTAGCCTCATACTTCACTTAGATGAAAAACAAAAGC

CTTCCTCTCCATTACCTTGTAAGATCTATTCCTTGTGTCTTGTGCTGAGTGGACCTGGAA 

TAATGGATAGCCCTCACTGAGTACCTAGAAGGGGACTAGGGGTGGGTGATGAAAGGGGGT 

TCACACCGAAGATCTAAGTGCTAGCTTGGGTA 

Sequence 212 

CACGCCCCGTGGCCTTGCTAGAGATCCATATAATGCAGTCATGCTGTTTCTTNCTCCATA

GTATGTGGGGCATGAGGAGGAGACAGGGAGAGGGTGGCTTCATTGNGCAAANGNGGAATG 

GCTGTGCTTTGGGGCCAAGGAGATGCTGTCCTGCTGTAGCTGCTCTGTGAAAGGTCAGGC 

CTGCCCCTNTGAGGCTCCCTTTATCCTCCTAAATTCTGGGGCATCTACATGACGCTTTCT

AGTCCACCTTTGCCTNCGCAGATCATGGCTACTAACCTGACCTTTGTCTGTACTTGAGCA 

CCCTTCGCGATTTAACTTNCATGTANCGTCCGACTTCTAATATGGATTTGAATTTNTTGA 

CTGTTACTGCTCANAACAATCACCCTTTTTTGAGCAGNGAGCTGGNAGGATAATTGCCGA 

CAAATGACATTNGGANCCGTTTTNAACCACAGGGGGCATGGGG 

Sequence 213 

CGATCATTTTATTAGAGTNATGTATTTAAGAACTGATAAATCATGGGCTTACCTACACAA 

TGTCTAGACACATGAGCAATGMCAAATAGCAAGGTCTGTGATATCTCATATGGCAATAC

TAGGACTGAGATTATTTTTGTTACAATTAAATAATTGTCAGTAAAAATCCACAGAGATCA 

TTTAGMTGGGAAAAAAGTCTGATATATTTTGTTTCAGATTATAATCATTAATAGGGTAC

CTGACAGTTTTCAAAGTTGTTTAATGTTTTTTAGGTCTTTAACCTTCCTAATCCTACAAG 

GTGGTTACAATCCCACATATTATCCTTGTGTCAGGGGTCTCCAGCACCTGCCTTAGGGTC 

AGTGATTTGCTAGAAGTACTTACAGAACTCAG 

Sequence 214 

ATCTGACAGCCTGGAACNGCACCCACACCCCCAGGTGAGMTCTGATGTTCTGGAGCATC 

ACACACAACCACAGGTGAGCATCGGAGAGTCTGGAGCAGCACCCACAACCCAAGGTGAGC 

ATCTGACAACCTGGAGCAGCACCCACACCCCGAGGTGAGCATCTGACCTCCCGGAGCAGG 

ACCCATACCTCCAGGCGAGCATCTGAACCCATGGAGCAGCACCCACGCCCCCAGGCGAGC 

ATNTGACCGAACAGAGCAGCACCCCCTCTA 

Sequence 215 
TCCGGGAAAGACTCTGAGAAAGGACTCCTGACCTGGCTCTTCCAGAATGAGGATGGATTT

TCCATATGAAGGAGCCGGGGGCAGGACCTTCTGGGTAGAGCGTGAAAAGAACACAGTATG 

TGTGGAGAGCGGGAAATTGTGTTGAACTGTCTAGAAAACAGAGIXACAGGAATGCGTGTT 

GAGGGCTTGGGCTGTACAGTGGAGAGTGCCCTCCCTGGCCAGGGAGTTGGACATTCATCC 

CACCACAAGACCCCATGAAGAGTTCCTCAACAGCTCTGTGTCCTCATCAAACCTGTGTTT 

GCAGAACAGTGGAGGAAGAGAGCTGAAGGAGGGGAGAGGCCCTGCACCTGCCAGGCCCTG 

GCCTAGACTACAAGGGTGAGCACTGAGCCATGCTCTCGGGGAACCTTCACTGGAGTTGAG 

GGCAGTGAGAATGTTTAAAAAAAAAAAAAAAAA 

Sequence 216 

ACGCGTCCGATGCACACTTGTCTTTTACCACANGGGTGGGGCGTGGGANGGAGGTTTAGT 

TTGGATAGCCACGTAAACGCCTTTCCCTGTGGCCTGCGATGTTCCACACCGTTTATGTGT 

GAACTGGCTGCACCCGCGCCTCCCGGACGGGGCTGCCAGGGAGGAGGGCCCGGGAGACCC 

CATCCAGACCCCGGCCCGCACGCTGCAGAGGTCTGCTCTCAGACATGTGGTGGGCTCCGT 

GTCACGGGTAAGGGGTCGGGGTCTAGACGGCAACAGAGTGTCTCTCTCTTCCCGCTCCCT 

GGTGTGCCCACCTCCCTGTACAGTGTTTCTGTTCAAGCTGCTGCAGGGGACGGGGCATTT 

TTCGTCCAGACTCTATTTTCCTGCAAGGAAGAGCTGCTGTCCTTTTTCTTACTGAAGCCC 

CTGATTCTGTGTCTGATGTTGCTGACCGCCGTGCTTGCTTTCTTGCCCGTGTGCAACTTC 

CAATCCCAAGCACACGTGCTCACTTCCAAG 

Sequence 217 

GCGTCCGGGGMTGGCTGTNAGTNMGTTAGAGGTAAAAAAATTTCATGTTAAGATnTG

GMCTGGATTTTATTTAAAATAATGATGCGAAGCCATTGAAAGGTTTTTGGTGTGACAGG 

ATAAATTTAAAATATGAACACACCAACGCATACTTCTTTTAAGAAAGAACCTGATTAAAT

TTGGGAATTTTAAAATAAAAACAGGAAGCATATCGTACTCTAATATAATAATTCAAGGGT

TTTTATTTTCCTAGMGATCAAGGTCATGTTAATAAAGGGAATATAGTTTTCTTATCTGT 

GTTAAGACACTGATGACTTGCAAAGAAAAGTAACACTTTTGTGATATCCTTAGGTAATTC 

AAGAGGAAACGCTTGAGCAATTACTGATGTTGTAAACTGGGATCAGAAGACATA 

Sequence 218 

NCCCCGCGTCCGCCCAGTTGCAMGGAGATGTTGTAGGATGTTAGGTCTCAGCACAAGGA 

ACCCAAACCTTCAGGGGCTCTCCTCTACATTATGCTCCCATTTTTCTCCCAAAATATCGA 

TCTCCACCCACCCTAGACATAGAACTGGAGMTAAGTTCCAGTTTCATCCCTTTGCAGAT 

CTTAGGGGGACCCATCAAATTCCCAGCCACTGGGTGAAAATCAGCAGCTTCTTTATAGGA 

CCTGAGTTGCCTTCCTAGAGGATCCTAGAGGAAAAAAAAAAATCTTATCCTTCAAATACT 

GCTGTCTTCCAAAATACGTAAGGACGCCACGGTGAATCATAGTGGACACCCTGCATTGGT 

TGGGTTATTATTTATCCTAGMGCTTGGGTTCTTGGAGCCCTAGCTTATTTAAGCAACAA 

AGTCCCTCACAGCCCACAGGTGAGGAAGTGAGTGTAACAAAGAGATCATTGGACCTAAAA 

TCAAAACACCTG 

Sequence 219 

CCCGAAAGGAGGTTTGTGGAACTTGGAGAGATCCAGGAGGTAACACCAAAAAGCTGCATT 

TAGCAATGCCTGCCTAGCCCTCCTGTACAGCTCATGTTATTTTGTTGCCTTAGGCCATCT 

TACACCAMCCATTCCTATCTCCATGCTTTTGTTTATGCTGTTGCCTCCTTTTAGAAGGG 

CCACGCTCCACCGCTCTGCTGTGTTGATACTACCGACCCTTCCTTTAGTTCCTGTTCAAT 

TCCCAAGCCTTCTGCCAAGCCTTCTTGGACATTCCCATCCCATGCTGACTATTCCTTAAG 

CTAAAAACCCTTAGAGTTAATATGATACACTTGGCTACCTCATTGTTTCATATCAGTCAT 

TCCTGTCTCTAGCTATAATCGGCTCAGCCAAGGAAGATATATTTATATGGACAATGTCTT 

TGTGCCTTGGCTGTAAACAGTGTTGAATAATTAATCTCACTTGATGAGGTCTTACTTAAT 
GATAGCCTCC 

Sequence 220 

CTCGTCCGGGCTGGATCCGTCTGCNCCACTGCMGGGCAAGATGCAGCTGGTGGCTGACC 
TGCTGCTGCTGTCGAGCGAGGCGCGGCCCGTGCTCTTCGAGGGCCCCGCCTCCTCTGGTG 
CCGGCGCCGAGTCCTTCGAGCAGTGCCGGGACACCATCATCGCGCGCACCAAGGGGGTTT 
TCATCCTTACCCACGACGTGCAGAGCCAGCTCAACATGGGCCGCTTCGGGGAGGCGGGGG 
ACAGCCTGGTGGAGCTGGGCGACCTGGTGGTGTCGCTGACCGAGTGCTCGGCCCACGCGG

CCTATCTGGCCGCTGTGGCCACGCCGGGCGCCCAGCCCGCGCAAGN 

Sequence 221 

GATTTTGGCTNCTGCAACCTCCGCCTTAGGGGTTCAATGCAATTCTCCTGCCTCAGCCTC 
CAGATTGGGATTACAGGCATGTGCCACCGTGTCTGGCTMTTTTCTTGTAATTTTAGTAC 
AAAAGGGGTTTCACCACATGGGCCAGACTGGTCTCAAACTCCTGGCCTAAAGCGGATCCA
CCTGCCTCGACCTCACAAGGTGCTGCGATTACAGGGATTACAGGCTGGGATTACAGGCAT 
AAGTCACCACACCCAGCCTAAAAATTAAAAGTTTATACTGTATTGTTCACATTGTGATGC 
AACTTTCCATATGTATTTCCAGAATCAACTATGTATCAAGGAATATTGAAAGCATAAAAT 
GAGATCATGGTGAATTCTCTCCCATTGTCATTCTTGNGGTAAGGGAATGAATGGTGG 
Sequence 222 

CNCGCGTCCGAGTTATTNATATAAAGAACATTTTTCTGTTTTAGAGAGAACCCATTTATT

TGTGMAGMACAGAGTTTTGTTCTTATCCCTATAMCACTGCATTTGCTGTTTCTTCTC 

TAGCTGATGTGACATATTAGGGAAGCAGCACCCGACTGGGACTCAAAAGACCTGGGTTTG

GAMCTTTTTGATTACTAGCTTAGCTGTGTGAAATCAGGCAGATGGTATAACTTCTCTCA

TTGATGATGTCMTCTTTTTAAAAATTTTCTAGCAGTGAAATTTTAAAAAATGAAATGTT 

AGGTGAAACCTCAAACTTACAACAGTGGTAAAATTGGAAGGGTTCTGGATGGAGTGAGGG 

CAGGGAGGGGGTAACTGGGCTGAGGGGCCCCTCTTCAGCCTCTTCCTCCCCTTCAACCGA 

AGTCTGAAAATCTCTGGCTAATATGGGTGGATCCCTAAGATTTCTTTTGTACTTTTAACA 

TACCGAAAAGNTATTCTAAAGAATTTTGGTATGGTTTT 

Sequence 223 

CGCNTCCGCCGAGCGCAGCAGTCTCCCCCTAACCTCAAAAGCCTCCTCAGAAGTAACCTG 

GTATCAGTGGGCTGTGCAGATTCTTATAGTCTTCTTTGCCTTTATATATGAAAATAACTT

TGTTrTATGTTTGTTTTCACATAGGTGGAATTATACTGACTCTTACTCTGTGACTTGCTT 

TCTTCACATGACAGTAACTCCTGGACTGTTTCTGTAGCAGCCACACAAGTCTCCTTTATA 

TTGCCAATATCTATCTTTATCCCAAAGACAAGATGAATGTTCAAAAACGGTAATCCAAAC 

TCACTTCTAAAAATGGGTGTGTGTTTAAAGAAGAGCTGCTCAGCTGAGGCAGTTTTGCTG 

CCGAGGGGATTTGAGCAATGTCTGGGGACGTGTGGTTGTCACAACTTGTGAGGGGGCTC 

Sequence 224 

GCNTCCGTCAATTTGCATCAGTTGCCTTAAAGAATGGGGTAGTTATAGGAAGCTCACAGA 

GAGGAAAAAMCTCTTCATGTCTACATTGCTTCTTGMCTGATTTTATTTATAAAAGAAG 

AATTGTTGAGCTAGTGATAGAAGTTTCATGAATTCCCTGGTACTAATTATCAGTTAAATG

AGCTCTCTGAAGTCTCTGGAGAGCGTTCTGTGTCACTAATATTGGTGAAAATTTGAAACA 

MMTGCTCTCCCATTGTCCACATATTGCTTCTTTGAATTTGGTTTTTCGAGCCAAGAAC

TTAGGGTGTGAGAATATGTTTGTGGGGAAACCCACACAAATTTTATGTTAGTCTCTGTAC

ATTTAAAl I I I I A CCTTCCTGATTACTTACGTAAGACTAAACAATTTAAGTTTCTAAAAT 

GCCATCACTTTTGCAAATAAAGGACTTTATTAAAGNTGATAAATAACAATNATGGGCCAT 

CAGCTCCACCTATAATrAATATCCTTGCCTGGCACCCCTGGMGGGACTTAGCTTNTT 

Sequence 225 

CNTCCGCAATTACTGCCTTAGGCTAGATTCCCATGAATANGAATTGCTGAGTCAAAAGGT 

TGACACATTNTTTMAGGTTAGATACATATTAGTCAAAGTTTTTAAGCAANATGCTTTCT

AAGCCTNTTTGATCTTTATAANNCATTGNTCCTTTCTAAAAATATATAACTGTCTTCTGC 

GTNCCMGGATMTTNTTTTTATTAATATGGGGCTTCTTGTGTCTACTTCCTCCCCTTTC 

GTTAmCTTCAAMTGTTTAACAAACTAATACATTTCAGAACACATTATGCTTNCATCT 

TGTCACTATTTGCAGTACCTTGTATCTCCTGGACTTTATGCAATGCGTGTGTGTGCACAG 

ATGGAATATGTTNCACCATTTGGCT 

Sequence 226 

GCAGTGCCGGCGGCGCAGTTGGGAATGGGAGTGCGCTTGCACAAGCGCCGACGAGCGATA 
GCGACGCTATGCCCTCTCTTGCCCAGGCGCCGGGTGGCGCGGATCAACCTGCTCTCGCAG 
CAGCCCCTGTGGTGGCCGCCGGTGATGATGCAGGAATCGTATTTCGACGGTGGNCAGCTT 
CGCTTTCGATCAACATCTTCGNCGAAGTTCCGCGTGAACCGTNCGGNATCGACATCGTGG 
CGCTGGCATCTGCCGCTATACCAATCATCGAAACCGCGCTGGAAGGTCGGGTCCAGCGCC
CTGCAGAAGAGACGCCTCCAGCAGGACAAGGTGGCCTGCTCGACGAGCAGGCTACCCGAG 
TGTCGTTGAACAAGTATCGCCTTGGGTCT 
Sequence 227 

CTGACACCCAGTATTGCCAACCTAAGGACTAGGAAGAAAGGGCTTACATGTTTCTTGCTT 

TAATAATATGTGAATTAGAAAAGGTATTCAAGAAGATACAGGGTTAGAATGTGCATCTTT

CTTTTATGTACTGAAATGTTTCAACTCATATGCAGCAATGTGGTNACCTTGAAAGATGAT 

TCCAGAGATTTCATGCTTCTGGCAGTTTCAGGTCACATTGGGGTGAAATTTGACATACTG 

TAGTCATCCCCAAGGTAATAATGTTGATGAAATGGTATAAACCCTGATACATTCTTAAGC 

MMTGMTCTAAMGTTTGTTTCAAAATTTTTAACGTTTTATGTTGCTCTGACTTTCCA 

TATACTGATTTTTACATTACTTAGTGAAAAAAAAAAATTACCTTTTAGCTTNTGGCAACA 

AAAACATTTTTGGCTATT 

Sequence 228 

CGNCGCGTCCGCAAGACAGGAGATTTCTTAAGGTCATATATGCACGATTTGCTTAAACGT

CATATAGCATAACTGGAAGAATATAGAATCTACCCAAAGCCTATGCTCTTTCTACTGNAT 

TATTTGCTTATAATAGAGAATGCTAATCAGAATCACCTGAATAGTCTTGTAGAGGGTCAG 

TACACAAGCCTAGGTCCTATCCTTCAGAGAGGTGAAATGGAGGCAAATATATTCTGAAAG 

AGTTTCCCAGTTGATTCTGATGAGCACTCACAATTAAGAAGTGCTGCATTAAGAGCTGTA 

CATGGTGGCTTATGCCTGTAATCCCAGCTATTCCTGAGGCTGAGGCTGGAAGATCACTTG 

AGCCTGGAAGATCACTTGAGCCTGGGAGTTGGAGACCAGCCTCGGCAACGTAGTGAGACC 

ATGTC I I I I I I I 

Sequence 229 

TTGGGAGATATGGCANGGTGAGAATGTTTGGGAAAGGAAGTAGAGACAGACATTGGATTT

TTGTCAGTCAANTTTCTTTTGATTTTAAATTATTTATTCATTACATTGTCTACATGGCAG 

TTAATATTCTAAAACTATMACAAAATTTAATATAGAAAAAATATTATGCTTTTCTTTTT 

TGCCTTTGCATGCTCTTATTTCAAAATAGATTTAAAATTCATGGCATATTATACTAGAAA 

ACAAGTCTGTCAATGATCTAAGCTTCTATCTTTAGATACTAGGAAAAAAGAGAAAAACCC 

AAAGCAAGCAGAAGCAAGGAAATAAGAACAGAAATAAATGGAATTGANAAGAACACTTCA

GAAAATCAGTCAAACTGAAGCTGATTTTTATAAAGATTA 

Sequence 230 

CCACGCNTCCGAGGAAACTGGCAGGGAAACAAAGTATCCCTGGAAGGAATTTCTTAAAGA 

GGAAGAGGCAGATCCCTACAAGTTTAAAATCAAAGAATTTGAAGATGTTGATCCCAAAGT

GAAATTGAAAGATGGACTTGTGAGGAAGGAGAAAGAGAAGCATAAAGATAAGAAGAAAGA 

TAGAGAGAAAGGCAAGAAAGATAAAGATAAGAGAGAGAAAGAAAAAGTGANAGATAAAGG 

CAGAGAAGATAAGATGAAAGCCCCAGCACCCCCACTGGTGTTGCCCCCAAAAGAGTTGGC 

CCTGCCCTTGTTCAGCCCTGCCACAGCCTCCAGGGTCCCAGCCATGCTGCCATCTTTGTT 

GCCAGTGCTTCCGGAAAAACTGTTTGAGGAGAAAGAGAAGCCGAAGGAGAAAG 

Sequence 231 

NCTAGACTCCCCTCTCGTATCATGGATCCCAACATCNAGGNATATGGNCATTTACGTGTT 

GGGATCTGCTCTGCCATTGNACACAGCTATATTCNATTGCCCCGGGNGTTGTGTATNTTT 

CCAMMCGTTGAMGGGAGGTTCAGMGTATNCAGTTATTNGTATTATTAGTCGTTTTG 

AAACTGAGTNGAAAGACTCATTNANGAAAGNTCCATATGCCTTCTTGTCTGTCTATGGCT 

GGNNTGCTCNNGAGAAAAGTCCNCANTTATACAATTGTA 

Sequence 232 

TTTCCI 1 1 I 1 1 rGGGGGCGGGGTCGCGCTCTGTCGCCCAGGCTGGGGTGCAGGTGGCGCG 

ATCATGGCTCATTGCATCCTCAAATGCCTGGGCTCAAGCAAACATCAGTTTTCTTATCTG 

TGAAATGAGGATAAAAATGTCTCCACTTAAGGGTTGTTGCAAGGAAGGTGTTGCCTTAGT 

CATAAAAGCTAGGGAAGGTGTTCCTAACGAAAAACAATTCGTCAGAGACATGAAGGTAGA 

GGAAGAATTCACACATGAAGGGGGCTGGGGAAAATGATTTAAGAAAAGAAACAGGCCTGG 

CGCAGTTGCTCAGGCTTGTGATCCCAGCACTTTGGGAGGCTGAGGAGGGTGGGATCACCT 

GAGGTCGGGGAGTTCTAGACCAGCCTGGCCAACCGTGGTGAAACCTTGTCTTCTACTAAA 
AATACAAAAANATTGGCGGGGTGTGGTGGCAGGTGCCTGTATTCCCAGCTACTTTGGAGG
CTGGGACGGGATAATTNCTTGAGCCCCGGAGGCGGGGGTTTCCGGTGAGCCCGGGATTGC 
GCCCCTTGCACTACAAGCCTGGGGCACANAAGCGAGGACTNTGTCAAAAAAAAA 
Sequence 233 

CCACGCNTCCGGTCTCAAATCNCCTCAAGTATATTCAAAATTTGACCACTTCTCACCAGC 

ACCACTGTCATTATCCTGATTCAAGCCTCCATCATCTCTCATCGTTACTGTGACCTCCTG 

ATCATTCTCCTTGCTTCAGCCCTGGCCCCTGCAGGCAGCATTCAGTATTAAGCAGTAGAG 

TTGTTTCTATAAAAATGTAGTCAGCTGGGTGTGGTGTCTCACGCTGTAATCCCAGCACTT 

TGGGAGGCCAAGGTGGGAGGATCACTTGAGCTCAGTTTGAAGACCAGCCTGGTCAACATG 

GTGAAACCCTGTCTCTACTAAAAATACAAAAATTAACTGGGCATGGTGGCGGGCACCTGT 

AATCCCAGTTACTGGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCAGGAGGCAAAGGCT 

TGCAGTGAACCCAGATCACACTACTGNTTTTNCAACCTGGGCACAGAGTGAGACTGCCTC 

AAAAAAAAAAAAAAAAAAA 

Sequence 234 

TCGAGGTCAAGGACGGTTATGGCCCGTAACTACCTGCTGCCGCAGAATTACGCCATCAAG 

TGGACGCGCGGTGCTGAGGCCCAGATCAAGGACATCACTCGCGCCCGTAAGGCTAAGGAG 

ATCAAGTCCAAGGAGGAGGCTGAGCAGATCCGCTCGCAGCTGGAGCACCTGGTCGTCCAG 

GTGACTGTCCAGGTCGGCGAGAACGGTCGTCTGTTCGGGGCCGTGACTCCTGGCGATATT 

GCGCTGGCAGTCAGGAAGGCCGGTGGCCCCGCCCTCGAGAAGCGGTCCATCGAGATCACC 

AAGCCGATCAAGACCATCGGCAAGCACACTGTCCGGCGTCAAGCTGCATGACGCTATTAA 

GGGTCACGTCACGGTCGAGACTGTTCCCGCCGCGTTGATTTGACGTACACGCAGANTAGG 

GGGAGGGGCATCCAACCTGGGTGCCCTTCCTTTGCTTNCGTCACCACCGGCGAAAGGTAA 

ATGACCGAAAGTAATTCTTATTACNGTGCTTAGGGGGGTTTCGCCGCGCCGGGCCGTTAG 

ACTTAGTCTAGAAGAAAAAACCTTCCACACCTTCCCCTTGAACCTGGAAACATTAAAATG 

AATGCNATTGGTGGGGTGGTAAACTTGG 

Sequence 235

GNGGTAGCTTTGTGTATGTCGGGCACTTCNAGGAATAGGGTGCAGGAGAAACGTCTCAGT 

GTCTCCCCTTCCGAATCTTGGCTTCTGGAGGGAGAGATGCTGGGGTGGGAGTGCTCCTTG 

GTGGAGTACTCAGGAGCTTAGTAAMGCAGAGGGGGCTGGAGAGGCAGGCCTGGCCTGCA 

GAGCCAGCATGGAGAAGCCTGGTGTAGGGCTCTCCAGCCTGCCAATTTACAGTTAAGAAG

AAAGGAGATATGTATATATATATATACACACACATACATACATACATACACACACACATA 

TATACACACATATATATACACACATGTATACATATATATACACACACATACACATATACA 

CACACATGTACACATATACACACATATATACGTGGGGTGTGTATATGTATATATGTGTAT 

ACGTATATA 

Sequence 236 

AACTGTCTAATCTCTTGACCTCGTGATCCCCTGCCTCGGCCTCCCAAAGTGCTGGGATTG 

CAGGCATGAGCCACTGTGCCCAGCCATAC 1 1 I I I I I I I I I 1 1 I CTGGNGCATTTCTGAAG 

TTAMTATGTTGAGTTCTCTGCCCATTTGTTCAMTTCTATTGNATTATTTCTGNNGGAT 

TGACAAATGTTATGAACAATTTGGTTTCAMATAGATTTTTCTATTTAGCAACTTTCTAT

TCCTCTAGGACCTAGTATTAAGGTCATTAAGATGGCTTAATGAGTCATTAAGCCATTAAT

GAGTCATTAAGATGGCTGACCAATCTGTGCCATCTTTTTTAGTGACATTTTGTTTATCAG

TGTCTCTTTTGTGTATTTCTTTGATTTATATCTTGACAGTATTACATAAGCAGGAATAAA 

AGAGACTTTGAGTGGGAATGTCTCGCTCAAATATAAATAACCTTATAAATACAATCTTTA 

CTTCTCAGAGTCCTAAAATCTTCTTGGTTAGTCTGGTCTTCCTGGGTTGATTCAAGGTTC 

TCGGNGACTAAATGGCAT

Sequence 237 

TGTTNACCTTTGGGATTACACAATACTTGCAATCCAGCTCTGCCATGGGGGACATCATTA 

CACAAAGCACTCCTCACCAGTAGTCAGATGGCACTTGATAGCTACAGGCATGTGAATATT 

CTTTTACTGTCMCTTTTGTTGTGATTTCACCTAAAATATGATAAGCTCCTTGAGGATAA 

AGGCTAAGTCTTACTTCTTCGTATTTCTGGTAACCGCTTGTACCAAACACCTGCCAAGCA 

TTGTTGATTGCACCTATGAGGAGGTGAGAAAGTGCCAGGCCGTCATGCCCCTCTATACAA 
ACATGTAACAGTAGACTCTGGTTCATCAAGGAGTAAATGACATGATATGTGATAGACCAC

ATACATTCGTGGCTATAGATAGGGTGAGTAACTGTMCCATATCCAATTGTGAGCAAAAA 

GTGTTTMTTGGGGNGGTTTTAMCTAATATAAMTTGGCTAM^ 

TCTCGAATCATCTGMGACCTTTTTTTATTTTTTMNANTTAGGCC

Sequence 238 

CGCTCCGTTGCAGTGAGCTGAGATCACGCCACTGCACTCCATCCTTGGTGACAGAGCANG 

ACTCCATCTCAAAAAAAAAAAGAAATTGGGTTGTCTAGGCACAGTGGCTCGTGCCTGTAA 

TCCCAGCATTTTGGGAGGCTNCNTAGAANAATCACTTGAGCCCAGGAGTTTGAGACCAGC 

CTGNACAACATAGTGAGACCCCATCTGTACCAAAAAAAAAAACACCAGACCTGGTGGCAC 

ATGTTTGTAGTCCCAGCTACTCAAGGGCTGAGGTGGGAGGGATCACTTGNNCCCAAGAAG 

TCAAGGCTTCAGTGAGCCATGATTGGTGCCACTTGACTACAAGCTGGCAACAGANCCAAG 

ACCCTNTTTAMAAAAAAAMAAAAAAAAATNGGNTTGGGTTATAGGNGAAGTTNCCTGG 

MAACCTTACAAAACAAGAACCATGTTTGGGGGAANCAANTGGANGGGGTNAAANTGNCC 

NCTTGANGCTTNTCCTTTNGNGGNCCNTAATTAAAMNGTTTCCCNGGTNTTGGCCCNGG 

GNCCGGAAMTNCCTTNGMTTTCTTTAGCCCCTTTTGGGGNGGGCAAAAGCCAGGGNGG 

MTCCCNCNNGGGNTCAGNGATTTTNGNAACCNNACCTTGGCCAAAAATNGGGGAAAACC

CCNG 1 I I I I I I I NTAAAAMTMCNAAAMNTTTNCCCCCTTTNCTTTNGGGGGGGCCCA 

CANNNNCCCCCTTTTTTNATANANAANAANAANCCCCCCCCCCCCCCCCCC 

Sequence 239 

CATCTCAAAAAAMACTCACAACATATTTTATGTACACCTAGTATCAGTGAGATTTTTTA 

ATMTTAGATTTATTATCTTCCATATTGNNTACMMCTTTCTTCAACTTCATAAAAAGT 

CMGTAACATACTGTACTTACTTCTCTCACCTCATTTTAAAATAGTGGATAAATTGTCAG 

ACACAGTGACTCTGCACCTGTAATCCTAGCTACTTGGGAGGCTAAGGTGGGAGGATTGCA 

TGAGGCCAAGAGTTCAAGACCAGCCTGAGCAACACATAGAGACCCTATCTCTTTAAAAAA 

AAAAAAAAAAAAAAA 

Sequence 240 

TGTCGACCCACGCGTCCGCGCTCCTGTCATCTCCCTTGGGTCTTCATTTAAATGCCACAC 

CAGAGAGGCCCTCCCTGGCCACCCTAATGAAAACTTCAACATCCTCAACCCTAACATTTC 

CTGTCCCCTGGGTTATTCCTCCCCTTGGTATTTATCACCATTTAATGTACTATCTGGCCG 

GGCATGGTGGCTCGTGCCTGTAATCCCAGCACTTTGGGAGGCCCAGGCGGGCGGATCACC 

TGAGGTTAGGAGTTTGAGACCAGCCTGGCCAACATGGTGAAACCCCATCTCTACTAAAAA 

TACAAAAATTAGCCAGGCATAGAGGCATGCCGCCTGTAATCCCAGCTACTTGGGAGGCTG 

AGGCAGGAGAATCTCTTGAAACCGGGAGGCAGAGGTTGCAGTGAGCCAAGATCAAGCCAC 

TGCACTCCAGCCTGGGTGACAGAGCAAGACTCTGTCTCAAAAAACAAAAACAAACAAACA 

MCMACAMCAMCAAAAAMCCCMCAMCCCACCMACATACCATATTTGGTTTCTG 

TMMAMTAAMMGAGAGAGAGMTTTTTAAAAMCAAAAACCATACTATCTAATTTA 

CAI I I I IAATCT 

Sequence 241 

CCCCGCGTCCGCTAAGGCATGTGAGCGCCTTGTAGGGCACGTTCTGTCTTCTGACTACAT
MGCMCACTTTTGCGACCCTAAGTCACAGCACCCACAGCTTTTGGCAGGATACTTTAAA 
ACAGAAAAMCACATCATTGATCCTGGCAGGATTTTTTAAGGGATTGTGCTTMGAATGT 
TTMCTTTGGTMTCAGAGACCACCACTGGTGTTTTTCTCCAAMTCACATCTTTAAGTT 
MTTAATACTTGAAGTTAATAAATAATACTGCTCAAGTGTATTAGTAATGATGCCATAAT 
ACC ATGT GAATTTATGCTGATTCAMTGTTGTTTTTTTCGTTTGATACTCATATGGCC^ 
CTGTTTTGAGGACTTCAGATTATTTAGCMCTGATTTMTCTGGTCMGAMTMACTTT 
GCTTCAGCTGGAMGCGTGAGGCTTGAGAAAAGCAAGGTTTTGGACAGGGGACCTATGAA 
GCTCATGTTGAMCTTAAGTGTTTTAAGGCTGTATGGGAACTTGGAATGGGAGTGAAAAG 
AACCAAGCCGTCTACTGNCAAGGTTTTTCCCTCCTCCCTCTAMATTTATAAACCTCATT 
CTTAGAAGTGGCAAAMGTTGGGAACCTTTTCCACTGNTTCACTTCTNTTANTNAGGGGA 
TTAAAGGNGGATNGGNAMNGGAAGTTTTCNTGGTTTTTTAAAAAAAANNGNANAANGGG 
Sequence 242

GNCACGGCGGGGCAACCTGGTTCCCCTGGGCGGTAGTCCCGGCAGCTGTGCTTCCCCCTT 

CACTGCAGTGCGTGTTCAGTTGGAMCACGGAGATTTCACCTTTACTCCCTGGCTATAGG 

GGGATGGTGAGAANGGGGCTGCGTGTNAGGNGANATTCAAGGCATAAATCTTAGNTTGTC

TTTTAAAGAATTGNGTGCCGGGACGTGNTTGGAGGNTGGGTGCANANAGGAAGAGGCGAN 

CTNCTGGAGGANGCAAAGGACAGGTCANTNTGCGNANTTCNGNNNGATGCTGCTGANTGA 

ACCCCNGNTGCTGTNGCCCCAACAAGAAAGGCTNNGCTNNGATCAAACATTTACTGNACC 

TANAAGGCCACATTCTNGCTTAANNANNCCAGGMNGCNCNTTCTTGTTNCCCATTACCA 

AMTCATTNACACTTGGTTTTTGGCCCNACATACCTTTCCTTTTNGAC 

NNAAANGGCANTANACCNNNGNATNTNNCNTCATNACANGAATAAAAAGCCCNCAGAACN 

CNACANANGNGCCCATCACTANNNNTANAGATNTCAAAAGGGGGCCCNGGATCCTTNACC 

ANAGAANACTCTGGTCCNNAGNGNAAGAAAGNAAAACCCCNCNANANGCACTGGGACNNT 

NTNATANTTCTNNTTNAMCAAMNCTTATAANNNAAAAAAAAMTTTTT 

Sequence 243 

AATTAATAAGCCTCCTGGGTGATTTGGCTGCAGGTGGTCCTGAGACCACACTTTAAGAAA 

CACTAATCAAGAATCMGGCCTTATGACTCTTAGTCCTGTTTCTTCCCAAAGGACTATGT 

TGCCTCCCTACATTCGAAMCATTCCMACMTACATMGAGTCTTTGTAATTCACATTA 

GTGCTCGTATTTACTGACTCMCAATTMTTCCTGATTGAGTCATTCATTGAGTMTTTT

CTTTTACTCACTCAGCCTCTGTGGTTTGCCAGGAGCTACAAATAAAAGATTAAAACACAG 

TCACACCTCCAAGGAACTCATATAATAGTAACAGAAGTAACTACCATTTCTTGAGCACTT 

TATTTATACATACCGTCATACAGTTTATCTCATTCACCTATCTCATTCCCTGCCTTTCAG 

TMGCCTGTATACTATTATAACCGCATAGTTTGGGAATATTTCATATATATATATATATA 

TATATATATATATATATATATATATATGTATAATCACATATATTCTCTTCAACTTGTNCT 

TTACATTTAATATGTCTTAMAGTATTTCTTTGTCAGTTCTTGCTTTATATATGTCTAGT 

ATTTCTTTATAAGGGCTACAAGMGTATCCCTCMGCATTGGATATTTCAATCAAATTAC 

CCTTGGGGGGGTNGGGGGAAAAAAATNNGGGGGGGNTTT 

Sequence 244 

TCGCCCCGCGTCCGGGGAAGGCTAAGGCGCGAGGATCCCTTGAGCCCAGGAGTTCTAGGC 

TGCAGTGAGCTATGATCACGCCACTGCACACCAGTATGGGCAACAGAGCGAGATCCCATC 

TCTAMCAAATTTAMACMACGAACAMTGAAAAATATTGCTGTCTTAAGGTTGGGAAGi 

GGGCAGAGACCCCTTTGCTTGCTCATCACCAAGACACTTCTGTGAGGCCCCAGGGCTCTT 

TGGAGMCGTTTTGAAAATCACGGTTCTAAGTAATTATAGTTACTGTGACTGAACTAATT 

TAGCCCTAAGCTTCCTACAATCAAGATAGAGATACACTATGGACTGCATTTCTCCGCTTC 

AGATTAAAAMAAAMGTTTAAGTCAGMTGTAGTTATATTTTCAGGTAAAATGCTCAAT

ACATTTTCAAGATGAAGCTGCTCAAAATTAAAGCAGTGAGTCCAAGGGTTAATCTGNAAA 

MAAAGTACAATTTACTATCTCCTGGTTNCANACTTATAGACCCTCATAGGTGCATTGGC 

TMTACMGGGGCCACTAAACACATTGTGGCATTTACNGGATTTTATTGGTGAAGNGCTC

TATMGTTTTATTGGTGCCAGGTAAAAGAAANGCCTCNTATAAAAAAATGGTGGNGGGGG

Gl I I I I I I I INCCCCCCI I I I I 

Sequence 245 

TTCCGTACCTMGMTTGTACCCTTTCATAACAGCACCACTTGGATATGTAGAAAGAGTT 

TGTTGTGAGATCAGATGTAAAACAAATAAAAGTTATTCGTGAMTGATATGGAAGACTGG 

GATTAGAMCTGTGGCATTCMGAAGCCAGTTAAGCTGTTCTCAGAATTGACAGAGATTC 

TAGAGATGGTGTAGTGCAGATGGTGTAGTGCAGGGATTCTCAGCCTCACTGCACATGGTA 

ATCTCATAGGMAATTTTAACAAGGACAGATGCAGAAATTCTTATTAAACTGTCAGGACT 

GMGTCCAGAGGTCACTATATTTTAAAAAGGTCTGCAAGTGATTGTAACATGCATCTATG 

GAGGAAAACATCANCGTAGGAGAAAAAGGGAAAGAAACCAACTGGAGTAAAGGCTCTGTC 

TTGAGCACTGTGCTGGCTCCTGTCATTCTCCATCTCTTTTTATCTTCATAGTAAGTGAGA 

TGGGTTTGACAAATAGGGA 

Sequence 246 

GCGTCCGGNGTMCTTGAACNAAAGTATTCTCCTTCTTCTGTATATTTGTTCTCAACCCC 
CAATTCCTCATAACTTTCTAAGTAAGTAGGGTGAAAAGAAAGTCAGGGCTGCXJTAGGGAA

GCTACATTTGCnTCTCAAGGCTTGCACCTGCATATTTCATGCCCGACATGTTGCTGAACA 

CAGCCGAGTAAAGTGCCAGAATAAAATCAGCTCTGTCCACGTGGTACCAGGAATGCTGCT 

GAAATGAAGGTAAGTATAGGATTCCACAGGATCTAAGACAGTCCAGGGAACTGGGACCAA 

TTGTGGGAAACCTCAAAGCTGAGCATATTTGAAGAAAATGCAGAGTAAAATGGCTATGGG 

CTGCCGGGAGGCCTGGATTTGAGATACCAAGCACCACTGATATAAACACTGATGCTCAGA 

AAMGAMTGATGAGTGTGTATTTGTGTCCCTTGAGACCTMTTTAATTTATAATTATTC 

ACCAGAGATAAGAGGCAAGGGATGTCTACTTGCTGGATATCTGAATTTACACAGTTCTCT

AGTCATTAGTATTTTTTACAAATATACCACTCAGACTAGGGGGGCAGAGGATCTGGGGTT 

TNMTTTCAAGACCTGTCACTTATCTGCATGAATTTTGNAGCTTATAAGAACTTACAGGT

TAACC I I 1 1 I I 

Sequence 247 

TGTCGACCCCGCGTCCNTNGTATTTATAAANATAATNCTGNTAGATAAATAAGTGATTCA

TATTTTGTCAMNCTATTTTAAAATTTCAATATTTAAMTATTNTTGAATCACTGGGTGT 

CGNTAAGTGGCATCATNNATGAGATTTGATTCCATGTACCATATAATNTTAGATTGGTCC 

TNTCTCACCCCTTTTAAACTCCTTCAAGCATTGCTATTACTGGGGTTGCCTTTGGGAAAA 

CTTACTTCTAGATACTACCATATATCTGAAATAGTAGAGGTGGATGTTAATAAAATTCAT 

AAMTNATCATGTATTACTTTTTTTGATTTACCACTGGAAGGAAATACAGNCATGTGCAA

TATAATGACCGTTTTGGTCATNGAGACCCACATGTGTGACAGTGGTCCCATAAGGATGNG 

GCTGAAAANNTCCTGTTGCNGCCTAGTGACACTGTAGCCATNGNAACNCCATAGCACGAC 

ACGTNACTCACCTNTTCATGGTGATGCTGGTGT 

Sequence 248 

CCCCGCGTCCGATTTTGAATGTATTGAGACTAAAGTTCCTTTGAAACATTAAAGAAGATC 

CATCAAACTGGCAACTGTATATAAGGTCTGATATCTTCTGGCACTCAGAGGGAGAGTTTC 

TGGTGGAGGTGGAAGTGACTTAGGAGTCATCCATGAACATAAATGAGATCACTCGTGTAG 

GGAAGAGTTAATGAGATTAGAGAGCCTTGGACAGAGCCTTGCAGCCCAAACCCATGAAAG 

AGCAGTAATAGTGAGTGAATTACCTTCTGAGGCAGTGCAAAGTAGATAAGAAAACAAAAN 

CTNNAAAAAAAAAAAANAANAAAAAAAAA 

Sequence 249 

CCGCGTCCGCAGTGGTNGTGATCTCGTCTCCTGCAACCTCTGCCTCCCAGGTTCAAGCAA 

TTCTCCTGCCTCAGCCTCCCAAGTAGCTGGGATTACAGATTATGTCTTGTGTTANNGAAA

TCATTCMTTTTTCACCAGAGAATAAGGAAAAGCAAATTAGACTGTGAATAAAATCTCCA 

TGTACCCACTAGGATGACTACAATTCAAAAGGCTGACCTATCCAGGTGAAGATGAGAGTG

TGCAGCATTGGAAATTCTTACACATTGCTGGAGATTTAGAAGCTAAAGAAAGAGGACAAA 

TGATTTGGAAGCAAAGATAAGAAGGAAGAAGGAGATGGGGTCTTGCCATGTTTCCCAGGC 

TGGTCTCAAACTCCTAGATTCAAGTGATCCACCTGCGTTGGCCTCCCAAAGTGCTGGCAT 

TATAGGCATGAGGCACCATGCCAGGCCTGTTTTGAAATTTCATACATATTTATAAAACAT

ATTTTCAGGATGAGAGAATATACCAAGAAGTTACATAGCATTGTGTATCTGTATAGAATA 

AAA 

Sequence 250 

TCGACCACGCGTCCGGGTGAACGTGGTCACCAAGGCCATGGGTACCCTGGGGGTGAGCTT 

ATCCTCCTGCAGCGTCCCTGGTTCCAAACCCACCTTCGAGCTCTCAGCCGACGAGGTGGA 

GCTGGGCCTGGGGATCCACGGGGAAGCTGGTGTGCGCCGGATAAAGATGGCAACCGCCGA 

TGAGATTGTGAAACTCATGCTCGACCACATGACAAACACCACCAACGCGTCCCATGTGCC 

TGTGCAGCCCGGCTCCTCAGTTGTGATGATGGTCAACAACCTGGGTGGCCTGTCATTCCT 

GGAACTGGGCATCATAGCCGACGCTACCGTCCGNTCCCTGGAGGGCCGCGGGGTGAAGAT 

TGCCCGTGCCCTGGTGGGCACCTTCATGTCAGCACTGGAGATGCCTGGCATTTCTCTCAC 

CCTCCTGCTGGTGGATGAGCCTCTCCTGAAACTGATAGATGCTTGAAACCACTGCAGCAG 

CCTGGCCTAACGTGGCTTGCAGTCTTCATTACTGGGCGGAAAGCGGAGCCGGGTAAGCCC 

CTTGCCGAGCCCCAAGAAG 

Sequence 251 




GTCGCCCCGCGTCCGCAAGAAATGATGTTAGTAAAGATACTTTAATTGCGGGGAGTTCTC

TGTTGGCTGG TGMT AAGGACGTTCTCTTTGAGTTCCACCTTTCTTTGAGAAATTTTCAT 

GTTTCCTGTACTTTTCCTCTATTATATCATGGAAACTTACTAAATCAGCCTGTGCATTCC 

GCCCACAT ATTCC CTCATGGTGCCTTCCGGTCCTAGCAGCTGAGCCAGGGCCATCMTAA 

ATCTTCTTCTTTTCCCCCAAGATATTCCTAGTTGCTTCTTTACTATTCATTCTTGTAATT 

CCATTCATCTGTGTGTATCATTTAGGATGCTGGGCAGTTGCATCGCATTTCACCTAAGGT 

CACATCCTGATGGGATTTGGTGGGATCCTAGAGCTGGCTAGGACCACTTGTGAGAGCTGA 

TTGTTAAATTCTCCGAAATGACGTGACTCAAGTTGTTAGACTGCTTAAAGG 

Sequence 252 

CCGGTGAAA CCTG GAGCTGAAGTGAATTCTCTTAGAGTATA7TTTGAAACTGTACTAGGA 

CTTTAAACACTTTTGGAATTTAAAACAGCCATAAAAATTCTTGTTATACTGAAGGAGTTC 

CTGAGGCAGTGTGCCTCTCATTTTACCACCTAAAGTTGCCATAGAGGTCCAAGGAGACAC 

TGCTGATAGCAGAAAGTC TTCCAG AAAGAAATTAGGCGACCCACACCAAGCATGTATGGC 

TTTGAGTCTTACAGATGGCTTTTTAATAGTTTAGTCTCTTAACCTAAGGAAGTTTCTGAA 

GTTCCGGTCAGAGAGTCTAAAAATTCACATTTTACCTAATAAATGATAATGAGGCTATTT 

ATCTTGTCTGTCTGGATTTT7TCACTTGACATTTAATGAAATATCCCATATTACCTATAA 

I I I I I ATTTGAAG 

Sequence 253 

CCCCCCGCGTCCGAGATAATGCTGTTTGCTTCCGGCCGCTGTAAATCATAGGTGAAAACC 

AGTAG CANG TGCTCACTCAGTGCCTCCCAGAAGCGGTCTGCGGGTCTCAGCTGGGCTGGG 

GGCAGTTTTCATTGGGCAAGGCTTGGGC7TAGCTTCGAAGCANGGGCTGGGAGAGGATGG 

ATGGGGGTGTGAGAGCAAAAGAAAGACCTGGCTTTGCAGTGATGGCANCCCACGTTCAAA 

TNNNAGCTCACCACTGACCNGTCGNNTGACGNGCGCCAGGNG7TAGGAGACTGNAACTGN 

TTNTGNGTNNNGNNTCCGGNCGTNCATNNNNNCTGCTCAGCATACANANCCTNTTNCTNA 

TCNTAATCCTCATACNCATGNCTGNNNACTNTACACTGTTCTACTTATCAATGACAGGTC 

AAAAGTGTTATCATNTGTGACNTAGAATGAGTGAAGTACACNCCCTCTTGGAAACTATGA 

ATGACTTAAAGAATCACCNTTTGCAAAAAATC 

Sequence 254 

GCCTTCGCCCCTGCCTCCTCTTGGCTGCGGCTGGTCATCTTCCCACCTCCACAGTGGGCC 

GAGCTGCCAAACAAGGGGTACACATGTGCCCCCTGACATAGGCCAGGTGGTCTCTGCCCT 

CTGTAGCTCCCATGAGAGAGGGCTCCTCGGACCGAAACAGGAGGCCACTGCCCTTGCGCA 

CACACCGTGGCCGGGTCCTCCTGGGCGGAGCGCT7TCCGTGTGTGGGAAAGTCAAGGGCA 

GCCCCGAGCCTCGAAGCCCAGGCTCCCAGCCCCGCGCCATGTTGCATTCCCGCCTCTACT 

CCTTGGTAGGCT GGNTG CTTTGAGTGGTTCTTTTTMTTCTTTCTGTTGGTTTCTCCTTT 

TCCT7TGCCTGGGTT7TGCT7TAACCTCTCTGTTGCAGAGATGCAGAGCACTCAGAGAGC 

CTATTTCTATCATCGCTTTCCTATTCTCCACCTAGAACCAGNTGACTGGCCGCCCGAGTG 

GNGTCTCTTGTGTGTGTGGTGCCGTCAAAGCTGTGGAAAGAAATGCTTCTGCCTAGGTTT 

CTTCGCGCCCCCCCCTTGCTTGGCTTTTCCTGCCTGCTTACACCCCCGGTTCCTGATCTG 

CCCTGGGC 

Sequence 255 

GC CCACG CGTCCGCAAGANAGCTCCTCAGATTTGTCATAGACTATATTTAAAGAAAGGCC 

ACAl I 1 1 l CTTATTTAAAATGCATTAAACAATGCANCCAATTAAAAGAACTGAGNTGGAT 

TTGTACAAAAGCAGGGACTAGGTCTGNTTTGTTCACTGCTATATTCCCAATGCCTAGAAC 

CATGTCTGGCAAACATACTGGCATGGGAAGAACATTTCCATAACCCCTGAATGTTCTGTG 

CCCCTTTCCAATTAATCCCTACCCTCAGAAGCAACCACTATTCTCATGCTTATTACATTA 

GTTTTGCCTCTTCTTGACTTTCATATAAATGAAATCATACATCTAANAAAAAANANAAAA 

AAAA 

Sequence 256 

TCGACCCCGCGTCCGATTTGATATAAATAGTTATGTTACTCATATAGAAATCTCTTCCCC 

ATTACACACATACAAACATTTATCTATGAGTGGCTTATAATTGCAAATAAGATGTAAATC 

ATGCTCATGATCATTGTCAAMTTGTGAMGATTTTTTTCTATACCTCTTTTAGGTTTGT 
TTTGTTTTTGTTTTTGATTTCCAGGTGGCATTAAAGACAAGAGGGAATAATATTCATTCT

TACTTCTACTCCCMGTCACTAGTTTGCTGAATTTAATTGAGTTAAAGAATTGTATCAGT 

CTTCTTGGAAGTCTAATACAAAACCAGTTCAACACTAGTTATTCATTCTTTGCTAATTCA 

CCAGAATTGAAGGATGGATAAAATGAGAAAGAGMGTAGTTCTTCATATTATTAAAATAA 

AGAGTTAAATTAGACACTTTGTTGGACTCTTTGGTCTTAATAATTCCTACTC I 1 1 I I G AG 

GTCCAAAAGTTTTGTCTTTGATAAATATAATTTTAATGGG 

Sequence 257 

AAGTTGGGAAAATAATTCATGTGAACTAGACAAGTGTGTTAAGAGTGATAAGTAAAATGC 

ACGTGGAGACAAGTGCATCCCCAGATCTCAGGGACCTCCCCCTGCCTGTCACCTGGGGAG 

TGAGAGGACAGGATAGTGCATGTTCTTTGTCTCTGA AI I I 1 1 AGTTATATGTGCTGTAAT 

GTTGCTCTGAGGAAGCCCCTGGAAAGTCTATCCCMCATATCCACATCTTATATTCCACA 

AATTAAGCTG TAGTA TGTACCCTAAGACGCTGCTAATTGACTGCCACTTCGCAACTCAGG 

GGCGGCTGCATTTTAGTMTGGGTCAMTGATTCACTTTTTATGATGCTTCCAAAGGTGC 

CTTGGCTTCTCTTCCCAACTGACAAATGCCAAAGTTGAGAAAAATGATCATAATTTTAGC 

ATAAACAGAGCAGTCGGCGACACCGATTTTATAMTAMCTGAGCACCTTCTTTTTAAAC 

AAACAAATGCGGGTTTATTTCTCAGATGATGTTCATCCCGTGAATGGTCCAGGGAAGGAC 

CTTTCACCTTGACTATATGGCATTATGTCATCACAAGCTCTGAGGCTTCTTCTTT 

Sequence 258 

GAGTCGACCCCGCGTCCGCTCTGGAGGAAGCATAGATTAGAATCATGATTTTTATCTATT 

TTAAGAGAATAGAAGAACAGAAGGGGTTACAATCTTGCAATATTATGCAACTCTTCTGCT 

CTAATATATCAAAAACTTGATGATCCAAGATCATGCAGAACAGCTGAGAAGAAATCAAAG 

TAAACAGTGTACCTTGCAGCCAACAGATCCTGCCAATATGAGATTAGAACTCTCCATCCT 

AGCAAAAAAAAAAAAAAAAAA 

Sequence 259 

CTGGTACCTGCGAGTCGCTGCAGCAGCTGTGGCAATTGTCACCTTCATCCAGGCCCATCC 

CGCTTTGAGGGCCTAGAGAGAGTGGGCCAGAGGTTAACCCCCGATTCATCTGCCTCCCCA 

CGCTGGGCATCTGGGTGTGCCAGGGCATTCCCCCGCTGGTCAGACAGGTTTTTGGGCCAG 

GGCGGGGCTGACCAGGGTTAATTAGAGGGAACTGGCTAGGAGGAGCTGGGGAGGGGGCTG 

GGGAGAGTCCAGGCCTNCAGAGCCCCTGGGACACAGCAGGTGTGTGCTGCCATGGGCCGG 

GGCTTGAACTCTGCCAGACTCAGGCGCCAAAAACGGCGCTTGCGACCTCAGGTCCAGAAG 

CCCCGGCAGCAAGCTG 

Sequence 260 

TCGACCCCGCGTCCGAAACTCTGTCTAGTCTAAACTATTATTCTATACTTCTCATCTCTA 

TATGTTMGGATTGATCTCCAAGATAMTTGTTTTTTGTTTGNTTTTTAGGGACAGGATC 

ATGCTCTGTTGCCCAGGCTAGAGTGTAGTGGAACAATCATAACTCACTGCAGCCTCGAAC 

TCCTGGGCTTAGGTGATCCACCTGCCTTGGCCTCCTGAGTAGCTGGAATTCCAGATGCAA 

GGCACCATGCCTGGCTMCTTTTTAAAATTTTTCATAGAGATGGGGTCTTACCATCCTGC 

CCAGGCTGGCCTCGAGCTCTTCACCTCAAGCAGTCCTCCTGCCTCAGCCTCCCAAGGCAC 

TGGTATTGCAGGAGTGAGCCACCACGCCCAGCCCAAAATAATTNTTTTTTAAAGCAAGAT 

GTAGAAAAGTGATTATAATATGTTTCCATTTAGGCMGAAAAAAAATGGAGAGGACTATA 

CCTGTACTCTCTGNACATAGGATCCACAGAAAACTTCTAATGGATGGTTATCCCTGNGGN 

GGGAMCTG GGGG ACAGGGMTGATGAAGCAGGAAMTTTTTACTGGATAAACTTTAGTT 

CTGGTGGCTCTTTTTCTTCTATCATGNGNATGGTAACTT 

Sequence 261 

GTCCCGCAAAGCCTTTAAAAAGAGTCCGMTTTCACTTTTCACCTTTTGTAGATGTGCAC 

GTGTAGCTGTAGAGCTCATACTTACGTTTCACATGGCATAGTTGATGGATATGTAGGTGT 

AAAGTTTATGGTAGTGGACAGGCTGAGAATGGTGTATCTGTGACAAAAAATCTGATGGAA 

GTGATATA TTTGA TATGAAAGTGAACATTTTCTTAGTTGGGTGTTTATAAC I I I I I I I I G 

GTAAATTGTTTTTAGTTTTTATCCTTATTTYACTTATGCTTGGCMTAGATGGTCTTT^ 

TCCCCAMTCnCTTCTGMTTCCMGGAMCACTGTTTTAGCATTTATTTGATTACTTT 

GGTTCATTCTTTTCTCCACTCCCATTTATTTGTTTTCCATTTTGTAACTTCTATAAAGCA 



GATAAAAATCTGGAACTTCTAGATCTGACCTTCATGCCTTGTCTTTTCTATGGTACTTAT
TCTTTCTGTCTCCTTCTCATTTTGGATTGGGCTTATGAGAGAAATCTTGGGGTTGATCTT 
CCAGCTC ACTMTTTGATATTCATTTGTGTCTCTTCAGTTACTTAGCTTGCGTGAAAACT 
I I I 1 1 1 I I CAGCAATTGTGTACTTAATTTTCATA 
Sequence 262 

CCNCGCGTCCGTGCCCTGGGGCCCCCCTGGGCGAGCATACCCTAGGCGTCAGGCCTGGAG 

GTCTCCTCGGTGCCTCCCAGCTCTTCTGTGCTCCTCACACTCTGCTCTGNGCANATTGGC 

TGTATTTGTAGGTTACTGCCTTTTATTTCTCACATTTCTTTTTGGTGAGAGTATATCAAT 

CMTCAATCATCCTCAGACCTCTATGATAACACTGTGCCCCACACACAAGAAGCTACTCA 

ATTMTGTTTGTTTGTTGAAATGAGAGAAAATA7TTGTCCTTAGTACAGAAAGAAGATGA 

AGCCAACCTCTGATAGAAGCCCACCCATAGACTAGGGTGTTGAGCTGCCTTCCAA 

Sequence 263 

NCCCCGCGTCCGGTAGTTGCAGAAGCATGTTCTTGAACTTATCAGTCCTGACCTCAGATT 

TCATCTTCTTCTGGTAGATGTAGCATACATCTCTGAGTGTTATTAGAGGACCAGNCTAGA 

GCCTCATCGTACTCCTTCAGTTACTTCATMTCATCCAGCTCTTTATATTAAATGTTTTT 

CTGTTTAAMTGGCAGCMTGTTTTTTATTTTTTGAAATGGTCCCTGACAGCAACAGATC 

TTCTCGTGTT TMTT ATTG AGTCT GTGCTGATTACACAGAATTAAGGATATAGTTTCTAA 

AGTACTTCCATTTTTATATATTTTAGCATTATTCTGAMGGCCTGGAAAAAAACTATTTT 

TTATT CGATTTGAAAGTGAAGTGACATAGGTGGGTCGCTATAGCAAGAAATTACCCTGTA 

TTTTTCCATCTCTATCATCACAGGCATCTCACAGAATTAGAAGTCGGACATTATTGATGG 

ATATATTAGTCATGAATAATTAAAATACATTAAATATAAAATGGGTCAGATACGGGCAGA 

TTT 

Sequence 264 

CNCGCGTCCGGGAGAAMGAGTTTTATACCTMTTCTTNTGCMGTGATTACATATTTTT 

ATA CCCA GGAT GTTCA GCMGATGMCTTTATTTTTTAGATGTATCTATGGNTTTTCCAC 

CCAT TTTA TTMTTTTTAGAAATATAAAAGTGCCTTTAAAATTTAGCTGGGTTAAGATAG 

TAAGTTTTAGGCTGAGGCAGGAGAATTGCTTGAACCCAGGAGGCGGAGGTTGCAGTGAGC 

CTAGATCGTGCCATTGTACTCCAGCCTGGGCAACAAGAGCGAAACTCAGTTTCAAAAAAA 

AAAAAAAAAAAAAAAAAAAA 

Sequence 265 

GATCCTCTMGTNCCCMTGATCNGAGMGAAATATGAAAGGGAATTTTAAAATATTTTG 

AACTGAATGAAGATGAAA ATGCCA CAGATAAAACTTTGCATGGGGNAGCTACATTAGTTA 

GCTTAGAGGGAMTTTATATTTTTAAATTCTTATATTAGGAMTAAGTCTTAGATAAATA 

ATGTCAGCTTCCACCTAAGAAGTTCAAAATAAACCCACAGTAAGCNGAAGAAAGGAAATA 

ATAAAGCTGA GAAAA AATCAATAAAGTTTAAAAGAGAAAATTAATAGATATCAATGGAAC 

AAAAGTTGCCTTTTTAAAAATATCAACAAAATTGATAGACCTTTAGCCAGAATGATCAAA 

AAAAAAAAAAAAAAAA 

Sequence 266 

CCCGCGTCCGCCTGAATTTCAGAGTCNGTTGAGAAGGTAGGGAGCAGGGATCTCTCAGAC 

ACCMCATGTTCAGCTTTATTATAGATAAGGCTGAATTGATTTCCTGGAATNGACCTTAC 

TTCCCCACCTCCAAACTGCAGGCCTTCCCCTTGCAGGTTCAGAACATGTTAAGGTAAGTT 

CACTGCAGAGATAAACAGGGTGAGGACAGGCCAGTCCTAAAAAAAAAAAAAGAAAAAAAA 

A 

Sequence 267 

CCGTTCT7TTTCCNAAACTAAAGAATGCATAGGACATAAGTTAAAAGTTCATACATAACC 
TGGCTTCAAATCCAGTTCTACCACTACCTGAAAACATCAGTTTATTTCTCATCAATGGGT 
TGTTATAAAGTACCTAGCATAGGGTATTGCTTAAATGTTAATACTCCCAATCCTGACACT 
AATGTTTCAGGGAAGAGTGAAAGAAATCACATTAACTCCACATTATTGAACATCTTCTGT 
GTCAGGCTGAATACATCTTTGTATCCATTGTCTTCTGGTTGTtTAAAGACTAGTGTAGAA 
GCCTGACATGTAAATCGGTGATTATATAAGATAGTACTGTCTTGTAATGTGTTGTGCTAG 
AGAGGTTAAGTATTAATTATTATGGGAGCTCAGGAGAGGCACCCACTGGCTGCTGGGAGA 
AGAGCAAAAACCTATATATAMGCAATTTTTAAGATTTTGGTTCTATAAGGGTAGAATGT
TCGATCCATTAGTGTATTCCCAAAGTCCTATACACATGTCAAGATCGGGAAAGCTCACAC 
ACACAATAATGCCCAAG 
Sequence 268 

CGTCCGGCCCCATGCCCAGCCAAGTTATGTTGTTTGGTCAATTTTAGATAATTAAAAAAA 

AAAATCCTATGTAGTAACATCCTACTAGGAAAGAAGATAAACTCAANTATCCACTGAGTG 

GTGACCfGAAACCAAAGAGACATGGAGGGGCAGCCAGTCAAAAGCCACTGCACTCCAGCC 

TGGGCAACAGAGCAGGACTGTGTCTGGAAAAAAAAAAAATCTAGAAACATACTCAAATGT 

AAACATAAGAACTTAGCATATGGTAAAGTTGTGACTGTGAGTCAGTGCAGAAAAGAATGA 

ATCAATCAGTAAATGGTCCTGGGAAAACCGGCTATTTATTTTAAAATAATAACAAAATAT 

GCTGTATATCATCAATGGATTAAAGATTTATATACCTTCAGCATATGTCCTTAAATGTTC 

TAGAATAAAATATATATGAATTCAATGTAATTTTTGATGTGAGGGAAGGCCTTTTAGACA 

TGACACAAATGTCAAAAAGCATAAAAGGAAAGATTTTGAATAAATAATAAAC 

Sequence 269 

GCGTCCGACAGATACACCATCATTATAGGTGGCCAAATCATTCAAATCGCTGTTTGCTTC 

TTCTCGGGC TCTTA TTCAAGCTCA CGATG CTATTCTCATAGTGATTGCGAATCTCACTGC 

ATAMGGTGATTTTGAAAACATCACTTTTCCTCCACACTAATGTGTGAGCTTAGCCAGAG 

TAGGAGATATATCTTACTTAAAAAAAAAAAAAAAAAAAAAAA 

Sequence 270 

CCCI 1 1 1 1 INCGGCCNTNCGGGCAGGTACGCGGGGAGGTCATGCCCGTGTGAGCCAGGAA 

AGGG CTGTG TTTATGGGAAGCCAGTAACACTGTGGCCTACTATCTCTTCCGTGGTGCCAT 

CTACAI 1 1 I IGGGACTCGGGAATTATGAGGTAGAGGTGGAGGCGGAGCCGGATGTCAGAG 

GTCCTGAAAT AAGTC ACCATGGGGGAAAATGATCCGCCTGCTGTTGAAGCCCCCTTCTCA 

TTCGGATCGCTTTTTGGCCTTGATGATTTGAAAATAAGTCCTGTTGCACCAGATGCAAGA 

TGCTGTTGCTGCACAGATCCTGTCACTGCTGCCATTGAAGTTTTTTCCAATCATCGGCAT 

TGGGATCATTGCATTGATATTAGCACTGGCCATTGG 

Sequence 271 

ACCGCGGNGGCTTCATGCAAGCTGTGGGCATGGNCAACGATCACGAAAATCATTNTTCCT 

TTAAATAAAATACAATCCTATNNAAAGGAGTNCTTCCATGAGCAACAATCAAATACGTGC 

TTATGCTGCGATGCAAGCAGGTGAACAACTGGTTCCTNATCAANTTGACGCAGGCGATTT 

ACAAGCCCATCAAGACGAAGTCAAAGTTGAATATTGTGGATTATGTCATTCAGATATTTC 

AAGTGATTAATAATGATTGGCCGATCATCTACTTATCCTGTAGNCGCAGGTCATGAAATC 

ATTGGAACCATCACTGCTCTTTGGTTCATGAAGCGAAAGGACTCAAAG 

Sequence 272 

CCCTTAGCGTGGTCGCGGCCGAGGTACACCAAGACCAATTGCTAAAATCTTGGATTATGG 

AAAATTTAAGTATGAAAGAAAGAAAAAACAAAANGTTGAAAAAGAAAAACAATCTTTCAC 

AMCAATAGAGAAATT CGTTT ATCTTTTGGAATCAATTTAAATGATATAAAAATCAAAGC 

AAAAAAAGCCAAAGAA I I I I I ATTAGATAACCGACAGAGTAAAAGTGGCTCTTCGTCTTA 

GAGGGCGTGAAAATACAAGACCTGMCAAGGTAAATTMTTTTAAATTCTTTTTTTGATG 

AAGNAAAATCGATTGCAAAATTAAAGTAAAGAANNGCAATCAGTTTGGTAATTTTTTAAC

TCTTCATATTGA ACGT GATAAGAAAAAATTACCCAAATTTACTTCTTCAAAACAAATAAA 

GGAATTAATTGATTTTGAAAAAACTATTNAAGGAAGGMGAAACTAATGCCTAANGCCAA 

AAACAAAATCGCACTTTAAAAAAA 

Sequence 273 

CCCTTAGCGTGGTCNCGG CCTT TGTACGAAATTATGACTGTTTTAGCTNCAGGAACAGAT 

TTAAAAGTAATTGAAGACGTTTTAAAATCTGTTTTTGATGCTAAGAACATCGAAAAAATT

GAAAAACTTGAAAGAACAGAGTTAGCATACGAAATTAAAAAACACAAACAAGGAATTTTT

GTTTTAGCTAACTTAA AATC TGA GGAAAG TTTAATCGMGAATTTGTCAGAAGAGTAAAT 

ATTCTCAAAAAACAAGTTTTAAGA I I I I TAGTTATTAATCTAGATTCTGAAAGAGGAATG 

CACAAMCTTTCAGACCTAGAAAAAA TGATM ACACAMTTTTTCTCTTCTAAAAAACCA 

ACANCTTCAACAGAAGMGGTAAAAGTTTTCAAAMCCATTTGTCAAAAAACCTTTTGTT 
CCAAAAACTGTAAAAACCAGCAAAAGATCCTAAAGTAGCTCACACAGCAAAAAAAAAAAA
NAAAAAGTCCTGC 
Sequence 274 

CCCTTAGCGTGGTCGCGG CCGA GGTACNAAATTATGACTGTTTTAGCTCCAGGAACAGAT 

TTAAAAGTAATTGMGACGTTTTAAAATCTGTTTTTGATGCTMGMCATCGAAAAAATT 

GAAAMCTTGAMGAACAGAGTTAGCATACNAAATTMAAMCACAMCMGGMTTTTT 

GTTTTAGCTAACTTAA AATC TGAGGAAAGTTTAATCGAAGAATTTGTCAGAAGAGTAAAT 

ATTCTCAMAMCAAGTTTTAAGATTTTTAGTTATTMTCTAGATTCTGAAAGAGGAATG 

CACAMACTTTCAGACCTAGAAAAAATGATAMCACAAATTTTTCTCTTCTAAAAAACCA 

ACMCTTCAACAGAAGAAGGTAAAAGTTTTCAAMACCATTTGTCMMAACCTTTTGTT 

AAAAAATCAGAAGAAACAGATTCTTCAAAACAAAATGAACAAGGACAAAGTGCTAAGAAA 

ACCCAAAAACTGTAAAACCAGCAAAAGATTCTA 

Sequence 275 

CCCTTAGCGTGGTCGCGGCCGAGGTANTTNCCTGANCAGTCGAAGTGGATGCCCAGACCA 

ATGGCCAGTGCTAATATCAATGCAATGATCCCAATGACGATGATTGGAAAAAACTTCAAT 

GGCAGCAGTGACAGGATCTGTGCAGCAACAGCATCTGCATCTGGTGCAACAGGACTTATT 

TTC AAAT CATCAAGGCCAAAAAGCGATCGGAATGAGAAGGGGGCTTCAACAGCAGGCGGA 

TCATTTTCCCCCATGGTGACTATTTCAGGACCTCTGACATCCGGCTCCGCCTCCACCTCT 

ACCTCATAATTCCCGAGTCCCAAAAATGTAGATGGCACCACGGAAGAGATAGTAGGCCAC 

AAGTGTTACTGGCTTCCCATAAACACAAGCCCTTTCCT 

Sequence 276 

CCCTTTCGAGCGNCCGCCCGGGCAGGTACGCGGGGAAATGCAAAAAAATCAAATCAATTT 

MTAGMT ACAT CAGAGATGTTAAAGATTTCCCMTTGMGGGATTGTATTTAAAGATAT 

TTCACCACTTTTAGCAAATGGAGAAGTGCTAAATTACACAATTAATCAAATGGCTGAGTT 

AGCTMAGATGCAGATGTTATTATAGGTCCAGACGCAAGAGGTTTCTTGTTTGGGACACC 

TACTGCAGCTTT7TTMAAAAACCTTTTATTATGGTAAGAAAACCTAAAAAATTACCAGG 

AGACGTTATTAGTTTTGAGTATGATTTAGAATATGGTAAATCAACTCTAGAAATCCAAAC 

TMTATGTTGAAAAAAGGCCAMAAGTAGCMTTATTGATGATGTTTTAGCTACTGGCGG 

AACAATGAAAGCGATTATTAACTTAATCGAATCTCAAGGTGCTGGTTGNTCATAAAGTAA 

TCTTTTTACTTGMTTANGATTTTTAAACCGGAATTGAAAAACTTAAAA 

Sequence 277 

CCCTTTCGAGCGGCCGCCCGGGCAGGTNCTTCTAANGTTAAATCCTGAGGTAAGTCAACC 
MAACAGGATGTGMTATCCTMTAATMTTCTAMGTCTTATCTTTTAATGTTGCTCTG 
TMCCAACACCTTTGATTTCTMTTCTTTAGAAMTCCTTTANAAACTCCGGTTAACATA 
^^ZZ!?^'^^^^^® ^ Gi"l"AATTG NTT AATATTTTTTTCTTCTG ATGTT 

CTTTTAGTAAGTAGAGTGT TTTCA ACTTTTT 

GATAATGTTCCTMTTTACCTTTTATAGNTACTNMGMTTGTTAATGTTTACTTCAACA 

CCTTCTGNTATTTGTAAMCACGATTTCCGACACGAGACAATATTAATTATCAAATGTAA 

GCMTGGATTTCACCACCTACATTTTTCCCTTTCTTGCNTTGTATMTTAGGTTNAATAA 

ATCCTTT 

Sequence 278 

CCCTTAGCGTGGTCGCGGCCGAGGTACAAGATAGTNNTCTCAGTAAAAGGTCTATTATCT 

MCTTGCCAAACTTGTTTACTGAGAGCCCTAAGGAACTAAAACTGCCATAATGCCGTGCA 

CAGCTTGAAMGCAATTAGAGTAAGCMGATTAGTTTTTCCTCCCTTCCAGTTCCTCAGC 

AGGCCTGGCTGAAGGCCCAGGAGGGAAGGAAATATAAGANCCAACAATAAAAATAGCACT 

AGCAATAANAAGAATGCCATCCCATGGAGCACACCATAAT 

Sequence 279 

CCGAGGTACf AMCTCTNTTTAATGNNCTAACGTC 

TTTAAAAATCCTMTTCAAGTAAAAAGATTACTTTATGAACAACAGCACCTTGAGATTCG 
ATTAAGTTAATMTCGCTTTCATTGTTCCGCCAGTAGCTAAAACATCATCAATAATTGCT 
ACTTTTTGGCCNTTTTTCMCATATTMGTTNGGATTTCTMGAGTTGATTTACCATATT

CTAMTCATACTCAAAACTAATMCGTCTCCTGGGTMTTTTTTAGGGTTTNCTTACCAT

AMTAAAMGGGGTTTTTTTAAAAAAAAGCTGGCAGTAAGGGTGTCCCAAACAAAGAAAA 

CCCTCTTGCGGTCTTGGGACCTATAATAACAATCTGCATCTTTAAGCTAAACTCAAGCCA 

TTTGGGATTMTTGGTGTAMTTTAAGCACTTTCTCCATTTGGCTAAAAAGTGGGGNGAA 

MTATCTTTAAAAATACAATCCCTTCAATTGGGGAAAAATCTTTTAACATCCTCTGGATG 

GNATTTCTATTAAAAATTGGATTTGGA 

Sequence 280 

TATTTGGTGMGATCAGCGTTATCAGCATTTTCTACGATTAAACGCTGGCCATGCTTTGA 

CTGATGAAATCCGCCAAGCTATTCAGCAGTTGGCGGATTGGGTCAGAGAAAGTCTCAAGT 

CGAAATTGAGTTAGTCGAAGTCTGCTTTCAATACGATGCGATAACGTGCTTGACCTGAAT 

G AAGTCG CTCAATGGCATCGTTGAGCTGTGACATAGGATAGAGTTCAATTTGCGGGGCGA 

TATTTTTACGTGCTGCAAATTTGAAGAAGTTGACGAAGTGCTAAAGGAGAACCCGTCGGT 

GAACCTGTTACTGATTTGGCACCATCAATCAAAGCACCCGACTGAAACTGGGAATAGGTT 

GTAAAAGTCAGAGCTAGAAAAATG 

Sequence 281 

NGGTACCTCCACCGCGGTGGCGGCCGAGGTACANCCTCTCGGCCCGGCTAAACATCATCG 

TCTTGGTAGGTCATTACCNTACCAACTAACTAATGNTCCGCACCCCCATTTTTAAGTGAA 

GCTGNGAAGCTCCTTTCTATTACTCATCATGCGATAAATAACTATATCCGGTATTAAGCT 

ATTGATTCCAATAGTTATCCCAGNCTTAAAGGTAGGTTAGGTACCTGCCCG 

Sequence 282 

CGANAAGCANGATTTTNAATTNTTGCAGCCCGGGGGATCCGNGGAGNGGGGAGAGCCCAC 

CGCGGGGGAGCGCCAACACGACNNAGAGCGAGNCGNAATACGCGCGCACACNGACCGCNG 

ANNAACAAACGNNGAGACGGGGAAAACCCNGGCGNCCNCAACAAAAACGCCCGNGCAGCA 

CAGNCCCCAAAAGCCAGCNAGGCGGAANANCGAACAGGCCCGCACCNAGCGNCCGNCCCA 

ACAGNAGCACAAGCCCCGAANGGCGAAAAGGG 

Sequence 283 

AAACCCCCTNAAAAAGTNCCAAGGGGCCGCCAANNCGACTANAGCGAGNCGCATTACGCG 

CGCNCACTGTTTGTCGTNNTTAACGCCGCGACCGGGNNGAACCNGGGCGNNACCCAACAG 

GAAGCGCCGCGCANCACAACCCCCNACGCCAGCAGGCCGAnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnAGNGGAGGAAACAGCGCAGCAGAGACCGNNACACANGCCAAC 

GCCCNAGCNCCCGCNCNNNACGCNAGCAACCCCACCNAACANCGNCCACGNACGACNGCN 

NGCCCCCGCNCAANCNCNAAAACGGGGGCCCCCNANNANGGGGACCCGANAAANCGCNAN 

NACGGCACCCNCCGACCCCAAAAAACNNGAAAAAGNGAGAAGGGAGCACGNAAGNGGGGC 

CAANCGCCCCNGAAAAGAACNGGCGNCCCGCCCNNANGAACGAANGGGANGACCCACGNC 

CCNAAAAAANAGGGGGACACCCGAGNNACCCAAACGGGGNAACAACACGCAAACCCNAAG 

GCCGGCCGAAACGNGAANGGAACCAACAAGGGGACACANGCCCGAAAACCGGNCCAAAGG 

GGGAAAAAAAAAAGAAGCCGNAANNCAAACAAAAAANCNNAACGGCGAAAGNNNAAACAA 

AANAAGAAACCNCNAACAACNCAAAGGNGNCACNNNNNCNGGGAAAAAGGGGCCCC 

Sequence 284 

CGGGCAGGTACCTAACCTACCTTTAAGACTGGGATAACTATNGGAAACAATAGCTAATAC 

CGGATATAGNTATTTATCGCATGATGAGNAATAGAAAGGAGCTTCACAGCTTCACTTAAA 

MTGGGGGTGCGGAACATTAAGTTAGTTGGTAGGGTAATGGCCTACCAAGACGATGATGT 

TNAGCCCGGGCCCGAGAGGCTGTACCTCGGCCCGCCACCGCGGTGnnnnnnnnnnnnnnn 

nnnnnnnnnnAGNCGTATTACGCCGCGCTCACTGGCCGGCGnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnACTTAATCCGCCTTGCAGCACAATCCCCCTTTTCGCCAG 

CTGGGCGTAATAGCGAAAGAGGCCCCGCACCCGMC 

Sequence 285 

CGTCCGTNAGGCCTGTNCTTACGGCTGGGTTTAGAAACCAGCCCATTCAGAAAGACTGAA 
TCAGAACATGGATNAAGTGAACTNATTCTAAGATGACTCGNNTATCCATGTNGATTAATC 
TNCTGGNTCATAATAGGCCTCTTCCCTTTGATTGAAGGGTCACGTNTAAGTATANAAMC

ATAAAACTGTMGGTAGAGGAAGCGAAGGATAGCTTNGTATTAATGTTGCGTTAAAGCTT 

CAGAGACAAGAACAAGMCACTCCTCCCACGTGACAGCATTTGAATAGGAGGCGGNGGGT 

GCNGCAGCCTGGGCAGCTTCAGTCCCGATTTACAATAAAGTACCTTGNGNGTNATTAGTT 

CTTAAATGTTTATTTAGAAATGGCATTGATGTT 

Sequence 286 

GCGTCCGGTCACCGCACTGAACTTCGGGACCAAAAGCTTCCATGCCGCGGNCCCCAGACA 

AAGGCAGCTTCCCGCTGGACCACTTCGGTGAATGTAAAAGCTTTAAAGAAAAATTCATGA 

GGTGTCTCCGTGACAAGAACTATGAAAATGCTTTGTGCAGAAATGAATCTAAAGAAGTAT 

TTAATGTGCAGGATGCAAAGGCAGCTGATGGCACCGGAGCCGCTAGAGAAGCTCGGCTTT 

AGAGACCTAATGGAAGGGAAGCCAGAGGCAAAGGATGAATGTTGAGAAGGGAGCCACAGG 

ACCTTGTCCCCAGCCTGGAGCAGAGCTGAGCCCTTCTGCCCACAGNGCAGGGGGACCTGA 

CACTCAGCCCGTGCTGGCCCGTGCAGGGGCTCTCCCTGGG 

Sequence 287 

CGTCCGGTGCACTGCAACTTNTATATNTAACCCTAAACTCCAATAAAACAAATTCCAGGG 

AAAACCAAGGGTGTAATGGGATGTGCGTGTTTATCAGGAGTGTGCTCTCACGTGGATGCT 

GAATGATGGAGGACAGCGGACTGCATAAGCCAGAAACCTGTACGGGTGCTGGCTGTGGAA 

AGACGTGTCTGTCTCTATCTATGTACAATAGTTNATTCTGTCAGGCTGAAAAAGTATGGT 

CTTTAGGACCTTGCCCCTCTAACTATAGAAACTTAAACAGTGTACTGCTATTAGATATAT 

CTGATATTAATAGAACATGCCAAGTGCAGGTCCCAAATGCGTATTTGTGAAGCACACATC 

TGAGTAAATGGCTTAGATGGAAAGCAAGTCATCATGAGTAAAAATTAAGCCTCAAACCTG 

CCGGTGCTCCTCACCTCTTTGTCACCAGGTAAAGGTCACACTGTGTGTTGCTTTTGNTGT 

CTTCTCTTCCTAACCTAG 

Sequence 288 

CNCGCGTCCGGGGCTTGTTTACTATGGCCGATGATCTGGAGCAGCAGCCTCAAGGCTGGC 

TGAGTAGCTGGCTGCCCACGTGGCGCCCCACTTCCATGTCTCAGCTGAAGAATGNGGAAG 

CTAGGATCCTCCAGTGTCTCCAGAATAAGTTCCTGGCCAGATATGTGTCCCTCCCAAACC 

AGAATAAAGAT CTGG ACGGTGACTGTGAGCCCCGAGCAAAAGGACCGCACCCCCCTGGTG 

ATGGTGCACGGTTTTGGGGGCGGCGTGGGCCTCTGGATCXJTCAACATGGACTCACTGAGT 

GCCCGCCGCACACTGCACACCTTCGATCTGCTTGGNTTTGGGCGAAGCTCAAGGCCAGCA 

TTCCCGAGGGACCCGGAGGGGGCTGAGGATGAGTTTG 

Sequence 289 

NGGAAAGCCGGCATAAGTGACATNGTTTGGGCAGTTGCCNGCTGGACTGAAGGGCNAACC 

CANACCACCTTAAGCCATAAAAGCCCGTGACACTGCTANCNAAGGTGCCTTGCCCACCGC 

TTTGCCACCNGTCCCGGAAATGNAAAAAAAGTCGCGTGCCNTAAAAAAGCTGCCGGAAGG 

NCCTGGGTGNACNTTTGGGCCACCCCCACCCCGCTGGCAAGGNCTTGAATTGNGTNACNC 

CAAAAGACGCCCANGCCGGACCTTGGNAAANNATTGTTNTTTNGGGANAAAAAAAAAATG 

GANCCCGNTGGGGGAGGCCCTTGGGGGCATTGNGNAAGCCCCCGGAGGGTTCCCCGNTGT 

TGGCNGGGGTCAMTTCAAAGGCCAGTGGTNGGCCACCCCGGGGMCCTGGNNGCTGTTG 

CAAGNACCNNGGTGGGGAACCGTTTCAAGGAATACCCCAACCAACCCANGTAGCCACCTT 

AAGGTAATTTGGCCCACCTTGCCCACCANAANGNGCCATTGGGGAAGAAACCACCAAAAA 

CGTTCCCCCCGGGT 

Sequence 290 

CCCCGCGCTCAGTATGACTCTTTAGTCCCAGTTTTCATGGGTAGTCTCTAAATTCTTTAC 

CTTTATGTGATTGTGAGTTGGGAGGTGGTGGGCATCATCTTAGTCCATTTACCTTTTTCA 

GTTTTGTAATT ATCG TCTTCATCTACTACCTTTATATAATATAAAGGGAAGGGGTCTTCC 

TTTACAAATAGTTTTATCATCCTTCTCrrTTTTGATGTCTATATCTTCTATTTTTTGAGGA 

GAATATATGTTGTATAGACCTACACGTGGGTGGAAGAAGAGTCATGTATGTGTGATTGTG 

TGAGATCCAGMTGTTGGMCTTTTAATTTCTTATTTTGTACTTATAATTACCTGCTGGA 

ATACCTGGTTGCATTCTGTATATTGTACCCTCATTTAAAAGTTCATGGAGGCAAAATAAC 

TCTGTTGCACATAAGGCCGGGGCTTATGCATGTCCTATCGGATGTGGGCTCAGATCACGG 
AAGACCCGAA
Sequence 291

TCTTGCAGACTCAAGCTCCGCGCGCAGCCGCTCCTGGTGCGGGCCCACAGCAGCCTGGGC 

CCCGGNTCGGCCCCGGAGCCCCCTGGCCTGCGACGACTGTTCCCTTCGATCGGCCAAATC 

CTCCTTCAGCCTCCTGGCGCCCATCCGCAGCAAGGACGTTCGCAGCAGGAGTTACCTGGA 

GGGCAGCCTNCTGGCCAGNGGGGCCCTGCTGGGGGCAGACGAGCTGGCCCGCTACTTCCC 

AGACCGGTACGTGGCGCTCTTCGNGGCCACCTGGAACATGCAGGGCCAGAA 

Sequence 292 

NGGCCCGCCCGGGCAGGTACTAAACTTTAATTAATGAGCTMCGTCATATTTTTTAAGTT 
TTTCAATTCCGTTTAAAAATCCTAATTCAAGTAAAAAGATTACTTTATGAACAACAGCAC
CTTGAGATTCGATTAAGTTMTAATCGCTTTCATTGTTCCGCCCAGNTAGCTAAAACATC 
ATCMTMTTGCTACTTTTTGGGCTTTTTTCMCATATTMGTTTGGATTTCTAGAGTTG 
ATTTACCATATTCTMATCATACTCAAMCTMTMCGTCTCCTGGTAATTTTTTAGGTT 
TTCTTACCATMTAAMGGTTTTTTTAAAAAAGCCTGCAGTAAGGTGGTCCCAAACAAGA 
AAACCCTCTTTGCGTCTGGGACCTATAATAACCATCTGCATCTTTAGCTAACTCAAGCCC 
ATTTGATTAATTGGNGTAATTTAGCACTTCTCCATTTGCTAAAAGGTGGTGAAATAATCT 
TTAAAATACAATCCCTTCAAATTGGGGAAATCTTTTAAACATCCCCCGGCGGTACCTCGG 
GCCCGCTTCTAGAACTAGTGGGATCCCCCGGGCCTG 
Sequence 293 

GCTCCCCGCGGTGGCGGCCGCCCGGGCTGGTACGCGGGGAAATGCAAAAAATCAAATCAA 

TTTMTACGMTACATCATGAGATGTTAMGATTTCCCAATTGAAGGGATTGNNTTTAAA 

GATATTTCACCACTTTTAGCANATGGAGAAGTGCTAAATTACACAATTAATCAAGTGGCT 

GAGTTAGCTAAAGATGCAGATGTTATTATAGGGCCAGACGCCAAGAGGTTTCTTGTTTGG 

GACACCTACTGCAGCTTTTTTAAAAAMCCTTTTATTATGGTMGAAAACCTAAAAAATT 

ACCAGGAGACGTTATTAGTTTTGAGTATGATTTAGAATATGGTAAATCAACTCTAGAAAT 

CCAMCTAATATGTTGAAAAAAGGCCAAAAAGTAGCAATTATTGATGATGTTTTAGGTAC 

TGGCGGACAATGAAAG 

Sequence 294 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAATTTCTAATTG 
ATCCTGTTCACATTCAGTGAAATGGCATTGCATATTTATATGTTGCTTACAGCTTATTGA 
TTTAGGTAACTATTGTGTCTTCCTTCACTATCTGACCTGAAAAGCACTCTCTTCTCTATG 
CACTCTTATATTCTGCCTrrCTGCCTGGAGTTTGAMTACATGTCTCTTTAGTTTCTTTT 
GCACATGCTACATTGTGCTTTAGACCGGGAGATAATACAGGNGCCTTACCTTACAAATTN 
ATNT TTNTGGCAACNCNAATTNTNTNGAAATTTTNNTTTAATTTNAAAACCCCAAACCAA 
TTTTCCNNCNCAAAAAI I I I I I I I I IGGGAAAAATTAANTTCTTTAAANNNAACCCCCCN 
AAAAATTATNGGNGNNAAAAGGGNGCCCCNTTTTGGGCCC ) 1 1 1 I II I I 1 1 1 CNCGGGNG 
GGGNAAA AAATTTNAAAAAAAN I I I I I I I I GGGNNCCCCGGGAGAAAAANNTCCCCTNTT 
1 1 I I I 1 1 ICNGGGGTTTTAAAAANGGGGGGNNAAAAATNTTTNNTGGCCCCCCCCNTTTT 

Sequence 295 

TATAGGGCG AATTG GAGCTCCCCGCCCGNGGTCCCAAATGGAAGTGTGAAAACCANGGCC 

CATCCCCCNNTTTTNTAGAGGGGTGGTAAAAAATAAACCCANANATCAAGGGGAGAAAGG 

AAAAGGATGAMGGACAAACTGCCAAAAAAATTTNCCCAAAGTGGCGACTTTTTTAANTN 

TGGGAGCCAGAATTCTGAGGGCTTTGCATTGTCTTTGCAATTCNCTCAAGGAGCCTGAAA 

TTGAAAAAAMTGCCMCAAGGCCAAATNMCTACTTTTTAGGAGGGGGTTTTGGAGGTC 

TTGGGAAGCCTCATTCCCNTTCAACCNNTCNMTTCTGGGAATGGGGGAAATGGAAAGAA 

TAGAAGATGTTGGGTGCCCACTAGGCTACTGNTTGAAAAGGGGAAGCTTGAAAANTTNCT 

NCACCCAAGGTTGGGTATTCAAAAATATTGTAATGGACTGGGTATTGGCAAAAAGG 

Sequence 296 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACAGGTGGGTCCCTTTTCAGAGGT 
TGGGCCTTCTAGACCTCACCTGTTCTCACTNCCCTGGTTTAAATTCAACCCCAAGCCATG 
GCCAATGGCCAAATAATAGAMTTGGTTCCCTACCCAGCTGGACCAGGGGAGGGAGGTCT

TGTGCAGTTTCTTGACCACTTTGTTGGTTGGACCATNGGCTTAAATACCAATGGGGTATT 

CGGCTTGAGACCTAAAGTTTGTAAGAAAATTNMCCAAAATGGTGCCTGCTTGGGTTAAA 

MTGGGCTACCACCTCMTCTGGACTTCMTTCTTTTMTTCTMTTTTAAGTTTGGGTT 

TTGGTATTCTTTGGCCTAAAGGTGGCGGTAGTCCCAACCTCTTTGGGTANTTACCCCTTC 

CTAAATAGGTCAATACCTAGGTAGGTCAATACCTCCCTGGGTGGTAAGGNGGTATTTCTT 

CTTAAAAAAGCCTTTTAAAA 

Sequence 297 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGGCCACACTGGGACTGAGATACGGCCCAGACT 

CCTACGGGAGGCAGCAGTAAGGNNTTTTCCACAATGAGCGAAAGCTTGATGGAGCGACAC 

AGCGTGCAGGATGAAGTTCTTCGGAATGTAAACTGCTGTTATAAGGGAAAAAAAAAAAAA 

AATAANNAAAAAAANGGTACCT 

Sequence 298 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGNCAGGTACGCGGNGAAATGCA 

MAAMTCAAATCAATTTAATAGAATACATCAGAGATGTTAAAGATTTCCCAATTGAAGG 

GGTTGTATTTAAAGATATTTCACCACTTTTAGCAAATGGAGAAGTGCTAAATTACACAAT 

TAATCAAATGGCTGAGTTAGCTAAAGATGCANATGTTATTATAGGTCCAGACGCAAGAGG 

TTTCTTGTTTGGGACACCTACTGCAGCTlvrrTTTAAAAAMCCTTTTATTATGGTAAGAAA 

ACCTAAAAAATTACNAGGAGACGTTATTAGTTTNGAGTATGATTTAGAATATGGTAAATC 

AACTCTAGAAATCCAAACTAATATTTTGAAAAAAG 

Sequence 299 

CCGGGCAGGTACGGCCACACTGGGACTGAGATACGGCCCAGACTCCTACGGGAGGCAGCA
GTAAGGAATTTTCCACAATGAGCGAAAGCTTGATGGAGCGACACAGCGTGCAGGATGAAG 
TTCTTCGGAATGTAAACTGCTGTTATAAGGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
GTACCT 

Sequence 300 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTTCTAGGACAATCAGGAAG 
TMTCTTAAAAAAATMTTGMGATGTTAAAAAATACGTTAAAAATAAAAAACTTGTTTT 
AAACATAGATGCAGTAGAAATTGAAAAACCAGATTTAGATGCAAAATTATTAGCTGAATC
AATTGCAATTAAATTAGAAAACCGTGGATCATACCGTATGGCACAAAAATTTGCAATTCG 
TTTAGCACAAAAAGCCGGAGCTAAAGGTATTAAAACTAAAGTTAGTGGTCGTTTAAATGG 
TGTTGATATGGCTAGATCAGAAGGATATTCTGAAGGTGAAATGAAATTACACACACTTAG 
ACAAGATGTTAGTTATGCAACAGCAACAGCAAGAACAACTTATGGAGCACTTGGAGTTM 
AGTTTGAGTTTCATTAGGCGAAGTATTT 
Sequence 301 

CCGGCCAGGTACGCGGGGAAATGCAAAAAAATCAAATCANGNTAATAGAATACATCAGAG 

ATGTTAMGATTTCCCMTTGMGGGATTGNATTTAAAGATATTTCACCACTTTTAGCAA 

ATGGAGAAGTGCTAAATNACACAATTAATCAAATGGCTGAGTTAGCTAAAGATGCAGATG 

TTATTAT AGGT CCAGACGCMGAGGTTTCTTGTTTGGGACACCTACTGCAGCTTTT7TAA 

AAAAACCnTTTATTATGGTAAGAAAACCTAAAAAATTACCAGGANACGTTATTAGTTTNG 

AGTATGATTTACAATATGGTAAATCAACTCTAGAAATCCAAACTAATATGTTGAAAAAAG 

GCCAAAAAGTAGCAATTATTGATGATGTTTTAGCTACTGGCGGMCAATGAAAGCGATTA 

TT AACTT AATCGAATCTCMGGTGCTGNTGTTCATAMGNMTCTTTTTACTTGAATTAG 

GAl l l 1 1 AAACGGNATTNAAAAACTTAAAAAAATATGACCGTTAGCTCATTAATTAAAAG 

TTTAGTACCTCGGCCCGCTCTAG 

Sequence 302 

AGGTACTTTGATATCTNCGCCCTCTCGTGTGTTCCTTGTGGNGNTAACCAGAGGCAAGAT 

GCCCGAGGAACTTCATGTGTATGTCTACCAGGATTTCAGATGATCTCTAATAATGGAGGA 

CCTGCTATTATTTGTAAAAAGTGCCCAGAAAACATGAAAGGTGTTACAGAAGATGGCTGG 

AAC TGCAT TTCTTGCCCTAGTGACTTAACTGCCGAAGGAAAATGTCACTGTCCCATTGGC 

CATATTTTAGTGGAAAGAGACATTNATGGAACATTGNTGNCTCAAGCAACTNGNGAGCTC 
TGNGATGGAAATGAAAACTCTTTTATGGTAGTAAATGCTTTAGGAGACAGGNGCGTNCGA

TGTGAGCCAACATTTGNTAATACCAGCAGGTCCTGTGCATGTTNCGAACCTAACATTTTA 

ACAGGGGGATTATGTTTCAGNAGCACAGGGMTTTTTCCTTGTACGTANAATTTCACCTG 

CACGTTATGGAGAAGTTTGGCAT 

Sequence 303 

CCGGGCAGGTACGCGGGGAAATGCAAAAAAATCAAATCAATTTAATAGAATACATCAGAG 

ATGTTAMGATTTCCCAATTGMGGGATTGTATTTMAGATATTTCACCACTTTTAGCAA 

ATGGAGAAGTGCTAAATTACACAATTAATCAAATGGCTGAGTTAGCTAAAGATGCAGATG 

TTATTAT AGGT CCAGACGCAAGAGGTTTCTTGTTTGGGACACCTACTGCAG CI H 1 1 I A A 

AAMACCTTTTATTATGGTMGAAMCCTAAAAMTTACCAGGAGACGTTATTAGTTTTG 

AGTATGATTTAGAATATGGTAAATCAACTCTAGAAATCCAAACTAATATGTTGAAAAAAG 

GCCAAAAAGTAGCAATTATTGATGATGTTTTAGCTACTGGCGGAACAATGAAAGCGATTA 

TTAACTTAATCGAATCTCAAGGTGCTGTTGTTCATAAAGTAAT CI 1 1 I I ACTTGAATTAG 

GATTTTTAAACGGAATTGAAAAACTTAAAAAATATGACGTTAGCTCATTAATTAAAGTTT 

AGTACCTCGGCCGCTCTA 

Sequence 304 

GCGGTGGCGGCCGAGGTACCTTNTCCGAATGCACCTTNAAAGCGGGTATTAGCCTATACA 

GGCTGTTTTAGTCGAATGCAGACCATCAAGGAAATTCNNGAATATCTATCTCAAAGACTG 

CGCATTAAAGAGGAAGATATGCGCCTGNGGCTANTCCANAAGTGGAGAANTACCTTACTC 

TTTCTGGGNTGATGAGGAATCATAAATCTGGAATATTTNGAAAATCCAGGATGAACAACA 

C 

Sequence 305 

GCNGNCGCGGGGAAATGCAAAAAAATCAAATCAATTTAATAGAATACATCAGAGATGTTA

MGATTTCCCMTTGNNGGGATTGTATTTAAAGATATTTCACCACTTTTAGCAAATGGAG 

AAGTGCTAAATTACACAATTAATCAAATGGCTGAGTTNAGCTAAAGATGCAGATGTTATT 

ATA GGTC CAGACGCAANGAGGTTTCTTGTTTGGGACACCTACTGCAGCTTTTTTAAAAAA 

ACCTTTTATTATGGTAAGMAACCTAAAAMTTACCAGGAGACGTTATTAGTTTTGAGTA 

TGATTTAGAATATGGTAAATCAACTCTAGAAATCCAAACTAATATGTTGAAAAAAGGCCA 

AAMGTAGCAATTATTGATGATGTTTTAAGCTACTGGCGGAACAATGAAAGCCGATTATT 

AACTTMTCGMTCTCMGGNGCTGGTGNTCATAMGTMTCTTTTTTACTTGGAATTAG 

GGATTTTTNAACCGGAAATTGGAAAAA 

Sequence 306 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGCAATTA 

TGAAATTATTGCAGAAAGAAGATTCACTCTCACCTGATGAATAAGTGTTCATAGGTNAAG 

GCTACMMTACTMTTTGTTATTATTTTTMTAATMTTTTTGTTTTGCTGAGAAAGTG 

GATTT ACCAC TTTTTTATTTTTTAATCCMGGAGGAAMATTATTTCCAAACCAAATCCT 

AAMATT TTTCAC GTTCTAAACCAGTTCAAGAACATTGAGTAAACAGAAATATTCCATTT 

GTCAAAGTTTTTCTTATCGGCTCAGATAATGAAAAAATTGGGATAATTGAAACAAGAGAA 

GCTATTGAAATGGCAAAAGAACAAAA 

Sequence 307 

CGAATNGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGCAGCAAGCGGACGTGAGC 

GATAATGGCGGATATGGAGGATCTCTTCGGGAGCGACGCCGACNGCGAAGCTGAGCGTAA 

AGATTCTGATTCTGGATCTGACTCAGATTCTGATCAAGAGAATGCTGCCTCTGGCAGTAA 

TGCCTCTGGAAGTGAAAGTGATCAGGATGAAAGAGGTGATTCAGGACAACCAAGTAATAA 

GGAACTGTTTGGAGATGACAGTGAGGACGAGGGAGCTTCACATCATAGTGGTAGTGATAA 

TCACTCTGAAAGATCAGACAATAGATCAGAAGCTTCTGAGCGTTCTGACCATGAGGACAA 

TGACCCCTCAGATGTAGATCAGCACAGTGGATCAGMGCCCCTA 

Sequence 308 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAATGTTATTAATGTGACTGACA 

AGTAATTAGAAAACTGGAMTTAMTTTTACAAACATTTTTAAAATCGCTNCAATTAAAA 

AAATTCAAGATGGTTACATTATGAATATGAATGAAATGTCATTAGCGACTTCGTTAAATG 
TAT ATGTAATTCTATA TTTTCCCCAAMCCCACATTTTATGMGMTATTTATTTATTTA
TTTATTTTTGi mi iGAGATGGAGTCTCGCTCTGTTGCCAGACTGGAGTGCAATGGTGC 
GATCTCCGCTCACTGCAACCTCCACCTCCTGGGTTCAAACGATTCTCCTGCCTCAGCCTC 
CCGAGNAGCTGGGACTACAGGCACCGNCACCACGCCCGGCTAATT 
Sequence 309 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTGACCCTCCTTGATGGTTTACTTTGCAAGCTA 

TGGTGACCTCCGCAAGTTGTGTCTGGGCCCATCCAGGGCTCTGACTAATTGTATTCAAAT 

C AAGGCA GGAGCGGGCCAGCTGGCGTTGACTTAACCAAGCCATTTTATAAGCCTCCCGAT 

CATTTTTAAGCCACTCTAAGTCGTGTAGTAGGATCTGGTCAGAGTTATGTATACTCTGAT 

GGGCATGTGCTGTGTCGTCTAAAATGTCCAGAAGTTCTGAAACACTTTTAGATCTTCCAG 

AATTTCTTGAGGAAGTCTGCCTAAGTAACTGATGCACATCAAGTTCATCACCGGAGGAAT 

CAAMGMTTTCCATTTTCTATTTCTCTACAGAAAAGAAAAGGATCTTCCTTTAAGATGG 
AAATATTATTTCTCTTC 

Sequence 310 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCACTTGAATTATCTA 

TTGAAAGAACT ACTACAT CGAGTTTTTGTTCTTTTGCCATTTCAATAGCTTCTNTTGGNN 

NAATTATCCCAATTTTTTCATTATCTGAGCCGATAAGAAAAACTTTGACAAATGGAATAT 

TTCTGTTTA CTCAA TGTTCTTGAACTGGTTTAGAACGTGAAAAATTTTTAGGATTTGGTT 

TGGAMTAATTTTTCCTCCTTGGATTAAAAAATAAAAAAGTGGTAAATCCACTTTCTCAG 

CAAAACAAAMTTATTATTAMAATMTAACAAATTAGTATTTTGTAGCCTTTACCTATG 

AACACTTATTCATCAGGTGAGAGTGMTCTTCTTTCTGCAATAATTTCTAATTGCCCCGN 

GTACCTGCCCGGGCGGC 

Sequence 311

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGNCAAGGTACCTGACTGTGGC 

TCANATCTGCGTCGCAGCAGCGAGAGAAGAAATCACTCCATATCCGATGAGAGGAAGGGT 

GGCACAGANATGGTGTCTACAATTAGAGACATTTCTGACTCCACCTTAGCCTAAGCAAAC 

TTTATATACTGAGTMCATTTGMGGTTGTCTTTTMTGGTGGGGGGTGNTTTTTCCTTT 

TTAAACTACAGT 

Sequence 312 

CCAAAANGGGNCCTGGGGCGTGGTCACGGCCNAGGTACAAAATTATGACTGTTTTAGCTC 

CAGGAACAGATTTAAMGTMTTGMGACGTTTTAAAATCTGTTTTTGATGCTAAGAACA 

TCGAAAA AAATT G AAAA ACTTGAAAGAACAGAGTTAGCATACGAAATTAANNAGCACAAA 

CAAGGAATTTTTGTTTTAGCTMCTTAAAATCTGAGGAMGTTTAATCGAAGAATTTGTC 

AGAAGAGTAMTATTCTCAMAAACAAGTTTTAAGATTTTTAGTTATTAATCTAGATTCT 

GAAAGAGGAATGCACAAAACTTTCAGACCTAGAAAAAATGATAAACACAAATTTTTCTCT 
TCTAAAAAACCA 

Sequence 313 

AATNGGGCCTNGCGTGGT CACG CCCAGGTACNAAATTATGACTGTTTTAGCTCCAGGAAC 

AGATTAMAGTATTGAGCNTTTTAATCTNTTTTGATCTAAGACATCGAAAATTGAAAACT 

TGAAAGACAGAGTTAGCATACNAATTTAAANGCACAACAGGATTTTTTGTTTTACTAACT 

TMA TCT GGGGM GTTTAATCNAAGATTTGTCAGAAGAGTAAATATTCTCAAAAAACAAG 

TTTTMGATTTTTAGTTATTAATCTAGATTCTGAAAGAGGATTGCNCAAAACTTTCAGAC 

CTAGAAAMTTGATAMCACAMTTTTTCTNTTTTAAAAAA 

Sequence 314 

NGGGCCTNGGNGTGGNNACGANCCAGGTACTTTTACCAAAGAATCTACTAGAACTCTCTG 

CTATTCAAAACAAAGAGCTCATACTTGTTGGAGTAGGGAAAAAATTAGAAATTTGACCAA 

AAGATAGATTCAATCAACTACAAAGTCAATTCCAAGATGCTGNTAACATCGAAACTCTTG 

AAAAAGAACTATTANAATCTGGAGTTGAACTfTAATGGATCATATACCTGNTTTGCTTGA 

TCAAGCGGATTGNTCAGCTTMTATTAAAGAAGATGGNATCTATTTAGATCTTACTTTAG 

GACGNGGNGGNCATTCGAGNCAAATTTTAAAAAAACTTACTA 

Sequence 315 
CCCTTTCGAGCGGCCGCCCGGGCAGGTACTTGTCCATAATTTGTGAATATATTAACTAAT

TTTTCTTTTGAGTATTCATTTACTAACTCATMGCTGAAMGCTTTGAGCTTGATTCATT 

CTCATATCCAAACGAGCTTGTTTGTTGTATGAAAATCCTCTATTGGCTTGATCTAACTGA 

GGTGMGATACTCCTAAATCMGTAATACACCATCTACTTTGTTGATTTGAAGTTTTTTT 

AGTTCTTGATCAAAATCTTTAAAATCAGATCAAATMATTCAATATTTGAAGAAATTTTT 

AAGAG l 1 1 l I CTTTTGTTTGTTCAATTGCTTGTTTGTCTTTATCAAAGACTA 

Sequence 316

CCCTTAGCGTGGTCGCGGCCGAGGTACAGGAACTGCCAAAGGCAACAGAAATTCTTTCTC 

CCTATGTCCCAGCCTACCCCCACTTTACCGAGGCCAACAGCCGCCTCAGAAACCAGATTC 

AGGAGCTAACATGCCCCAGGTCTCACGAGGATCAGAGACTCCAGAGGCCAGGGAAGGAGA 

TCAAGGTAGTCAAGCGGGGGTCGTCTCAGATCTGGTTGTGCTCGAGCTATGCAAATGCCT 

CTCATGGAGATGTGAGGACC TATCT ATTATGATGACCAGGGCCACATCCGGAGGGGGCAA 

CAGACTTTCATCTATCAGCCCTTTTCAACCACTGATCTACTAAACTGGAAACACTCTGAA 

Sequence 317 

NGGGCCTTGNGAGCGGCCGCCCGGGCAGGTACGCGGGGGTTTAAAAAAATATTTAAAAAA 

ATGGAGGMTTATG AACTTA AAAAGCAMTTMGAAGCTTTTTAGGTGCTACTGCACTTG 

TGTTGCCAGTAGCA I I I I I I GCTTCTTGCCAAACAAGATTTGNNNAAGTAAATGACCATA 

MTTAGTGATTGCTCACACTTTTAATAGTAGAGAAGGAAGGTTTTTAGCATTAGATCAAA 

TTGTTAAGCTTTGAAATGAAAGTGAAAAAGTTAAAAACAAAGAAGAAGGATTTTATCCAA 

TAACACTAAATCGACAATTTGCGCAAACTTATGCAGAA 

Sequence 318 

CAGGTACTTTAGCTCCAAAATCAGTTTGATGAGATACAGTTGCCCCTATATGAGAATGCA 

CAGGATTCCTCATTGGTGAGTTCAACCATACATTTTGGGTAACTCCTGAAGACATCTGCA 

MTTGTGAGTTAGTTGGTGGGGTCCATTAACATTTGCATATGTTATTCTTTCTACTGAAG 

TGTGTGAGGCCACAACTGTCCCATTATGCATATCANAAACAGAAATTTGTTGAGGATAAT 

TTTGGATATTCAGCAGNGGCTGNGAAACTGGATTTGAATTACCGGGATACATGCATGCTT 
CTTGGTT 

Sequence 319

CCCTTAGCGTGGTCGCGGCCGAGGTACGCGGGGCAATACGTAGAGATAATAACAGTTTTT 

TAAAMACTTAATATTTGTTATTGA ATGT ATATTTTTGAGTATTGCATCTTTTCTATACT 

MTAAGGAGGTGTAATTTGAACGCTTTTAGAAAGAAAAAAGAAGAATTAGTGCAAGAAAT 

TMAGATTTGATTMTTCTTCTTCTTCATTAGTTATAGCTGAATATCGTGGATTAACAGT 

TGCTGAMTTGAMCTCTTAGAAATGAAGCTTAMGMGCAGGTGTTTTTGTAAAAGTTT 

ATAAAMTAGACTATT TAAAATAG CATCTAAAGAAGCAGGTTTCGGAGATTTAGAACAAT 

CACTAGTTGGTCCAAATTTTTTTGCTTTTGGTTCTACAGATGCANTAGCTCCAGCTMAA 

TTATTTCAAAATTCGCTAAAAACAMTCCAGTMGTTGTATTMAAGGCGGTATTTTTTG 
A 

Sequence 320 

CCCTTAGCGTGGTCGCGGCCGAGGTACCACGGAATTTTAAAAATAGACTNTAAAAAACCN 

TCAATTGCANMCTMTTTTGNTTGATTTACTTCAAATTGAAGGAACTCTTTTAGGTTTT 

GTTGNAATAGTAAANGAAATTCGATNAGTAATTCAATCAAAGAATAATGAATNGACAAAT 

TCTGACATAAAAACAGCCAAAGAATATCATTTAGAACAACTAANAGCGCTTAAAAAAACT 

TCAMTTAACAMTA TMTT TATM TTTAT GGCACTATGAAAAAAAAGNGACTTTATTTA 

ATATCMCTTTANCATTTTCMCMTTTTATTAGCTATTNCTNGTGGTAAAAATCAAACT 

ACTCCTGTAGGCACACAACCANTTGATANTCCAAGCCTGGAGGATTCATCAAGNGAATCA 

AN AGTTCTG GATTT AAATTAATAG AAAAAAAG 

Sequence 321 

CCCTTAGCGTGGTCGCGGCCGAGGTACAATTTCTAATTGATCCTGTTCACATTCAGTGAA 

ATGGCATTGCATATTTATATGTTGCTTACAGCTTATTGATTTAGGTAACTATTGTGTCTT 

CCTTCACTATCTGACCTGAAAAGCACTCTCTTCTCTATGCACTCTTATATTCTGCCTTTC 
TGCCTGGAGTTTGAMTACATGTCTCTTTAG

AGACCGGAGATAATACAGTGACTTTACCTCACAAATCATATTCTGTCAACACAAATCTAT 
GAATTTAGTTTATTTAMATCAGAACMTTTCCTACAAMTTTTTCT 

CTAACAGACCTACCAGAATCATGCTTAAAGGGCTCCCTTGACACTTATTCTATACTGAAG 

GATAAATTTTAAAAAAT 

Sequence 322 

CCCTTTCGAGCGGCCGCCCGGGCAGGTA C 1 1 I I I M I H I 1 1 I 1 1 I I I 11 II 1 1AGTTAA 

ATATAAAT TTATT TATTAAC Mil TCMTACCTTTTGAAGGTCAAAGTTTGTAAGATGAA 

TTAGGTTCTTTTGAAAGTAAATTCCCACAACTGAAAGTGCAGCTAGAATAATAAATACTG 

CATATGCAGCTTCTGTTGATGCTGATTTAATTCCTGCAACTAGAACAGTTGCTATAGAAT 

AAMTGCATMCCTMTCCTCANATMGTCCAMTTGATATCCMCTTTTTTAGGATTTG 

ATCCTTTGTATTCGGGAGGMGATTTMGATAACACCTTGAATTCCTCAAAGCATTACTC 

CCATTAAAAATCCTAAAATATAGAATAATCAAGTTCCATTACCTA 

Sequence 323 

GCGGTGGCGGCCGCCCGGGCAGGTACAAGGTAAAGCAAGAGCTGGCTCTCTACGTTCACC 
AATTTTTGTAGGTGGTGGTCGTGCATTTGGGCCTACAAATAATAAAAATTACAAAATTAA 
ATTAAACAAAAAAGTTCGCAAATTAGCTTTTGCCTCAGCTTTTAAGTCAACTTGCTCAAA
ATAATCAAGTACCT 
Sequence 324 

NNCTTAGCGTGGTCGCGGCCGAGGTACGCGGGGAAATGCAAAAAAATCAAATCAATTTAA 

TAGMT ACAT CAGAGATGTTAAAGATTTCCCMTTGAAGGGATrGTATTTAMGATATTT 

CACCACTTTTAGCAAATGGAGAAGTGCT AAATT ACACAATTANNCAAATGGCTGAGTTAG 

CTAAAGATGCAGATGTTATTATAGGTCCAGACGCAAGAGGTTTCTTGTTTGGGACACCTA

CTGNAGCI I I I I I AAAAMACCTTTTATTATGGTAAGAAAACCTAAAAAATTACCAGGAG 

ACGTTATTAGTTTTGAGTATGATTTAG 

Sequence 325 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACTTGTCCATAATTTGTGAATATATTAACTAAT

TTTTCTTTTGAGTATTCATTrACTAACTCATMGCTGMMGCTTTGA 

CTCATATCCAMCGAGCTTGTTTGTTGTATGAAAATCCTCTATTGGCTTGATCTAACTGA 

GGTGMGATACTCCTAAATCMGTMTACACCATCTACTTTGT^ 

AGTTC TTGATCAAAA TCTTTAAAATCAGATCAMTAAATTCMTATTTGAAGAMT1TTT 

AAGAG 1 I I I I CTTTTGTTTGTTCAATTGCTTGTTTGTCTTTATCAAAGACTA 

Sequence 326 

CCCTTAGCGTGGTCGCGGCCGAGGTACAGGAACTGCCAAAGGCAACAGAAATTCTTTCTC

CCTATGTCCCAGCCTACCCCCACTTTACCGAGGCCAACAGCCGCCTCAGAAACCAGATTC 

AGGAGCTAACATGCCCCAGGTCTCACGAGGATCAGAGACTCCAGAGGCCAGGGAAGGAGA 

TCAAGGTAGTCAAGCGGGGGTCGTCTCAGATCTGGTTGTGCTCGAGCTATGCAAATGCCT 

CTCATGGAGATGTGAGGACCTATCTATTATGATGACCAGGGCCACATCCGGAGGGGGCAA

CAGACTTTCATCTATCAGCCCTTTTCAACCACTGATCTACTAAACTGGAAACACTCTGAA 

Sequence 327 

NGGGCCTTGNGAGCGGCCGCCCGGGCAGGTACGCGGGGGTTTAAAAAAATATTTAAAAAA 

ATGGAGGAATTATGAACTTAAAAAGCAMTTAAGAAGCTTTTTAGGTGCTACTGCACTTG 

TGTTGCCAGTAGCATTTTT TGCT TCTTGCCAMCAAGATTTGNNNAAGTAAATGACCATA 

MTTAGTGATTGCTCACACTTTTAATAGTAGAGMGGMGGTTTTTAGCATTAGATCAAA 

TTGTTAAGCTTTGAAATGAAAGTGAAAAAGTTAAAAACAAAGAAGAAGGATTTTATCCAA

TAACACTAAATCGACAATTTGCGCAAACTTATGCAGAA 

Sequence 328 

CAGGTACTTTAGCTCCAAAATCAGTTTGATGAGATACAGTTGCCCCTATATGAGAATGCA 
CAGGATTCCTCATTGGTGAGTTCMCCATACATTTTGGGTAACTCCTGAAGACATCTGCA 
MTTGTGAGTTAGTTGGTGGGGTCCATTMCATTTGCATATGTTATrCTTTCTACTGAAG 
TGTGTGAGGCCACAACTGTCCCATTATGCATATCANAAACAGAAATTTGTTGAGGATAAT

TTTGGATATTCAGCAGNGGCTGNGAAACTGGATTTGAATTACCGGGATACATGCATGCTT 

CTTGGTT 

Sequence 329 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAAACTTTAATTAATGAGC

TAACGTCATA I 1 1 I I I MGTTTTTCAATTCCGTTTAAAAATCCTAATTCAAGTAAAAAGA 

TTACTTTATGAACMCAGCACCTTGAGATTCGATTAAGTTAATAATCGCTTTCATTGTTC 

CGCCAGTAGCTAMACATCATCAATMTTGCTACTTTTTGGCCTTTTTTCAACATATTAG 

TTTGGATTTCTAGAGTTGATTTACCATATTCTAAATCATACTCAAAACTAATAACGTCTC

CTGGTAATTTTTTAGGTTTTCTTACCATAATAAAMGGGTTTTTTTAAAAAAGCTTGCAG 

TTAGGGTGGTCCCAAACAA 

Sequence 330 

CCGCGGTGGCGGCCGCCCGGGCAGGTACACCTCTGATTCTCACTAGTTGAATGCAAGAAC

TTGAAAGGTTCAGGTMGTGTTTTGAAAMTTTTGACTTTCCAMCTTTTGCCACTTGCT 

ATCTGAAACTCAGGAATCAAAAAATACCGACAGGCACTGTTACTTTCAAAATTCTTTCTA 

TAAGTTGAGAATGGGACAGATTTGCAGAGCAAGGGAAACTTGAACAGTTACTTCTAGTGG 

TAGGAAATGAGGTGGCTAGGATATTACCCAGCTGGTGGGTGACTTGGGCAGTGTGTTCCT 

GCTTTCAGTGGTTAGCCTTTAGCAAATCTGCTTTAGAGTGAGAGTAGAGGGCAGGCTGTT

GTATTACAGTGCTCTTGTTTTTGTAAMTTTAATTCACTCTACTGNTATTTTGTCTCCTT 

GGGTAAAG NGNTATTTAATTTTTCT 

Sequence 331 

ATTATTGCNGAAAGAAGATTCACTCTCACCTGNTGAATAACGTGTTCATAGGTAAAGGCT

ACAAAAT ACTAATTTGTTAT TATTTTTMTMTMTTTTTGTTTTGCTGANAAAGTGGAT 

T TACCA Ci I I I I lAI 1 1 1 1 I AATCCAAGGAGGAAAAATTATTTCCAACCAAATCCTAAAA 

ATTTTTCACGTTCTAAACCAGTTCAAGAACATTGAGTAAACAGAAATATTCCATTTGTCA

AAGTTTTTCTTATCGGCTCAGATAATGAAAAAATNGGGGATA 

Sequence 332 

CCGCGGTGGCGGCCGCCCGGCCMGGTACGCGGGGGCAGAAGAGGAAGATTTCTGAAGAG 

TGCAGCTGCCTGAACCGAGCCCTGCCGAACAGCTGAGAATTGCACTGCAACCATGAGGTA

AATATTTTCCCTTCGTATTCGGTAGTGCTGTTGAGTCATCTTGTCCAATGCAAATCCTGA 

GMGCTATGTTCCCAMGAGGGCCAGCTCCATTTAGTGTTTGTTTATAGCCTTACTATGC 

CTCTACCTCTGGGGGTTGTAAATCTGTTNTACCCAATGGGNGGGTTTGTTNCCCTTCCTG

AANCAAAl i i i I CTGCTTNNACACTTGGGCAAACCNTTTCCTAAATTTCATCCTCCCANA 

ACTTTGCNCCNCCNTTGGGGGGAGGTTTGGGGTTTTCAACTCCNGAANAAAAAGAGGGGC 

CCCCCA CCNN AGGGNNTNNTTTNnTTTAAATTNGGGCCNNGGGGGNTTNNANTTAAAAAA 

NNGGGNTmNGGGGGACNNTTmNTmNACCCCCCCCCCCCTTTTTT 

Sequence 333 

GGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAACGTTGACCTTCCTTG 

CCCAAGAATGAATGATTAAAAAGTTGAAAAGCTGTGTTTGTAGATCGTTCAGGCCTTCGT 

CTGCAACTATCAGGAAATATGGGTAACTCTCTTCCAAGAAACTTCCCCACTCTTTTGATA 

AGCATCTCGAAAATGTTGTTCGAACATCAATGGTGGTATTTTCTGAAGATGAAGAATTAA 

AGCAGTTCTCAAAGAAAGAAGTTCAGGGAAGTTGAAATACGCATACTCGGCATCCTTGAA 

GAAAACTATGGTGAATTGTCCTCCTTTGCTAATAAGATCCACAAGATAGCCGTTCAACCA 

GATAGAAAAAATGGGAGGTNCCTGCCCCAGGCTTTAAANTGANTNNCNCCNCAGCATCCC 

CTTGGGGATNNGGTAANNNCCNCCCCCCCNANCAAAAAAAAAAATTTTTCCCCNCCNACC 

NMTTNTGAAMACCNNGNGGGGGTTTAAAAATTTTGNGNTCNNAATTTNAAAAAAAAAA 

AN 

Sequence 334 

CCGGGCAGGTACTAGGAGATATTGATTCTAGTCAATTAGGCATTGTAGACTGTCATGACC 

ACTTAATAAAAAATTATGGACCTGAAGCTCACGAGCATCCAGATTTTATTATGATGTCAA 

AAGATGCTGCAATTAAAGAAATGAATGAATATGTAGCAAAAGGAGGAAAAACTGTTGTTA 
CAATGGACCCTCCTAACGTTGGGCGTGATGTTTATCAMTGTTAGATATTGCAAAGAAAT

TAGMGGAAMGCTMCATTATTATGGCMCTGGTTTTCATAAAGCTGCATTTTATGACA 

MGGTGC7TCTTGACTTGCTTTGGCTCCAACAGATAAAATTGNAAAAATGGTTGTAGCTG 

AAATCGAAGAAGGAATGGATGAATATAACTACAGCGGACCAGTTGTAAAAAGATCTAAAT 

CCAAAGCCGGAATTATTAAAGC 

Sequence 335 

CTCCCCGCGGTGGCGGCCGAGGTACCGCGGGGAAATGCAAAAAAATCAANNCAGTNNANT 

CNAATACATCACAGATGTTNAAGATTTCCCMTTGMGGGATTGTATTTAAAGATATTTC 

ACCACTTTTAGCAAATGGGAGANGTGCTAAATTACACAATTAATCAAATGGCTGAGTTAG 

CTAAAGATGCAGATGTTATTATAGGTCCAGACGCAAGAGGTTTCTTGTTTGGGACACCTA

CTGCAGCTTTTTTAAAAAAACCTTTTATTATGGTAAGAAAACCTAAAAAATTACCAGGAG 

ACGTTNTTAGTTTTGAGTATGATTTAGAATATGGTAAATCAACTCTAGAAATCCAAACTA 

ATATGTTGAAAAAAGGC 

Sequence 336 

CTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGAAATGCAA 

AAAMTCMATCMTTTAATAGAATACATCAGAGATGTTAAAGATTTCCXJAATTGAAGGG 

ATTGTATTTAAAGATATTTCACCACTTTTAGCAAATGGAGAAGTGCTAAATTACACAATT 

AATCAAATGGCTGAGTTAGCTAAAGATGCAGATGTTATTATAGGTCCAGACGCAAGAGGT 

TTCTTGTTTGGGACACCTACTGCAGCTTTTTTAAAAAAACCTTTTATTATGGTAAGAAAA 

CCTAMAAAATTACCAGGAGACCGTTATTTAGGTTTTNGGGTNNAGATTTAAAATTGNGG 

AAANCCCCNNTTNTAGAAATCCAAACTAATTNTGTTGNNAAAAGGGCCAAAMGTNCCM 

TTATTGGTGATGTTTTAGCTACTGGCGGAACAATGAAAGCGATTATTAACTT 

Sequence 337 

CCGCGGTGGCGGCCGAGGTACCAATAATAGCAACCCTGTGATTTGTCCAAGTGCCCGGGA 

GTGGAGGCCATCCTGACAACAGCTCTATGATTTTCTATGCCAATGACACAGGAGCCCAAC 

AGTTTGAAAAGTGGTGGGATAAGTCCAGGACAGTCCCCTTTTATCTTGTAGGGCTCCTCC 

TCCCACTGCTCAATTTCMGTCTCCTTCATTTTTTTCAAMTTTAATATCCTAGGCACAG 

TGTCTGTCCTTTATTTGATTTTCCTTGTCACCTTTMGGCTGTTCGCTTGGGATTTCATT 

TGGMTTTCATTGGTTTATACCMCAGMTTTTTTGTACCTGCCCG 

Sequence 338 

CCGGGCAGGTACCTGGAAGACTTCTCCACCTCGGGGGCCTGGCTGCCTCACAGGTATGAA 
GACAACCACCATAACTGCTACTCTTACGCACTCACGTTCATTAACTGCGTTCTGATGGCA 
GAAGGTAGACAGCAACTGGACAAGGGTGAATTTACGGAGAAGTACCT 
Sequence 339 

ATAAACTGCGGGATCTCAATGGCTTCTATGATCGTATTGAGGCAGTAGTTCCCACACTCT 

GCCCGGTGCCAGCATGAAAGAGAACAGGGAGAGTCAGCCTTTCACAGTCCTTGTCAAATC 

CCAATTACTCTGGTTGCAGATCACTTGAAGCCTGCCTTGTTTCTCTCACAAAGCTCTGCC 

CGAGAGTCCAGCCCCGCGTACCTGCCCG 

Sequence 340 

GCGAATTGGAGCTCCCCGCGGNGGCGGCCGAGGTACAAGATAGTCATNTCAGTAAAAGGT 

CTATTATCTAACTTGCCAAACTTGTTTANNGAGAGCCCTAAGGAACTAAAACTGCCATAA 

TGCC^GCACAGCTTGAAMGCMTTAGAGTMGCMGATTAGTTTTTCCTCCCTTCCAG 

TTCCTCAGCAGGCCTGGCTGAAGGCCCAGGAGGGAAGGAAATATAAGAACCAACAATAAA 

AATAGCAATAGCAATAAGAAGAATGCCATCCCATGGAGCACACCATAATTCTGGAACCAC 

CTNTCCCGGATCAGGCTTCCATTGCTCACGATGCTCACGCTGGGCAG 

Sequence 341 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTTCTAGGACAA 

TCAGGAAGTAATCTTAAAAAAATAATTGAAGATGTTAAAAAATACGTTAAAAATAAAAAA

CTTGTTTTAMCATAGATGCAGTAGAMTTGAAAAACCAGATTTAGATGCAAAATTATTA 

GCTGAATCAATTGCAATTAAATTAGAAAACCGTGGATCATACCGTATGGCACAAAAATTT 

GCAATTCGTTTAGCACAAAAAGCCGGAGCTAMGGTATTAAAACTAAAGTTAGCGGTCGT 
TTAAATGGTGTTGATATGGCTAGATCAGAAGGATATTCTGAAGGTGAAATGAAATTACAC
ACACTTAGACAAGATGTTAGTTATGCAACAGCAACAGCAAGAACAACTTATGGAGCACTT 
GGAGTTAMGTTTGAGTTTCATTAGGCGAAAGTATTTGCAAAGCAAAATCAAGCATATAA 
TGAAGAAGAACCAACNCACAAAAAAGGGCCAAAAAGAGCAGCAAGAGTTAAAAAAGAA 
Sequence 342 

CCGGGCAGGTACCGCTGTGTCCGGGTGGGTGGTCAGAATGCCGTGCTCCAGGTGTTCACA 

GCTGCTTCGTGGAAGACCATGTGCTCCGATGACTGGAAGGGTCACTACGCAAATGTTGCC 

TGTGCCCAACTGGGTTTCCCAAGCTATGTGAGTTCAGATAACCTCAGAGTGAGCTCGCTG 

GAGGGGCAGTTCCGGGAGGAGTTTGTGTCCATCGATCACCTCTTGCCAGATGACAAGGTG 

ACTGCATTACACCACTCAGTATATGTGAGGGAGGGATGTGCCTCTGGCCACGTGGTTACC 

TTGCAGTGCACAGCCTGTGGTCATAGAAGGGGCTACAGCTCACGCATCGTGGGTGGA 

Sequence 343 

CCGGGCAGGTACCGCTGTGTCCGGGTGGGTGGTCATGAATGCCGTGCTCCAGGTGTTCAC 

AGCTGCTTCGTGGAAGACCATGTGCTCCGATGACTGGAAGGGTCACTACGCAAATGTTGC 

CTGTGCCCAACTGGGTTTCCCAAGCTATGTGAGTTCAGATAACCTCAGAGTGAGCTCGCT 

GGAGGGGCAGTTCCGGGAGGAGTTTGTGTCCATCGATCACCTNTTGCCAGATGACAAGGN 

GACTGCATTACACCACTCAGTATATGTGAGGGAGGGGATGTGCCTCTGGCCAC 

Sequence 344 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAAACTTTAATTAATGA 

GCTAACGTCATA I I I 1 1 IAAGI 1 1 1 I CAATTCCGTTTAAAAATCCTAATTCAAGTAAAAA 

GATTACTTTATGAACAACAGCACCTTGAGATTCGATTAAGTTAATAATCGCTTTCATTGT 

TCGGCCAGTAGCTAAAACATCATCMTAATTGCTACTTTTTGGCCTTTTTTCAACATAAT 

TMGTTTGGATTTCTAGAGimGATTTACCATATTTNTAAATCATTCCTCAAAACTAATA 

ACCGCCNTCCTGGGAAATTTTTTTAGGGTTTTNTTACCCCTAMTAAAAAAGGGNTTNTT 

TT 

Sequence 345 

CCGCGGTGGCGGCCGCCCGGGCACGGTACCACTTGAATTATCTATTGAAAGAACTACTAC 

ATCGAGTTTTTGTTCTTTTGCCATTTCAATAGCTTCTCTTGTTTCMTTATCCCMTTTT 

TTCATTATCTGAGCCGATMGAAAAACTTTGACAAATGGAATATTTCTGTTTACTCAATG 

TTCTTG AACTGGTTTAG AACGTG AAAAA I I I I I AGGATTTGGTTTGG AAATAA I I I I ICC 

TCCTTGGATTAAAAAATAAAAAAGTGGTAAATCCACTTTCTCAGCAAAACAAAAATTATT 

ATTAAAMTAATMCAMTTAGTATTTTGTAGCCTTTACCTATGAACACTTATTCATCAG 

GTGAGAGTGMTCTTCTTTCTGCAATAATTTCTAATTGCCCCGCGTCCTTGGCCGCTCTA 

GAACTAGGTGGG 

Sequence 346 

CACTACTATAGGGNGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTAGGA 

GATATTGATCCTAGTCAATTAGGCATTGTAGACTGTCATGACCACTTAATAAAAAATTAT 

GGACCTGMGCTCACGAGCATCCAGATTTTATTATGATGTCAAAAGATGCTGCAATTAAA 

GAAATGAATGAATATGTAGCAAAAGGAGGAAAAACTGTTGTTACAATGGACCCTCCTAAC 

GTTGGGCGTGATGTTTATCAAATGTTAGATATTGCAAAGAAATTAGAAGGAAAAGCTAAC 

ATTATTATGGCMCTGGTTTTCATAMGCTGCATTTTATGACAMGGTGCTTCnTGACTT 

GCTTTGGCTCCAACAGATAAAATTGTAAAAATGGTTGTAGCTGAAATCGAAGAAGGAATG 

Sequence 347 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCGTAGAAGAAGAAGG 

AATACCTAAAGAAACAGACATAGAAATCATCCCAGAAATCCCGGAAACTCTAGAGCCACT 

GTCCCTTCCAGATGTGCTGAGGATCTCGGCAGTTCTGGAGGACACCACAGGCCAGCTCTC 

TATTCTGAACTACATCATGCCCGTTCAGTACCT 

Sequence 348 

TNCTTAGGGCGAAlTGGAGC TCCCCG CGGTGGCGGCCGAGGTACTAAACTTTAATTAATG 
AGCTAACGTCAT A I I I I I I AAGTTTTTCAATTCCGTTTAAAAATCCTAATTCAAGTAAAA 
AGATTACTTTATGMCMCAGCACCTTGAGATTCGATTAAGTTAATAATCGCTTTCATTG

TTCCGCCAGTAGCTAAMCATCATCMTMTTGCTACTTTTTGGCCTTTTTTCAACATAT 

TAGTTTGGATTTCTAGAGTTGATTTACCATATTCTAAATCATACTCAAAACTAATAACGT

CTCCTGGTMTTTTTTAGGTTTTCTTACCATMTAAMGGTTTTTTTAAAAAAGCTGCAG 

TAGGTGTCCCAAACAAGAAACCTCTTGCGTCTGGACCTATAATAACATCTGCATCTTTAG 

CTAACTCAGCCCATrrGATTMTTGTGTAATTTAGCACTTCTCCATTTGCTAAAAGGTGG 

Sequence 349 

tcttagggcgaattggagctccccgcggtggcggccgcccgggcaggtactaaaacaggt 

gctcctgttaaaatagaagatcttgcagctacttctagagatttaaattctaaacgatca 

atagcagcgtatcx;tgttccggctttmtmttccggctttggatttagatctttttaca 

act ggtcc gctgtagttatattcatccattccttcttcgatttcagctacaaccattttt 

acaattttatctgttggagccaaagcaagtcaagaagcacctttgtcataaaatgcagct 

ttatgaaaaccagttgccatmtmtgttagcttttccttctaatttcntgcaatatct 

mc attt gatamcatcacgcccaacgttagggagggtccattgtmcmcagtttttcc 

tccttttgctacatattcattcatttcnttmttgcagcatcttttgacatcataataaa 

ATCTGG 
Sequence 350 

TAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATTCTTCACTATCA 

CTGTCCTGTAAATTTAGTAGCCTTGGCTGGAAACACTGTAGTCGACATGATCTGATATTG 

CTTAATATTTCAGAAAGAGACAGTCTATTTTCACAATGTTTACTGGAAGCATTGGTCCGA 

GAGAMTTAGMGAMAGTCTATAGTTTGGGAAGAGCTTGAAAAACTATTCAGCATTTCA 

GGGTCTATCTGTTTCAGGACTGGGTCATGTTCTGTGGATATTCGGTCCATTATGACCCTT 

CCACCTCTGCCAATTCGCCTCCTTGCAAATCCTATACATCTTCTTGGGACTGTAAGTGTT 
GTAAGGC 

Sequence 351 

CTNCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGCTTGGTGACTGTA 
ATTAACAACMTGTATTTTGAMGTCACTGAGTAMTTTTMGTGGTTTTTCCAAAAAAG 
CACATAAGGTAATGCACCGTTAATTAGCTATATTGAGCCATTCCACAATGTATAGATATT 
TC AAAACATGTTAC ACAT GATMA TCCAGTTTTTCTACGTCATTTTTTAAAATTATATTT 
AAi i i I I i I ATTTTGAAAGTTTTTTCACAGATC I I I 1 1 I I I AGTATTATTACCTTCTGAT 

ATATGTGTCAWATTGMGMCCATACCTTTTAAAGGTATTATTTTGTAATTAAGGTATG 

TCAACAGTAAAAAATAACCAGTGGCCCAGGCCATNGGGGCTCATGCCTGTAAATNCCAGC 
ACTTTTTTCGGAGGCCCG 

Sequence 352 

CCGCGGTGGCGGCCGAGGTACCTGTGAAGACAGCTACACCTGGTTTCCTCCCTCATGCCT 

TGATCCCCAGAACTGCTACCTTCACACGGCTGGAGCACTCCCAAGCTGTGAATGTCATCT 

TCAAACAACCTCAGCCAGAGTGTCAATTTCTGTGAGAGAACAAAGATTTGGGGCACTTTC 

MAATTMTGAAAGGTTTACAAATGACCTTTTGAATTCATCTTCTGCTATATACTCCAAA 

TATGCAAATGGMTTGAMTTCAACTTAAAAAAGCATATGAAAGAATTCAAGGTTTTGAG 

GTCGGTTCAGGTCACCCAATTTCGAAATGGAAGNCATCGTTGCTGGGTATTGAAAGGTTT 
GT 

Sequence 353 

CCTATAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTTTGTTTTATATTCT 

TCAAATCCTATTGATGAAAAACTTAAAGATGAATTAGAAAAATCTTATGTAATTAGACAA 

AMCCMMAMCAAAMCTTATTTTAGAAAGACTTGGATAATGATTGATTATAAAAAAA 

TGATTGACCATACTTTATTAAMCCAGMGCTACATCTAMGATATTTTAAAACTTATTT 

CACAAGCTMAGAACATGGATTTAGAGGAGTTTGCATTAACTCTTCTTGAGTTAAATTAG 

CAAAAGAAAMCTTGCAAATACAG ATTTAG ATATCGTTTCAGTAGTTGGCTTTCCATTAG 

GTGCATCAMCACACAAACCMGGTTTTTGAAGCAAAATTAGCAGTTGAACATGGAGCTA 

CTGAAATAGATATGGTTATAAATGTGGGTAAATTCAAA 
Sequence 354

CTTAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGTGCATGAAACCACACA 

TACGGGCAAAAG CAGGAAGAG ACTTTGCAGACATAAAAACCAGGTAAGTTTTGAGAGTCA 

GGTG ATTGCA AG A I I I I I I I I I CTTTCATTGACACTTAGCTAATTTTGAGCCATATTTTT 

MATGTGTGTCTT TTCAG TTACTTGAAACCTCTTCTCCCACCTCCCATCCTTTCCAAACC 

TCCTCCAAGCCAATTTTCCAGTTGGTTTAAAAGAAATCATTAGGGTGACCCCCACTCCCA 

CTTTCTGGAACCTCAGGCAGAGCTCCTGGGGTTCCTCC 

Sequence 355 

NNI 1 1 1 1 I I II I I I I I I 1 1 I I I IGCGTGGAATGCTTCACCTCTTGCTAAAGGAACAGTAA 

TGTCATATCCTGTTAATCTTGCAGGAGGCGCTAATAAGAATTCAAATGCTTTCTCATTTC 

TCTTGT AATAATTT CAGTGAACTGAAAATGATTTAACAGCTTCGTGAACAACTAAAAGTC 

TTCCAGTTTTTTTAACAGAGTTAATAATTGTATCTGTATCTAAAGGAGAAATTGTTCTTA 

AATCMTTMTTCTACAGAGTATTCTCCATTTMTTGTTTTAAAGCAGCTAGTGCTTCGT 

GAACTTGTGCTCCATATGTTACTAATGTTAAATCAGMCCTTCTACTMTACATTTGCTT 

TACCMTTTCMCTTCATAMTTCCTGCTGGAGCTTCTTGTTTGAATGAACGATAAATTT 

TCTTAGGTTCTAAGAAAATAACTGGATCTGGGTCGTTAATAGCTGCGATTAATAATCCTT 

TTGTATCATAAGGAGTTGAAGGCATAACAACTT 

Sequence 356 

AGGGCGA ATTGGA GC TCCCC GCGGTGGGCGGCCGAGGTCTAAACTTTAATTAATGAGCTA 

ACGTCATA I I I 1 1 I AAGTTTTTCAATTCCGTTTAAAAATCCTAATTCAAGTAAAAAGATT 

ACTTTATGAACAACAGCACCTTGAGATTCGATTAAGTTAATMTCGCTTTCATTGTTCCG 

CAGAGCTAAMCATCATCMTAATTGCTACTTTTTGGCCTTTTTTCAACATATTAGTTTG 

GATTTCTAGAGTTGATTTACCATATTCTAAATCATACTCAAAACTAATAACGTCTCCTGG

TAAl I 1 1 I lAGGTTTTCTTACCATAATAAMGGTTTTTTTAAAAAAGCTGCAGTAGGTGT 

CCCAAACAAGAAACCTCTTGCGTCTGGACCTATAATAACATCTGCATCTTTAGCTAACTC 

AGCCATTTGATTAATTGTGTAATTTAGCACTTCTCCATTTGCTAAAAGTGGTGAAATATC 

TTTAMT ACAATCC CTTCAATTGGGAAATCTTTAACATCTCTGATGTATTCTATTAAATT 

GATTTGAI I I I I I I GCATTTCCCCGCGTACCTGCCCGGGCGGC 

Sequence 357 

GAATGGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAGATAGTCATCTCAGTAAAAGGTCT 

ATTATCTAACTTGCCAAACTTGTTTACTGAGAGCCCTAAGGAACTAAAACTGCCATAATG 

CCGTGCACAGCTTGAAMGCMTTAGAGTMGCMGATTAGTTTTTCCTCCCTTCCAGTT 

CCTCAGCAGGCCTGGCTGAAGGCCCAGGAGGGAAG 

Sequence 358 

AGGTACANGATAGTCATCTCAGTAAAAGGTCTATTATCTAACTTGCCAAACTTGTTTACT 

GAGAGCCCTAAGGAACTAAAACTGCCATAATGCCGTGCACAGCTTGAAAAGCAATTAAGA

GTAAGCAAGATTAGI I I I ICCTCCCTTCCAGTTCCTCAGCAGGCCTGGCTGAAGGCCCAG 

GAGGGGAAGGGAAATATAACGGAACCCAACAATTAANAAATAGGCAAATAGCCAATTAAA 

GTAAGGAATGGNCATCCCATGGGAGGCANCAACCA7TAATTTCTTGGGAACCCACTNTNT 

CCCNGGGATTCAGGGCTTCCATTTGCTTCACNGATGGCTTCACGTCTGGNGCAGCCCGCC 

MCTCTTACTTTGCCAGGAAACCCTCACCTCAACTTTGCCCAGGGTATTTCTNCCCCGGG 

TCTTGGAAANGAAAATGGGCTTCNTCCACCTGAAAAAGGGTTNGAATCCTTTCTTCCCAT 

TACCCAGGCTTTCCNTTTAAAGCCAMMGGCAAATTCCCTCCTTTTTTGGCTTTTCCT 
Sequence 359 

CTAATTGATCCTGNTCACATTCAGTGAAATGGCATTGCATATTTATATGTTGCTNACAGC 

TTATTGATTTAGGTAACTATTGTGTCTTCCTTCACTATCTGACCTGAAAAGCACTCTCTT 

CTCTATGCACTCTTATATTCTGCCTTTCTGCCTGGAGTTTGAAATACATGTCTCTTTAGT 

TTCTTTTGCACATGCTACATTGGGCTTTAGACCGGAGATAATACAGTGAC7TTACCTCAC 

AAATCATA TTCTG TCAACACAAATCTATGMTTTAGTTTATTTAAAATCAGAACAATTTC 

CTACAAAATTTTTCTGGAAAATAGACTCCTAACAGACCTACCAGAATCATGCTTAAAGTG 

CTCCCTTGACACTTATTCTATACTGAAGGATAAATTTTAAA 
Sequence 360

CCGCGGTGGCGGCCGAGGTACAAGATAGTCATCTCAGTAAAAGGTCTATTATCTAACTTG 

CCAAACTTGTTTACTGAGAGCCCTAAGGAACTAAAACTGCCATAATGTCGTGCACAGCTT 

GAAMGCAATTAGAGTAAGCMGATTAGTTTTTCCTCCCTTCCAGTTCCTCAGCAGGCCT 

GGCTGAAGGCCCAGGAGGGAAGGAAATATAAGAACCAACAATAAAAATAGCAATAGCAAT 

AAGAAGAATGCCATCCCATGGAGCACACCATAATTCTGGAACC 

Sequence 361 

TNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTTTGTTTTATATT 

CTTCAAATCCTATTGATGAAAAACTTAAAGATGAATTAGAAAAATCTTATGTAATTAGAC 

MMACCAAAAAAACAAAAACTTATTTTAGAAAGAC7TGGATAATGATTGA7TATAAAAA 

MTGATTGACCATACmATTAAAACCAGAAGCTACATCTAAAGATATTTTAAAACTTAT 

TTCACMGCTAMGMCATGGATTTAGAGGAGTTTGCATTAACTCTTCTTGAGTTAAATT 

AGCAAMGAAAAACTTGCAMTACAGATTTAGATATCGTTTCAGTAGTTGGCTTTCCATT 

AGGTGCATCAAACACACAAACCAAGGTTTTTGAAGCAAAATTAGCAGTTGAACNTGGAGC 

TACTGAAATAGATNATGGGTATAAATGGTGGGTAAAT7TCAAA 

Sequence 362 

CCGCGGTGGCGGCCGCCCGGGCATGGTACAGCCTCTCGGCCCGGCTAAACATCATCGTCT 

TGGTAGGCCATTACCATACCAACTAACTAATGTTCCGCACCCCCATTTTTAAGTGAAGCT 

GTGAAGCTCCTTTCTATTACTCATCATGCGATAAATAACTATATCCGGTATTAGCTATTG 

T7TCCAATAGTTATCCCAGTCTTAAAGGTAGGTTAGGTACCT 

Sequence 363 

CACTACTATAGGGNTNATTGGAGCTCNCCGCGGTGGCGGCCGTGCTGTTGCTTGGCCGCG 

CGCCAGGCTGGCCAGAGGTGCTGTTCCACTGGGGCGTGGCCGTGATGGTGTGCGCCGATC

GACGTTCTGTTTTAATGAAAGGATGGACATGCAACTGACACTCGCACAACTCTTGGCAAT 

CATGCCCAATGCCCGCTCCAAAGCGGGTATTTTTTTGCCTGCGCTAAACGTGGCCATGGC 

GGAATTCGGTATCAACACGTCGGCGCGCCAGGCCGCGTGGCTGGCCACCATCGGTGTCGA 

GTCCGGTAGCCTGCAGCGGGTAGAGGAAAACTTGAACTACCGCGCGGATCGCCTNCTCGT

TATTTTCGGAAAATACTTCACGCCGGCGTTGGCCGCAGCTTATTGCCGGCAAGCCGGAAA 

TGATCGNCAACCGTGTTTACGCCAACCCGCATGGGGGAAACG 

Sequence 364 

AGGTACTAAACTTTAATTMTGAGCTMCGTCATAT7TTTTAAGTTTTTCAATTCCGTTT 

AAAAATCCTAATTCAAGTAAAAAGATTACTTTATGAACAACAGCACCTTGAGATTCGATT 

AAGTTA ATAATC GTTCATTGTTCCGCCAGTAGCTAAAACATCATCAATAATTGCTACTTT 

TTGGCC I CMCATATTAGTTTGGATTTCTAGAGTTGATTTACCATATTCTAAATC 

ATACTCA AAACTAATAACGTCTCCTGGTMTTTTTTAGGTTTTCTTACCATAATAAAAGG 
1 1 1 1 N lAAAAAAGCTGCAGTAGGTGTCCCAAACAAGAAACCTCTTGCGTCTGGACCTAT 
AATMCATCTGCATCTTTAGCTMCTCAGCCATTTGGATTAATTGTGTAAATTTAAGCAC 
TTCTCCATTTGCTAAAAGTGGTGAAATATCTTTAAATACAATCCCCTTCAATTGGGGAAA 
TCTTTAAACATCTCNGGATGGTATTCTATTAAMTTGMTTGMTTTTTTTGGC 
Sequence 365 

CCGCGGTGGCGGCCCGAGGTACCAAAATAAAGGGTATTTGCTACCT7TAATACTTGCCAG 

TTCAGGTTGGAGGCACAGGCAGCAGCAAGAATGGAAAGAAATGTTCTTACAACATTTTCA

CAGGAAATGTCCCAGTTAATTTTGAATGAAATGCAAAAGCTGAATATTGCAGTTTATTCA

ATGATTTTGTTGMTCTGAATTTTTTTTGATTGATGGGGATTCATTACTTATCACATGTA 

TCTGTGAGATATCATTTAAGCCTGGGCAGMCCTCCATTTCTTCTATCTGGTTGAACGCT 

ATCTTGTGGATCTTATTAGCAAAGGAGGACAATTCACCATAGTTTTCTTCAAGGATGCCG 

AGTATGCNTATTCMCTTCCCTGGACTTCTTTCTTTGAGAACTGCTTTAATTCTTCATCT 

NCAGAAAGAATACCCCATTTGATGTTCGAACAACATTTTCGAGATGCTTATCAAAAAGAG 

TGGGGAAAGTTTCTTTGGAAGANGAGTTACCCCATAT7TNCTGATTGTTGGCAGACGAAA 

NG CCTTG AACG ATCTACAAAACCNC AG CTTTTTAAACTTTTTAAATCNTTC AATTCCTTG 

GGGCAAAGGGAANGNTNAACNTTTGGTACCTTGCCCGGG 
Sequence 366

CCGCGGTGGCGGCCCGCCCGGGCAGGTACGCGGGGAAATGCAAAAAAATCAAATCAATTT 

MTAGMT ACAT CAGAGATGTTAMGATTTCCCAATTGAAGGGATTGTATTTAAAGATAT 

TTCACCACTTTTAGCAAATGGAGAAGTGCTAAATTCACAATAATCAAATGGCtGAGTTAG 

CTAAAGATGCAGATGTTATTATAGGTCCAGACCCAAGAGGTTTCTTGTTTGGGACACCTA

CTGCAGCTTTTTTAAAAAAACCTTTTATTATGGTAAGAAAACCTAA 

Sequence 367 

AGGTACAGCCTCTCGGCCCGGCTAAACATCATCGTCTTGGTAGGCCATTACCCTACCAAC

TMCTMTGTTCCGCACCCCCATTTTTAAGTGAAGCTGTGAAGCTCCTTTCTATTACTCA 

TCATGCGATAAATAACTATATCCGGTATTAGCTATTGTTTCCAATAGTTATCCCAGTCTT 

AAAGGTAGGTTAGGTACCTGCCCG 

Sequence 368 

ACCGCGGTGGCGGCCGAGGTACGGCCACACTGGGACTGAGATACGGCCCAGACTCCTACG

GGAGGCAGCAGTAAGGAATTTTCCACAATGAGCGAAAGCTTGATGGAGCGACACAGNGTG 

CAGGATGAAGTTNTTCGGAATGTAAACTGCTGTTATAAGGGAAAAAAANAAAAAAAAAAA 

AAAAAMAGGTNCCTGCCCG 

Sequence 369 

GGCGGCCGAGGNACAATATAGNCATCGCNTTAAACNGCCNANTNTTAANCNCGCCAAACT 

TGGTTACTGAGAGCCCTAAGGAACTAAAACCGCCATAATGCCGGGCACAGCTTGAAAAGC 

AATTAGAGGAAGCAAGANNAGNNNTTCCTCCCTTCCAGNNCCTCAGCAGGCCTGGCTGAA 

GGCCCAGGAGGGAAGGAAATATAAGAACCAACAATAAAAATAGCAATAGCAATAAGAAGA 

ATGCCATCCCANGGAGCACACCAAAATTCNGGAACCACCNCTCCCGGANCAGGCNTCCAT 

TGNTCACGAAGCTCACGCNGGGCAGNCCGCAACTTTACTTTGNAGNAACCTCCCCACTTG 

GC CAAGG GAATTCNCCCCCGGGCCTGGAAGAAAAGGGNTCTCCCACCCGGAAAGGGGCGN 

ACCTTTTCCCAAMCCAGCCTTTCCTTAAAGCNAAAAGCAMCCCNCTCTTTTGGGTTTC 

NCAAAGGGGGCNGNACAAAAGGGAAGGGTTTTGGGGCNGGGGGGGGGAAACAAAANCCCC 

NCATTNGGAAGNTTGCCCCCGGCCGAGGGGAAGGGGAAAAGGTTGGNCCCCGGTTGGGGG 
GGG 

Sequence 370 

ATTGTTGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACANGGAACTGCCAAAGGCAACA 

GAAATTCTTTCTCCCTATGTCCCAGCCTACCCCCACTTTACCGAGGCCAACAGCCGCCTC 

AGAAACCAGATTCAGGAGCTAACATGCCCCAGGTCTCACGAGGATCAGAGACTCCAGAGG 

CCAGGGAAGGAGANNAAGGTAGNCAAGCGGGGGTGGTCTCAAATCTGGTTGNGCTCGAGC 

TATGCAAATGCCTCTCATGGAGATGCGAGGACCTATCTATTATGATGACCAGGGCCACAT 

CCGGAGGGGGCAACAGACTTTCATNTATCAAGCCCTT 

Sequence 371 

AGGTACTATTGACTAAAGTCAGTTGGGGGAGAGAGAGGCGGAAGTATATTACTTTTATGC 

TTGGTTATACTAGAGAACAAATAGAAACTGACTAAAGAAACATTAGATCAGTGGTTCTCA 

GAGTATTGATATCTGGGAGTCCCAGCAACAGTCTGAGGAGGTTCATGAGTTCAGAATATT 

TTGATAATAACACTAAGATGGTATTTACTCTTCTAACTGGGTAGATATTTGCACTGGT 
Sequence 372 

CCGGGCAGGTACCGGGTCTAGGGAAGATGCAGAACACTTCAGCCTGGCAGAAGGCTCTAA 

AGATCGTCAACCTTTCCTCTGTTCATTTTGAAATTTTTAAAAATATGCTAACTTAGCTGA 

AAATCTCATGGAAACCAGGCTCCTCATCAGACTTGAAAGTCAAACCGGTTTCTCAACAAC 

TTCTCTTTATGGTTCTGTATGGCTCCACAGAAAACCAGAAAAACATTTGGGGCAAGAAGC 

TATGACTCTGTGAGGCCACATGGGAGCAGGCAGTCAATTATTACACTAAGGAACACCCAG 

TTAGCATGAAGTATCCCATCACCTCGGTATTAAGCCCTGCATGCATTAGCTATTACCT 

Sequence 373 

AC I I I 1 1 1 I I I I I I I 1 1 GCTCAATAGAAGTATGGAATAATTCCAGGTAATTTAAAGCATA 
1 1 1 l l CAATTGGTGTAAGCTGCTCCATGAAGTCAGCTAGCTCCTCTAATTGGGATGGTTC 
TTCATCACACGGCATGTTCTCAGAGTCTGATGACAGAGCATCAGTGTGTGGTCCCAGCAC 
CCCCTCCTGTGCGGACTTCTGGGCATCCTCCTCCAGATACTCAATACTCTTGAGGGCCTG

AGGMAGTCTCTATGAAAGGTCTTGCAATTTTGGGTGCAATGGTTTCCGTGACAGAAGGT 

TCCTGAGAAAGCACCACAAACTCCTCAGCTTTGACCGGGAAGCCAGCATCACGGACGCGT 

GGGTCGAAGCTTGACCT 

Sequence 374 

CCGGGCAGGTACCGCTACTGAAATTATTAAACATACACTACAGATCAATTATATAANTAT 

GTTAATATCTTTAGAAATCAAGAGTTGCAGCATAAGAGAAAGGGATACAAAAACAAAACA 

AGCAAAGAAGTTACATAAAAACGTAACGTTGTATTGGAAAAACCAGTATGAACTTATGAT

TTAGTTTTCTTTCTAAAAACGGACGCGTGGGTCGAAGCTTGACCT 

Sequence 375 

NGCGNTGCGCTCACCTGCCCNCTTTCCANTCGAGGNAAACCTGGTCGTGCNAGGGTGCNA 

NTAATGA ATrC GNCCAAACNCCNCCGGNNGAGNAGGCGGTTTTGCGTNATTGGGGCCGCT 

ATTCCNCTTTTCTCGCTCACCTGACTTCGCTGCCGCTCGGTCGNTCGGCTTGCCGGCCGA 

AGCCGGGTAATTCAAGCCTCCACTCAAAAAGGGCCGGGTAAATTACCGGGTTTNTTCCAC 

CAANGAAATTTCAAGGGGGGGATTAAACCNCNAAGGGAAAAANGAAACATTGTNNGANNC 

AAANAAAGGGCCCNNNCAAAAANNGGGCCCANGNNGAACNCNCCGTAAAAAAAAAG 

Sequence 376 

CCGGGCAGGTACACTCCAGCTCCTCTATCCTTTGTTCCAGGTGAGCCTTNTCATTAAATG 

CTGTCTCTTGAGCAATCTTTAGCCTCTCCCTTAGACTATCTCGTTCTGTGGTCATTCTTC 

GTAAATCAGTAAAGGCTACATCTCTTTCAGTCTCCACTCGCCGGAGAATAGCATGTGCCG 

TTGTTGATTTAGGACTCTTACAGCTTTTCATCATTTCTCGTCGAAGTCGGGTAATTTCTT 

CCTGTGCCTGTTCATAAAGAAGGAAGATCTTGTCTCTCTCAGATTTAAGAACCTTGACAT 

TACCCTGAATTTCTGCCAAATCGGACGCGTGGGTCGAAGCTTGTACACCTnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnAAAATNGCNCNC 

TGGGGCNAAATCNNGGGAAAATANGCTGTTTTCNTGGGAAAAAAATTGTTTTNCCCCGCA 

ANAAAAATCCCCCACAAAAATTNTANNGNNNGGGNGAAAAAAAAAAGNAAAAAANNGNGG 

GGGGGGGCCCCCAANNGGGGGGNNGCCCNCNCCCCTNATANTTNATGGNGGGGGGGGGCC 

CTCNANNNGGCCCCTTTTTNIVINNAANNGGGAAAAACCTTNGGCGGGNCCCNTTTNTATNA 

AN AAAAA ACC CNCCC CCCCCCCCNNGGGGGGGGNGGGGNGGTTTNGNTTATTGGGGGGCC 

CNTTTTTCCNNTTTTCCNAANAAAAANAAAANANNCGGNGNCCNGGAGGGGGAAANGNGG 

GGGGGGGG 

Sequence 377 

CGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTCCGTGATG 

CTGGCTTCCCGGTCAAAGCTGAGGAGTTTGTGGTGCTTTCTCAGGAACCTTCTGTCACGG 

MACCATTGCACCCAAAATTGCAAGACCTTTCATAGAGACTTTCCTCAGGCCCTCAAGAG 

TATTGAGTATCTGGAGGAGGATGCCCAGAAGTCCGCACAGGAGGGGGTGCTGGGACCACA 

CACTGATGCTCTGTCATCAGACTCTGAGAApATGCCGTGTGATGAAGAACCATCCCAATT 

AGAGGAGCTAGCTGACTTCATGGAGCAGCTTACACCAATTGAAAAATATGCTTTAAATTA 

CCTGGAAATATTNCATACTTTNTATTTGNGGCNAAAAAAAA 

Sequence 378 

GCCGGCCCGAGGACTCCAGCCTGGGCAACAGAGCAAGACTCCATCTCAAAAAAAAAAAAA 

AAAAAAAAAGTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnGGGGGCCCCGGGTACCCAAGCTTTTG 

TTCCCTTTAGTGGAGGGGGTTAATTTGCCGCCGCNTTGGGCGTAAATCAATGGGTCATTA 

GCTGTTTCCTGNGNTGGAAAAATTGTTATCCCGCTCACCAAATTCCACACCAAACCATTA 

CCGGAAGCCCCGGGNGNAGCCATTAAAAAAGGTGTNANANAGCCTTGGGGGGGGTGCCCT 

ANATN GGAA GGTGGAGCCTAAACCTCACCATTNAATTTGGCCGTTGGCGCCTCACCTTGC 

CCCGCTTTTCCAAGGTCCGGGGGGAAAAACCTGGTCCGNGGCCAAGCTTGCAATTAAATT 

GGAAATTCGGGCCCA 

Sequence 379 



GAATTGNAGCTCCCCGCGGTGGCGGCCNNCCGGNCAGGTGGAAAGGTGGGTGGGGAGAGG

GAGGCTTATTTGTTGCTGCAGTGTAACTAAGTGAAACCTAATTCATATGACTCAAACTAA 

GGTATATTTGGTTAGATCTAGGTGAGTTCTACTTTAGAGGAAATCCTGGTAACTGTTGTT

TGTTTGTMGTTATAGCTGTAATTAATTTTCCCTGTATTCAAAGCCCCCAAACCCTGCAT 

TCAGATACTATGCATTTAGACTTCCTTAGGCAAAGTCAAGGCAACAAGCTGATGATTCTA 

AGCTATTATTCAAGGAGTATCTACCATCATAAAGGTGGTTTAAGTCATATAGGATAATAT 

CMTCAATMTACAGGGAGATGGCMAMTTTTTTGGGNAAANCCCAATATTANCTTGGG 

TTTNATGACCCCCNAATCTCACACTTTGGGGNCNTATGGGAAAGGCTTTTTTTAAGACCC 

GGGAGTTCAAGACCNGCCCTGGGCAACATTAAAAAACCTCCTTTNNNCCAAAANCTTTAA 
AAAA 

Sequence 380 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTTACAAGTCGACCCAC 

GCGTCCGCTTCAGMTATCCMTTCATGTGMCTACAGGAMTTATAGTTTAGATATTTT 

TAAATGATTTGCCTGTCACCGTATAACACAAGGGTGTCATGACCAAGCTAGATCTCTTTA 

CCATATCATTAATAAAAGTCAAATTTTAAATTTGTGCCCAATTTGGCTGGGTGTGGTGGC 

TCATTCCTGTGATTCCAGCACTTTGGGAGACCT 

Sequence 381 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCC 

GCATTTATTAAGGCTTGTATATGTTCAAGATCCAGTGAAGACTGTCTTGGGCGTGTATAA 

TTGATCTTAACCACAAGGCTGAGAAGTTATGTGCAGGGCTTATGATGCTACTTCCAAAGT 

ATTAAATCCTCCAGAGAAGCCTGTAGTGTGGGATGCAAACTATTTTAAGTGTGACCATGA

GGTGTTTTTTTGTGGACCATTTTAAAGCCAATGATAGGTTCTAAAGCAATCTCAACCTGA 

GTTAGGTAGAATGGGTTGGTTATCTGCACTCTAGCGGCCCTTCATAGCTATTGTATTCTG 

GATTTCAATTCGGCACTTTATGTATTAGCTAAAMTTTCATGACCAGATCTTTTGAAGTA 

TACAAAGTAAATCTTCAAGGTGGATAGTTTATCCAAGTGTAAAATGTGTTGCACTAGGTC

AGCTTGGMTTTTGAGATGACTTTTGGCATCATTGCATACATCTGGTTTGTGTACCTGCC 

CGGGCCGGCCGCTCTAGAACNGTGGATCCCC 

Sequence 382 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTATTGACTAAAGTCA 

GTTGGGGGAGAGAGAGGCGGAAGTATATTACTTTTATGCTTGGTTATACTAGAGAACAAA 

TNGAAACTGACTAAAGAAACATTAGATCAGTGGTTCTCAGAGTATTGATATCTGGGAGTC 

CCAGCAACAGTCTGAGGAGGTTCATGAGTTCAGAATATTTTGATAATAACACTAAGATGG 

TATTTACTCTTCTAACTGGGTAGATATTTGCACTGGT 

Sequence 383 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATATCTGCATATACACCAT

TMTTTTACATCGTTGAGGTAGCCAAAACTGCTCGTAAGTGGGCTTTTATTAAATAATAT 

AATGTTCTTAATAGAGGAAAAAGGAATTGAATACATTTTTAAAAACAAAATAACAAAACC 

AATCCATTGTCCCACAAAAGAAAATCAGTGGAGACAAAAGCAGTTTAATTTGCTGGATTC

TTTTGTGGCTTATTTTTTGAGTATTATTTACAAMTGTTAGACTMTTTTTAAGCAATAT 

TAATAATAAGCAACATACAACTCCAAGAATAATATAATAAATAATAAACTGCGGACGCGT 

GGTTCGAAGCTTGCTCGNNGGGGGCCGGGNCGCTTCNAGGCCCNCCCGGGCAGGTACCCA 

GTNATCACATAMTTCTGCMTCATNTGGNTATTNAGCTTNACNTGNTTTTTTTATTTGN 

NGAANTTGTTGTTGTATTCAG 

Sequence 384 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTTACAAGTCGACC 

CACGCGTCCGCTTCAGAATATCCAATTCATGTGAACTACAGGAAATTATAGTTTAGATAT

TTTTAAATGATTTGCCTGTCACCGTATAACACAAGGGTGTCATGACCAAGCTAGATCTCT 

TTACCATATCATTMTAMAGTCAMTTTTAAATTTGTGCCCAATTTGGCTGGGTGTGGT 

GGCTCATTCCTGTGATTCCAGCACTTTGGGAGACCT 

Sequence 385 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCA C I I I 1 1 I 1 1 1 1 1 1 I 1 1 I 1 1 
TTAAGAAATAGGGTCTCACTCTGTCCCCCAGGCTGGAGCCATTATAGCTCACTACAGCTT

CTGACTCCTAGGCTCAAGGGATCCTGCCACCTCAGCCTCCCTGGTAGCTGGGACTATAGG 

CAGGAGATCGCTTGAACCGGGAGGCGGAGGTTGCTGTAAGCTGAGATCGCGCCATTGCTT 

TCCAGCCTGGGTGCCAGAGCAAAACTCTGTCTCAAAAAAAAAAAAAATAATAATAATAAA 

TAAATAAAAAGGCAAGGAATATAGGGAAAAGTCAAAAGAGATGGACTGTGAGAAGACTGG 

GAAAGCCAGAAGAATGGNGGAAAATGTAGCATGGAGTAAGACAATAAAAATATAAGAGGA 

CTCATTTCGGACGCGTGGGTCGACTCACCTCGGCCGCTCTAGAACTAGTG 

Sequence 386 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTTACAAGCTTCGACC 

CACGCGTCCGGGAAATTTTAATTAAAAATAGGTGAACATTTTAAATGACCTAATACATAT 

TTAGTCCACATTGAMCTTTGGCATTTTGTCATTGCCATTAAMTTTTGATGGCATTAAA 

ATTTGATGCCATTAAAATTTTGATCAGTAGGTAGCATTTTTTTCTTAGCTACMTTGTTT 

TTTTTMTTATMGTA TTAAA MTTCATGMGATGATTCTTTTTGTAAMCAGTTTTGCA 

TAAAAAGTAAGTCTCATTTTAAAGCAACTACCAACTTACTGGCCACCT 

Sequence 387 

AGGTCTTCGACCCACGCGTCCGATGGTTTTTGCAAAAATTGAAAATGCATCGATATTACA

GTTMTTTTTTCAGTGTGTATGTGGTATTAGGCTTAGMCTATMCACAGGMGTTTTTA 

GAGTATGTCCACTCTGGTTTACTCCTTTGTAAGTATTAATACCTGATAATTTACATCCTA

CAGCCCTGCCI I I I 1 1 I I I I 1 1 I I I CAAGTTTGTCCCAGCAAGTCTTGGCCCTTTGCATT 

TTCTTAATACATTTTAGTACCTGCCCG 

Sequence 388 

CCGCGGTGGCGGCCGAGGTACAAAGAACAAAGGGAAGCTAAGGAAGAAAAGATAGTCAAT 

AAAAGATGTCTCATCTGGGCTTAGTGGCTCATGCCTGTAATCCCAACACTTTGGGAGGCT 

GAGGCTCGAGGACTGCTTGAGTCCAGGAATTTGGGCAAGTAGGAAATTACTGAACAGCTG 

CTATCACAGACAAATGCCTAACATTGTGAAGTGCTACACAGGGGAAGGAGACCCACGCTA 

AGAGGAGAGCATGCACCCAGACACAGAACTCAGAGGACACAGTTCAAAACACACATACAA 

GAGGCTTAGGCACCTGTGGGCGTGTGTGTGCTCACAGCCAGCAAAATGAAAAAAAATCCC 

AGCTCTGAAGGAGAGGCAAGTGCATGGCTTCCGTACCTGCCCG 

Sequence 389 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGA
TGGTTTTTGCAAAAATTGAAAATGCATCGATATTACAGTTMTTTTTTCAGTGNGTATGT 
GGTATTAGGCTTAGMCTATAACACAGGAAGTTTTTAGAGTATGTCCACTCTGGTTTACT 
CCTTTGTAAGTATTAATACCTGATAATTTACATCCTACAGCCCTGCC I I I I I I I 1 1 I I I I 



TCAAGTTTGTCCCAGCANGTCTTGGCCCTTTGCATTTTCTTAATACATTTTAGTACCTGC 
CCGGGCGGCCGGCCGCCCGGGCAGGTACNACTACCTCTTTAAAGTTGTCCTTATTGGAGA 
TTCTGGTGTTGGAAANAGNAATCTCCTGTCTCGATTTACTANGAAATGAGTTTAATCTGG 
AAAGCAAGAGCACCATTGGAGTAGAGTTTGCAACANNGANGCATCCAGGTTGATGGAAAA 

Sequence 390 

TCCCCGNGGTGGCGGCCGAGGTACTATTGACTAAAGTCAAGTTGGGGGAGAGAGAGGCGG

MGTATATTACTTTTNTGCTTGGTTATACTAGAGAACAAATAGAAACTGACTAAAGAAAC 

ATTNNATCATTGGTTCTCAGAGTATTGATATCTGGGAGTCCCAGCAACAGTNTGAGGAGG 

TTCATGAGTTCAGAATATTTTGATAATAACACTAAGATGGTATTTACTCTTCTAACTGGG 

TAGATATTTGCACTGGT 

Sequence 391 

AGGGCGAATTGGAGCTNNCCGCGGTGGCGGCCGAGGTCTTCNACCCACGCGTCCGATGGT
TTTTGCAAAAATTGAAMTGCATCGATATTACAGTTMTTTTTTCAGTGTGTATGTGGTA 
TTAGGCTTAGMCTATMCACAGGMGTTTTTAGAGTATGTCCACTCTGGTTTACTCCTT 
TGTAAGTATTAATACCTGATAATTTACATCCTACAGCCCTGC CI I I I I I I I I 1 1 1 I I I CA 
AGTTTGTCCCANCMGTCTTGGCCCTTTGCATTTTCTTAATACATTTTAGTACCTGCCCG 
Sequence 392 

CCGCGGTGGCGGCCGCAC 1 1 1 1 1 1 1 I I M I 1 1 I I I I ACCTGAAAATGCTTATTCTAGCTT 

CACATTTGATTGTTTGGCTMGMGAAAATTATTTATTAGACTTAATTTTCCTCACGAGT 

TTAAAGATTGCTTCAGATCTTAAACTTCTAATGAGGAAAGCTGAGAAGTCCAATGCCATT 

CTGATTCTTGCMCTTACMGTAGTCTTTTTTTGTCTAGACGCTTTCAGGACCTTCTTTT 

TTCCTCAGTCAGTGTATCCAAACCTTCACAGTGATATCTTTTGGGTACCT 

Sequence 393 

CCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGCATTTATTAAGGCTTGTATATGT 

TCAAGATCCAGTGAAGACTGTCTTGGGCGTGTATAATTGATCTTAACCACAAGGCTGAGA 

AGTTATGTGCAGGGCTTATGATGCTACTTCCAAAGTATTAAATCCTCCAGAGAAGCCTGT 

AGTGTGGGATGCAMCTATTTTAAGTGTGACCATGAGGTGTTTTTTTGTGGACCATTTTA 

AAGCCAATGATAG 

Sequence 394 

GGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTCTTCTACCCACGCGTCC 

GCACATTTTGATGGTCAGTCAATAACTTAAGCAGTTACCAAAATACTAGGTATCCAAGGA 

GCGAGAGGTGGGCGAGCATAAGAAACACATTTCTCATGGCACAGCTCTGCCAAAGCCCTG 

CAGAATCATTTACACATAGGTCTTTGGTTAGTAGCCCCTGGCACAGAATTCTGATCTTAA 

ACAAATATTGTCTATAATCAAGTAGAGCAATGCAATTAAAAAAAAAAAGCACAGGTTTTG 

GGGCCATGCTGAAATCCCAGCCTTGCTATTTGCTGGCTGTGTGACCGTGGTTCCTTGGTC 

TCATTATGCTTTGGTTCCCGTATCTATAAAACGGACGTAATAATGTCTCCCTCTCATTAT 

TGTGAAGTCGAAATGATGTCTGTAAAGTGCCCAACACAGTACTAAAGGGCTATT 

Sequence 395 

CCGCGGTGGCGGCCGCCCGGGCAGGTCAAGCTTCGACCCACGCGTCCGTTTAAGTAACAT 
T CAGAT TTGTGTGT GTGG AGAGGTTGTAGGGAACAGAATTGTAGGAAGGTGCTCACACCT 

G GTTTGTTTGTTTTATGTATATATGGTGGGTAGAAAATAAGGATTAATGAATGCA 

GTMGGTATTTGAGCACTCTTGTTTATCTTGTGTAGGTGCCAACCAAATATTTTTATAGA 

GATGTGGTTAAGCCTCTTGGCATGTTCAACTGTGTACCT 

Sequence 396 

CCGGGCAGGTCTCTTGTCTAGTATACTCAAGGCAGCCTAGTAAATTATTATTTATCTATA 
CAATACTGGAAAAACTTGTAGACAAAAACATGACTTGAATTGCTAAAAAAAAAAAAAAAA 
NGANGGAGAATGAAAACTTCCGGACGCNTGGGTCGAAGCTTGACCT 
Sequence 397 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCGCGCGTCCGCATTT 

ATTAAGGCTTGTATATGTTCAAGATCCAGTGAAGACTGTCTTGGGCGTGTATAATTGATC 

TTAACCACAAGGCTGAGAAGTTATGTGCAGGGCTTATGATGCTACTTCCAAAGTATTAAA 

TCCTC CAGAGAA GCCT GTAGTGTGGGATGCAAACTATTTTAAGTGTGACCATGAGGTGTT 

TTTTTGTGGACCATTTTAAAGCCAATGATAGGTTCTAAAGCAATCTCAACCTGAGTTAGG 

TAGAATGGGTTGGTTATCTGCACTCTAGCGGCCCTTCATAGCTATTGTATTCTGGATTTC 

MTTCGGCACTTTATGTATTAGCTAAAMTTTCATGACCAGATCTTTTGAAGTATACAAA 

GTAAATCTTCAAGGTGATAGTTTATCCAAGTGTAAATGTGTTGCACTAGGTCAGCTTGGA 
ATTTTGAGAT 

Sequence 398 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCGCCATGTTGGCTGGGCTGGT 

CTCGAAC TCCTGA CCTCAAGTGATCTGTCCTGGCCTCCCAAAGTGTTGGGATTACAGGCT 

ACTAAAGCi I I I I A TTTAT TTCTGTGGATTTGAGTTATTGTGTAGTGTCATTTATTTTTC 

A TATG MGGATTCCTTTTGGTATTTTTTGAAGAACTTCTTTATCAGAATTAATTATCTCA 

CTTTTGAAAATTTTTMCAATCTAGAAACGTCTTTTTTGTTTTTTGMGMTAGTTTTCT 

GGATGTAAAATTTTTGGTTMTTTTGTTTTTCTTTGAACACTGAATATTTAATCTTATGC 

CTTCTGGCTTTCAGTACCTGCCCG 

Sequence 399 

CCGCGGTGGCGGCCGAGGTAGCTTGAGTCGACCCACGCGTCCGTTCAGATCCGTTTCAGA 
AACGTGAGTCTCTAGCTCAGGAGATTTCCACAACTGTCCTTAGTAACCTGATCTTATTCT 

CATGTTTAACCTTGGCAGTGGGAAGTTCTTCCTGGTATCCTGCCTAATTTACTGGAGTTG 

GCATTAATGCCATTTCCCCCTAAGGCGTGGCTCTTGGACCAGTATCACCTGAGAATTTGA 

TAGACATAGACCCAGAGTTACTGAGGCAGGTGCTCTGTTTTGGGGACCAGCAATCGGTGC 

TTTAGCAAGTTCTTTGGGTGATAGGGTTTGGAAACTACTGCTCTAAAGCATCATCTGTTT 

TGACTTTGCCATGCACAATCTGAACTCACTCCCGTGAGGCCCTGCTCCTGATACTTTAAA 

TCGTCCTGTCTC I I I I I CTGCCTCTCTGTGGAG 

Sequence 400 

CCGGGCAGGT ACAGGC ACCTATAGAATTTMAGGGGAGATTTCTTTATTTTGTATTCAAT 

GTATTAATMGATTTTTMAACATATTTTGGAGAAATTGCTAATTAGTGTATAATCCTGA 

TGCCAATTCTAAAAAACCI I I I I I I 1 1 I I I GTAGAGACAGGGTCTTATTCTGTCACCCGG 

GCTGGAGTGCTCTGGTATGATCCTAGTTCACTGCAACCTCAAATACCTGGTCTCAAGCAA 

TCCTCCCACCTC AGCCT CCCCAGTAGCTGTCTCTATAAGCATGCACCACCACACCTGGCT 

AACCTTCTTATTATTTTTGGTAGAGACAGTCTCACTATGTTGCCCAGGCTGGTCTTGAAC 

TCCTAACCTCAAGCAAACATCCCTCCTCGTGCTCCCAAAATGCTNGGATTACCAGCATTA 

AGCCTTACMGCATMGCTACCATGGACTGGCTTTCNMAAMTATTTGGTTTAAAATTC 

Sequence 401 

ccgcggtggcggccgcccgggcaggtggaaaggtgggtggggagagggaggcttatttgt 

tgctgcagtgtaactaagtgaaacctaatnnatatgactcaaactaaggtatatttggtt 

agatctaggtga gttct actttagaggaaatcctggnmctgttgtttgtttgtaagtta 

tagctgtaattaattttccctgtattcaaagcccccaaaccctgcattcagatactatgc 

atttagacttccttaggcaaagtcaaggcaacaagctgatgattctaagctattattcaa 

ggagtatctacca tcataa aggtggtttaGtcatatagataatatcaatcaataatacag 

gagatggcaaaaattttttgtgaagagccagatagtanctgagtatgatgacccctaatc 

tcagcactttgggaggctgatgggagagggtcatttaagaccaggagttcaaagaccagc 

ctgggcaacattaaaaaactccatttctaccaaaaactttaaaaaaaattagc 

Sequence 402 

GCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTCCGGGGAGGACCTAGGCAACGGCC 

TGAGACTCCGAGACTCTATGTTGAAGATGCCTGGACTAACCTACTGAAGATACGTGGTTT 

TACCAACAGCCAGCACCAATAGGAAGATATGAATGAAGCCATCTGAGACCAGCCATCTGG 

CAGCCAAACTGCCAACTGACTGCAAATGCATGAATGATCCCACTGACACCACGTAGAGCA 

CAAATGAGT TGCCT CCACTGAGCCCAGCCCAAATTGTTATCCTATAAAATCATAAAAACA 

TAAACAGTTGTTTTAAGTCAAAAAAAAAAAAAAAAAAAGTGCGACCTGCCCG 

Sequence 403 

TACTATAGGGCGAATNGNAGCTNCCCGCGGTGGCGGNCGAGGTATTCAACAAGGGCCCTG 

AGAGAGGGACAGGCAGCCCCTGTGAATCTTGCTGTTCAGCAGAGACAGGAGTCAGCACGT 

GTGAGGGCAGCAGGGMGTCTTCCTGGAGGAGTGAGACCTGGCGATGAGGAGGCACGGCA 

GGGAGGTGGAACAGGCAGGAGAGACTCTTCAGGAATTGAGGAGATAGAATAGAGGACACT 

AAAGCCTTAGAGAGGCCAGGGGTGGTGGCTTGGCAGGATCATCGCTTGAGGCTAGGAGTT 

TAAAAGCAGCCTGGGCAACATAGCGAGACCCCATCTCTAAACACAAAAAATAAAAACCTG 
CCCG 

Sequence 404 

CCGCGGTGGCGGCCGAGGTCAAAGCTTCGACCCACGCGTCCGTGATGCTGGCTTCCCGGT 

CAAAGCTGAGGAGTTTGTGGTGCTTTCTCAGGNACCTTCTGTCACGGAAACCATTGCACC 

CAAAATTGCAAGACCTTTCATAGAGACTTTCCTCAGGCCCTCAAGAGTATTGAGTATCTG 

GAGGAGGATGCCCAGAAGTCCGCACAGGAGGGGGTGCTGGGACCACACACTGATGCTCTG 

TCATCAGACTCTGAGAACATGCCGTGTGATGAAGAACCATCCCAATTAGAGGAGCTAGCT 

GACTTCATGGAGCAGCTTACACCAATTGAAAAATATGCTTTAAATTACCTGGAATTATTC 

CATACTTCTATTGAGCAAAAAAAAAAAAAAAAAGTGCGGCCGCTCTAGAACTAGTG 
Sequence 405 
TTAGG GCGM TTGGAGCTC CCCGCGGT GGCGGCCGAGGTCAAGCTTCGACCCACGCGTCC 
GGAAGTTTT CATTCT CCCTC I I I I 1 1 I I I 1 1 1 I I I 1 1 I 1 1 AGCAATTCAAGNCATGTTTT 
TGTCTACAAG I 1 1 I ICCAGTATTGTATAGATAAATAATAATTTACTAGGCTGCCTTGAGT 
ATACTAGACAAGAGACCTGCCCG 
Sequence 406 

TNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCACI I 1 1 1 I 1 1 1 1 1 1 H 1 1 AC 

CTGAAMTGCTTATTCTAGCTTCACATTTGATTGTTTGGCTAAGAAGAAAATTATTTATT 

AGACTTMTTTTCCTCACGAGTTTAAAGATTGCTTCAGATCTTAAACTTCTAATGAGGAA 

AGCTGAGAAGTCCMTG CCATTCT GATTCTTGCAACTTACAAGTAGTCTTTTTTTGTCTA 

GACGCTTTCAGGACCTTCTTTTTTCCTCAGTCAGTGTATCCAAACCTTCACAGTGATATC 
TTTTGGGTACCT 

Sequence 407 

CCGCGGTGG CGGCC GC AC I I I I I 1 1 1 1 I 1 1 I I I I I 1 1 CTCTTATATTG AAGTAAAATTTA 
AAATTTAATACI I I I I ATTTTTTAAAAGCATGTATGGCATCATTTCAGTCTTATTAAATT 
CTCTCTGCATCCATTCACCCATCCTTCTTTTTGTGTGTGTGTGTAGTGGTCTCTGTGAGA 
GGGTTCATTAATGTCAATCCTGATCATTTCTTCCTCAAGAGATGTCAGTAGATTTGTTTT 
TTTTG CTTTG G ACTTTTATG AATTG ATTG AATTTTTATGCC AATTATTTTTAAAGTATTA 
CATAGAAGAACA AATGGACA GAAAAATTTAAATGCAATCAAATCTTGTTGATTTTGAAGT 
ATAGGAAATAATC I 1 1 I I I I I ATTATACTTTAAGTTTTAGGGTACCTGCCCGGGCGGCCG 
CTCTAGAACTAG 
Sequence 408 

CCGCGGTGGCGGCCGAGGTACTATTGACTAAAGTCAGTTGGGGGAGAGAGAGGCGGAAGT 

ATATTACTTTTATGCTTGGTTATACTAGAGAACAAATAGAAACTGACTAAAGAAACATTA 

GATCAGTGGTTC TCAGA GTATTGATATCTGGGAGTCCCAGCAACAGTCTGAGGAGGTTCA 

TGAGTTCAGAATATTTTGATAATAACACTAAGATGGTATTTACTCTTCTAACTGGGTAGA 

TATTTGCACTGGT 

Sequence 409 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTTACAAGTCGACCCAC 

GCGTCCGCTTCAGAATATCCAATTCATGTGAACTACAGGAAATTATAGTTTAGATATTTT 

TAAATGATTTGCCTGTCACCGTATAACACAAGGGTGTCATGACCAAGCTAGATCTCTTTA 

CCATATCATTAATAAAAGTCAAATTTTAAATTTGTGCCCAATTTGGCTGGGTGTGGTGGC 

TCATTCCTGTGATTCCAGCACTTTGGGAGACCT 

Sequence 410 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGACCAGTGCAAATATCTACCCAGT 

TAGAAGAGTAAATACCATCTTAGTGTTATTATCAAAATATTCTGAACTCATGAACCTCCT 

CAGACTGTTGCTGGGACTCCCAGATATCAATACTCTGAGAACCACTGATCTAATGTTTCT 

TTAGTCAGTTTCTATTTGTTCTCTAGTATAACCAAGCATAAAAGTAATATACTTCCGCCT 

CTCTCTCCCCCAACTGACTTTAGTCAATAGTACCT 

Sequence 411 

ACTT AGGGCG AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACGCGTCCGA 

TCACi 1 1 1 i iCATTGATACCTTATTAGATAAAACATTAGCCCCCTAGAGTGNNNTGTGAA 

GGAAATATGCCTAATAAGAGATGATAGTTTTAGCAATAAATGAGCATTAGAACTATTATT 

TATTAATGAAATGAACTGGTGGTCTGAAAGTGATGATAAACAGACAACTGTGGAAAATGA 

ATTATTAAAATTCCATGGAATTCCTTTTGAAGTTTATGAAGTACCTGCCCG 

Sequence 412 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGC 

ATTTATTAAGGCTTGTATATGTTCAAGATCCAGTGAAGACTGTCTTGGG 

Sequence 413 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAACTTGTGATGCTT 

TTGGCAGGAATTACAGAACAACCAATGCCATTCAAGTTGTGGAGATTATACTNGCAGGTG 

AACTCGTAAAGAGAAGATTCTGGAATGCCTATATCTGAAAGCTTGAGTCGACACCTN 
Sequence 414 

CCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGGAATGAGTGAGTGCAGAACTGGC 

AACCAGAAGACAGGAACAAGGCCTGGGAATGGAGCGGAAAGGTAGCTGCTATATATAGTT 

CCTTCAGCCAGTAACGATTAGAGCCAATAGCCATCTGGATGATGAATGGCTCCTAATTGC 

CTTAAATTACGGCAGTTAGCTAAGGGTTTCTGTTGCTACATGGGTTACCGTAGGCCGCTG 

CACCCTGC ATAACTGTCCTCAGGCCTGCGTCCCCTGAGTCTCAGC ACTTGGGCCTCCACC 
TGCCCG 

Sequence 415 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTCCCAMGTGCTGGAA 

TCACAGGAATGAGCCACCACACCCAGCCAMTTGGGCACAMTTTAAAATTTGACTTTTA 

TTAATGATATGGTAAAGAGATCTAGCTTGGTCATGACACCCTTGTGTTATACGGTGACAG 

GCAMTCATTTAAAAATATCTAAACTATAATTTCCTGTAGTTCACATGAATTGGATATTC 

TGAAGCGGCCCCNTGGGTCGACTTTGTAACCTGCCCGGGCGGCCGNTCTAGAACTAGTGG 
GATCCC 

Sequence 416 

TATGGCGAATTGGAGCTCCCCGCGGTGGCGGGNCGAGGTNAAGCTTCGACCCACGCGTCC 
GATTATTCTCTCCATTTAGGCTATAAATCTTTCAGTGTAGGGTGTTTCTAATGTCNTATT 
CTTCCAAAAAAAAAAAAAAAAAAGT 
Sequence 417 

CCGCGGTGGCGGCCGAGGTACTCTTGATGTCATAAGATTAGAAAATGTGGTTAATTGTCA 

TCMCCCATTAAGTTCT TAAATG TCATTGAATGGAGTCCTTGTCATGTTACAGAGGAGCG 

TAAMTTGTGGTTAAACATTTTT TTAAA GATTACATGGTAGAGCCACAGTTTGTTATGCA 

GMGGMAATTTAGCAMTATTATTTTGCTTMTAGCCTTTAAAAAATCGTATAAATTTG 

ATTTGTAGTTTTATCCCCAGAGTCATTAGATTTTTCCAAAAAAAAAAAAAAAAAAAGGT 
Sequence 418 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTGCCAAAGCCT 

TAACTTAGATTGTTACTTATGTTTCTAAATCTGNGGMGCACATTTCCTTTTNTTNTNNT 

TTTCTTTTACTGTTAATATCCTTATTCTCTATTTTACCAGTGGAGAATGNTTAGTATTAA 

TTTCCATTTANCTCANGATTCAAGAAATGCAAAGTGCTATTTTTATCAMTTTCTGAAAG 

CCTACTGTCTTCTGNTTTGGMGTCCCACMCAGCTCTTTAATTTCCTTAAGCCCCACTT 

TCCTCATCAGCAAGTTGGTGTGGCAATGGATCATAATAGGTTGCTGGGAGGATGAAGTGA 

GCGGACCGCGTGGGTCGAAGCTTGTACCTN 

Sequence 419 

CCGCGGTGGCGGCCGCACTTTTTTTGTATTACTTCMCTTTTAAAAATTCTAAAGAAAAC 

CATCATCTCAGACCAGCATTTCCGGACGCGTGGGTCGAAGCTTGACCT 

Sequence 420 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTGTCAAACATCT 
CCCTCGTCCGGATCCTTCTAACGCAGGAGTCTCAGACGCAAATGCCGGCAAGGGCCAGGC 

AGGTGATGTAAGATGCGTGGAGCAGATGCCAAGCCACAGGGAGTGGTGGAGACTGGGGTG 

AACTGGAAAGCACCT 

Sequence 421 

CCGCGGTGGCGGCCGCCCGGGCAGGTACACAAACCAGATGTATGCAATGATGCCAAAAGT 

CATCTCAAAATTCCAAGCTGACCTAGTGCAACACATTTACACTTGGATAAACTATCACCT 

TGMGATTTACTTTGTATACTTCAAMGATCTGGTCATGAMTTTTTAGCTAATACATAA 

AGTGCCGAATTGAAATCCAGAATACAATAGCTATGAAGGGCCGCTAGAGTGCAGATAACC 

MCCCATTCTACCTMCTCAGGTTGAGATTGCTTTAGAACCTATCATTGGCTTTAAAATG 

GTCCACAAAAAAACACCTNATGGTCACACTTAAAATAGTTTGCATCCCACACTACAGGCT 

TCTCTGGAGGGATTTAATACTTTGG 

Sequence 422 

GGTGGCGGCCGCCCGGGCAGGTGTCAAACATCTCCCTCGTCCGGATCCTTCTAACGCAGG 
AGTCTCAGACGCAAATGCCGGCAAGGGCCAGGCAGGTGATGTAAGATGCGTGGAGCAGAT 
GCCAAGCCACAGGGAGTGGTGGAGACTGGGGTGAACTGGAAAGCACCT 
Sequence 423 

TNCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATAATATAC 

AGAGGTATAATCTGTAACATCAATAATGTAAAGTGGGGAAGGGCAAGGTGGAAAAGGAGT 

AGAATGCTTGTATGTGACTAAAATTATGTTGGTATCAGTTTAAAATATATTATTATAACT 

TTAGAATGCTATACCCATTCCCACAGTAATTCCCATAGTAACCAAAAAGAAAATATCTGT 

AGGATACACACAAAAGAAAATCAGAAGTAGATGCAAACTTGTCACTACAGGAAAAAAAAA 

GCTATCAAAATAGAAAACAATAATGGAGAAAATMGACACCAAAAGCTATAAGACTCACA 

GAAAATAAATAATAAAATGGCAAAAAGAAGCGGACGCGTGGGTCGAAGACCT 

Sequence 424 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGTACAAAGCTTCGAC 

CCACGCGTCCGATTTGGCAGAAATTCAGGGTAATGTCAAGGTTCTTAAATCTGAGAGAGA 

CAAGATCTTCCTTCTTTATGAACAGGCACAGGAAGAAATTACCCGACTTCGACGAGAAAT 

GATGAAAAGCTGTAAGAGTCCTAAATCAACAACGGCACATGCTATTCTCCGGCGAGTGGA 

GACTGAAAGAGATGTANCCTTTACTGATTTACGAAGAATGACCACAGAACGAGATAGTCT 

AAGGGAGAGGCTAAAGATTGCTCAAGAGACAGCATTTAATGAGAAGGCTCACCTGGAACA 

AAGGATAGAGGAGCTGGANGNCCNTCCCGGGGGCCGGGGNCNGCCCGCCCCNNGCAGGGT 

CANATGATTGCAGAATTTATGTGATTCCTGGGGT 

Sequence 425 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTACTGAGCCACTTA 

CAATTATT TCTG AAAAATCTCAGAGAACTGAGGATAGATCAGAAAATTTAAAGAAAGCAA 

ATACCAAATTTTCAAACCAGGATAGGAAGTGAATACCTTGTAAATACACTTTGTTAAGTG 

ATGATMTTCTGAGTAAAAATTTAGAAGATTTTGAGAAAAGCATTTGAACTTCTAGGGGC 

CAATAAAATACCATGCAGAAGAATGTTTAAAAAGTCATGCCAAATTTGAATCCATTTGAT 

CCTCAACCTCATCAGATGTTATATGCCAAACTACTTATTTTGGCTTAGATAATAATCATA 

TAGAATGAAACTTTCCACAAATAGACTGTGGTCAGTGGCTG 

Sequence 426 

CCGCGGTGGCGGCCGCCCGGGCAGGTCAAGCTTCGACCCACCGTCCGGCAATGATGAGCA 

AMACMGTTTGGTCCCCCTGTTATAGNGCCTGGTAMGGTTTTTGTTGTTGTTTTGCAG 

GGGTGGGGGAACCAGGAAATCAGATCATCACAACAATATATACTTATCTGTAACTATGGT 

MCTGCTACAGCAAAGGGGCGTATCATACTATTAGCATACTAAGTTTCACTTAAAGAGGT 
CGGA 

Sequence 427 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTGTCAAACATCTCCCTC 

GTCCGGATCCTTCTAACGCAGGAGTCTCAGACGCAAATGCCGGCAAGGGCCAGGNAGGTG 

ATGTAAGATGCGTGGAGCAGATGCCAAGCCACAGGGAGTGGTGGAGACTGGGGTGAACTG 
GAAAGCACCT 

Sequence 428 

TANGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCAAGCTTCGACCCAC 

GCGTCCGTGATAACTTCTCCTAAGTGCCAGGCATTGTATTACATGTTGGGAGCACAAAGA 

TGAATAATAACAATAGGTTCACAGAAAAGATGAATTGATTGAGAGAAAAAGAACCCTCCA 

GGAGCCCTCAGCGTAGTAGGGGGTTGGTGTTGGAGGGTGGAGGAATGGAAAAGGCCCTGA 

AATGCAGGCAGAGAAATGATGAAACAATTCAGGGGCTGTGGTGAGGTTAAATGAATATCT 

TTACAGCAGCCTCNAAGACTGATCAGGTTACTATACCCTCTCTTNTGTCCACNGTGCATT 
TNAA 

Sequence 429 

CCGGGCAGGTCAAGCTTCGACCC ACCGT CCGGCAATGATGAGCAAAAACAAGTTTGGTCC 

CCCTGTTATAGAGNCTGGTAMGGTTTTTGTTGTTGTTTTGCAGGGGTGGGGGAACCAGG 

AAATCAGATCATCACAACAATATATACTTATCTGTAACTATGGTAACTGCTACAGCAAAG 

GGGCGTATCATACTATTAGCATACTAAGTTTCACTTAAAGAGGTCGGA 
Sequence 430 
CCGGGCAGGTACACTCCAGCTCCTCTATCCCTTGTTCCAGGTGAGCCTTCTCATTAAATG 

CTGTGTCTTGAGCNATCTTTAGCCTCTCCCTTAGACTATCTCGTTCTGTGGTCATTCTTC 

GTAAATCAGTAAAGGCTACATCTCTTTCAGTCTCCACTCGCCGGAGAATAGCATGTGCCG 

TTGTTGATTTAGGACTCTTACAGCTTTTCATCATTTCTCGTCGAAGTCGGGTAATTTCTT 

CCTGTGCCTGTTCATAAAGAAGGAAGATCTTGTCTCTCTCAGATTTAAGAACCTTGACAT 

TACCCTGAATTTCTGCCAAATCGGACGCGTAGGTCGAAGCTTGTACACCTCGGCCGCTCT 

AGAACTAGTGGGATCCCCCGG 

Sequence 431 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGGCAGGTACGATTTAAAATCTC 

CTCCTCCTACAGCGGTCGAGTATTGMGCAGGTCTTTGAGGATGGGCNGGAATTAGAGTC 

ACCAAAGGAGGAATACCCTCACAGTTTTCTGCAAGAGTCTCTTGAAACAATGGATGGTGT 

TTATGGGTCTGGGGAAGACCCCNCGCCCCAAATGTTGCTCCCCT 

Sequence 432 

CTTAGGGCGAA TTGGA GCTCCCCGCGGTGGCGGCCGAGGTACACACAAACACACACACAA 

AGTTTAATATACTTTTTAAAAAATTTTATTGTATTGTTTTCTTGAMTAGGGTCTTGCTA 

TGTTGCCTAGGCTGGTCTTGAACTCCTGGGATTAAGCAATCCTCCCAACTAAGCCTTCCA 

AAATGCTGGCATTACAGGTGTGAGCTACCACMTCAGTCTCTTAGATTTTGTTTTTTAAG 

AACA ATTCGAAGTTTACTGCAAAATTGTGAAGAACGAACAGACTGTTCCCACATATCCCT 

TTTTCTTTACACACCGGACGCGTGGGTCGAAGCTTGACCTGCCCG 

Sequence 433 

CCGCGGTGGCGGCCGCCCGGGCAGGTCAAGCTTCGACCCACGCGTCCGGCTTGAGGTGGG 

TTTAGGAAACATTTGGTATCTNTGGCAGGGACAGATGTTGACCTGGCCGGTCGGCAGCTT 

TTACAAACCTAAGGACT TCAGG GTCCGGTTGCGCATGAGGACCGGGGAGGACAGAGCTGT 

TTGCMTAGGTGTGGGCTTTTATAGCATTGTGAGCATTTCACGTTAGCGTAAGTGTTGCT 

GCTGTGCAGGTGGTCTCTGGGGCTTACAATCTTCCCCAATGTTCTTCCCCACCCCTCCCA 

CCATTCTGGTGAACAAGCCTCTTGGGATTCTTTGAAAAAAAAAAAAAAAAAAACCT 

Sequence 434 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATAATATACAGAG 

GTATMTCTGTAACATCAATAATGTAMGTGGGGAAGGGCAAGGTGGAAAAGGAGNNGAA 

TGCTTGTATGTGACTAAMTTATGTTGGTATCAGTTTAAAATATATTATTATAACTTTAG 

AATGCTATACCCATTCCCACAGTAATTCCCATAGTAACCAAAAAGAAAATATCTGTAGGA 

TACACACAAAAGAAAATCAGAAGTAGATGCAAACTTGTCACTACAGGAAAAAAAAAGCTA 

TCAAAATAGAAAACAATAATGGAGAAAATAAGACACCAAAAGCTATAAGACTCACAGAAA 

ATAAATAATAAAATGGCAAAAAGAAGCGGACGCGTGGGTCGAAGACCT 

Sequence 435 

CAGGTACAGGCACCTATATGMTTTAAACGGGGAAGATTTCTTTATTTTGTATTCAATGT 

ATNMTMGATTNTTA AMCATATTTTG GAGAAATNGCTAATTAGTGTATAATCCTGATG 

CCAATTCTAAAAAACC I I I I I 1 1 I I I 1 1 GNAGAGACAGGGTNTTATTCTGTCACCCGGGC 

TGGAGTGCTCTGGTATGATCCTAGTTCACTGCAACCTCAAATACCTGGTCTCAAGCAATC 

CTCCCACCTC AGCCT CCCCAGTAGCTGTCTCTATAAGCATGCACCACCACACCTGGCTAA 

CCTTCTTATTATTTTTGGTAGAGACAGTCTCACTATGTTGCCCAGGCT 

Sequence 436 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTATTCAACAAGGGCCCTGAGAG 

AGGGACAGGCAGCCCCTGTGAATCTTGCTGTTCAGCAGAGACAGGAGTCAGCACGTGTGA 

GGGCAGCAGGGAAGTCTTCCTGGAGGAGTGAGACCTGGCGATGAGGAGGCACGGCAGGGA 

GGTGGAACAGGCAGGAGAGACTCTTCAGGAATTGAGGAGATAGAATAGAGGACACTAAAG 

CCTTAGAGAGGCCAGGGGTGGTGGCTTGGCAGGATCATCGCTTGAGGCTAGGAGTTTAAA 

AGCAGCCTGGGCAACATAGCGAGACCCCATCTCTAAACACAAAAAATAAAAACCTGCCCG 

Sequence 437 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCACGGTACCACGTAGCAAC 
ATATGAGTATTTCTCTAGATAACI I I I I 1 1 I I GACAAGGTCTCACTCTGTTGCCCAGGCT 

GGAGTGCAATGGTGCAATCTTGGCTCACTGCAGCCTTGACCTTCCCTAGCTCAGCTGAAC 

CTCCCATCTCAGGACACCATTGCCTCCACTGCCCATCCTGCATCTGCCTGCCTACCCCAA 

A AGTGT T GAGAAT ACAAGCATGAGCCAGAGCCACGGAACCTGGCCTCTAGAGAGACTTTC 

TATTTTAGTTTTTTCTT CTCTTA TTTGTGMGCCTTGAAAAAACTACTGTGGTTTATTTA 

GATTCTGGTTTGTGAC I I I I 1 1 AAATAAACTTTTTATTTTGGMTAAATTTATGTTTAGA 

GAATAG TTGCA AACATAATAAAGTGAGTTTTCATAAACGCCTTACCAGTTTCCCCTGNTG 

GTTAACATTTTACATCACCATGCTGTTGCATTGGTCAAAACTA 

Sequence 438 

CCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGCATTTATTAAGGCTTGTATATGT 

TCAAGATCCAGTGAAGACTGTCTTGGGCGTGTATAATTGATCTTAACCACAAGGCTGAGA 

AGTTATGTGCAGGGCTTATGATGCTACTTCCAMGTATTAAATCCTCCAGAGAAGCCTGT 

AGTGTGGGATGCAMCTATTTTAAGTGTGACCATGAGGTGTTTTTTTGTGGACCATTTTA 

AAGCCAATGATAGGTTCTAAAGCAATCTCAACCTGAGTTAGGTAGAATGGGTTGGTTATC 

TGCACTCTAGCGGCCCTTCATAGCTATTGTATTCTGGATTTCAATTCGGCACTTTATGTA 

TTAGCTAAAAATT 

Sequence 439 

TCGAGGCCGCCCGGGCAGGTACACAAACCAGATGTATGCANTGATGCCAAAAGTCATCTC 

AAAATNGCAAGCNGACCTAGTGCAACACATTTACACTTGGATAAACTATCACCTTGAAGA 

TTTACTTTGTATACTTCAAAAGATCTGGTCATGAAATTTTNAGCTAATACATAAAGTGCC 

GAATTGAAATCCAGAATACAATAGCTATGAAGGGCCGCTAGAGTGCAGATAACCAACCCA 

TTCTACCTAACTCANGTTGAGATTGCTTTAGAACCTATCATTGGCTTTAAAATGGNCCAC 

AAAAAAACACCTTATGGTCACACTTAAAA 

Sequence 440 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCTAAAACTTAAAGTA 

TAATAGTAATAATAAAAAAAGAGGTGCTTTTCTCCTAAGTCAACATTTTAGAGGAAAAGA 

GTCAATTCAAGCAATTATCACATATGTGTAACTGAAGCACATATGTGTAACTTTTCAAGA 

GTGATTAGATG GTCTG TTGTCTTTGAAGTGATAGTCAAATATCAGGTGTGTTCTAGGGAG 

GTTGTGTMGACTTTTGCTTGTATTCTCCCGGACGCGTGAGTCGACTCAAGACCTGCCCG 

Sequence 441 

CTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACG 
CGTCCGATTATTCTCTCCATTTAGGCTATAAATCTTTCAGTGTAGGGTGTNTCTAATGTC 
ATATTCTTCCAAAAAAAAAAAAAAAAAAGT 
Sequence 442 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTTACAAGTCGACCCACG 

CGTCCGCTTCAGAATATCCMTTCATGTGMCTACAGGAAATTATAGTTTAGATATTTTT 

AAATGATTTGCCTGTCACCGTATAACACAAGGGTGTCATGACCMGCTAGATCTCTTTCA 

TATCATTMTAAMGTCAMTTTTAAATTTGTGCCCAATTTGGCTGGGTGTGGTGGCTCA 

TTCCTGTGATTCCAGCACTTTGGGAGACCT 

Sequence 443 

CGCCCGGGCAGGTACACAAACCAGATGTATGCAATGATGCCAAAAGTCATCTCAAAATTC 

CAAGCTGACCTAGTGCAACACATTTACACTTGGATAAACTATCACCTTGAAGATTTACTT 

TGTATACTTCAAAAGATCTGGTCATGAMTTTTTAGCTAATACATAAAGTGCCGAATTGA 

AATCCAGAATACAATAGCTATGAAGGGCCGNTAGAGTGCAGATAACCAACCCATTCTACC 

TMCTCAGGTTGAGATTGCTTTANAACCTATCATTGGCTTTAAAATGGTCCACAAAAAAA 

CACCTCATGGTCACACTTAAA 

Sequence 444 

ACNGNCAGGTACCAAGATTAAGGACAGAGTTCCCTCCATTGGTCATTGATTTGNAAACCA 

AAATGTATCTGTGACAGGTATTAATCCGGACGCGTGGTCGAAGACGAAAGGACACGAGAA 

ATANGGACCTANNCCGCTCTANAACTAGGNATCCCCNNNNCTGCAGGAATTCGATATCA 
Sequence 445 

CCGCGGTGGCGGCCCGCCCGGGCAGGTGCCAAAGCCTTAACTTAGATTGTTACTTATGTT 

TCTAMT CTGTG GMGCACATTTCCTTTTCTTCTTCTTTTCTTTTACTGTTAATATCCTT 

ATTCTCTATTTTACC AGTGG AGMTGTTTAGTATMTTTCCATTTANCTCAAGATTCAAG 

AAATGCAAAGTGCTATTT TTATC AAATTTCTGAAAGCCTACTGTCTTCTGCTTTGGAAGT 

CCCACMCAGCTCTTTAATTTTCTTAAGCCCCACTTTCCTCATCANCAAGTNGGTGTGGC 

AATGGATCATAATAGGTTGC 

Sequence 446 

CGGGCAGGTACCCTTATTTTCCCTGATACAGCACAACTCTGCCTATTCTAATCATGACCT 
AGACACATTCAATGAACTACAAGTTCTCCTTTACACGGACGCGTGGGTCGACTCCGGACG 
CGTGGGTCGAAGACCTCGGCCGCCT 
Sequence 447 

CGAGGTCTTCGA CCCAC GCGTCCGCTTTCTTTTGTTGTTAAGGACCCATGACCTGCAGTT 

TCCCTMCATTCATTTTTATACAGGGCAGAGGTATGTGTGCGAGCTCAGATACCTTAAAT 

TCATATGCCTTTAATACAATCAGGCAGATTTCTAAATGAGGGATGCTTCCCCACAAATGG 

AGAGTGAAAGTGGGCCAGCCTAAAAGGACCTCCATAGCACTGTGCATGGCCAGCTGTTTG 

TGGCTGTACCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnATCATGNTCATAGCTGNTTNCTGGG 

TNAAMTNGNTANTCCGI^CANAATTTTCACACANTCNTANNAACCGTTNCCTTNAATTG 

TTNANN CCNGG GGTNCCNAATTGGGNGNNCTNACTNAAATNAANTNGGGTNGCTTNTNTT 

GNNNNNI I I I I ANNTNNGTTTANCCTTNTNNNCANNTTNNTTNTTNAAATTNGNNAAANN 

CCNNGNGGGGAGGGNNNNTTTNNNNNNTNNNGNNANNNTNNNNGNNNNNNNGGNNNNNNN 

TTNATTNNNNTCTNTNNNNNTNNTNNNNNNTANNNNANTTTNNNNTTNNNNNNNNNNGGN 

NNNNNNNNNNNNNTNNANNTTTANNNNAANNNNGGNNNNTNNNNNTTNNANTNNNNTTNG 

NGTNGTTNTNTNNNNNNNANNNTNTNTTNNNNNNNNNNNNTNTNNNNTTTNNTTTNNNNA 

NGNNNTTNTT NI I I I 1 1 I I 

Sequence 448 

GCGGCCGCCGGGCAGGTTACAAGTCGACCCACGCGTCCGCTTCAGAATATCCAATTCATG 

TGAACTACAGGAAATTATAGTTTAGATATTTTTAAATGATTTGCCTGTCACCGTATAACA 

CMGGGTGTCATGACCAAGCTAGATCTCTTTCATATCATTAATAAAAGTCAAATTTTAAA 

TTTGTGCCCAATTTGGCTGGGTGTGGTGGCTCATTCCTGTGATTCCAGCACTTTGGGAGA 

CCT 

Sequence 449 

CGCGGTGGCGGCCCGACCAGTGCAAATATCTACCCAGTTAGAAGAGTAMTACCATCTTA 

GTGTTATTATCAAAATATTCTGAACTCATGAACCTCCTCAGACTGTTGCTGGGACTCCCA 

GATATCAATACTCTGAGAACCACTGATCTAATGTCTTTAGTCAGTTTCTATTTGTTCTCT 

AGTATAACCAAGCATAAAAGTAATATACTTCCGCCTCTCTCTCCCCCAACTGACTTTAGT 

CAATAGTACCTCGGCCG 

Sequence 450 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTGTCAAGCTTCGACCCACG 

CGTCCGAAAGAGACCCCTGGGCTAGGGAGCCATTTGCAAGTCCCTCAGATGGCCTCAGGT 

GTGGGTCATGCTCCACTGCAGCCATGGTTACTGGGAAGCCAAGTGGACTCCATGGGGCTC 

CAAGCAGGGGAGGGAGCTCCTGGGAGACACAGTGTGGCCCAGATGCCCTGGTCATGGGGA 

CATCCMGCACAGGTGAGAAACTTGGGCAGGGAGGGTGCAGAGGAAATTTGGAAATGGCT 

GAGAGGCTGGACTCAAGGTCTCTTGGGCTTGMCTGAGCTCAAGCAGTCCACCCACTTCA 

GCTTCCAGGGTGCTGGGATTACAGTCGTGAGCCATGGCATCCGGCTGGACTCAAGGTCTC 

TTGTTCCCTGATCCTGGGCTTGGCATGGAGAAGGGAGGAAGCTGAAGGGGGCAGGTAAAA 

AGAT 

Sequence 451 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAGCTTCGACCCACGCG 
TCCGATTATACCCTAAAAMGTAGAAAGATGGAATAATTAGGAGTAAAATTAGTGAAATG 

GAAAGATATGTTATAGAAAGGACCAAGAAACGCAAAAGTTGGTATTTGAAAAGACTGAAA 

A AATT CATGAACCTGTGAAAACAGTGGTCAAGAAGAAAAGAGAGGCATATTTTGATCTCT 

GTTTTACATGTTACTCAATGTTCATTGCTGCCTCCCTTGTCCATAAAGTGCCTTTAGTGT 

GTATGTTACTTTAGATTATCTTGGTGTCATCAAGCTTTACTCAGCAAAGAACCACTTTGT 

TGTCTACTTTAAAACATAAGTTATCTTTAMAGAATGGGTATCTTTTATAGTTCCATATT 

MTGGCGAAGAAACTGCAGGTAACAGTGCCTTACCAGCTGGGTTTTGCTAACTTTTCTC 
Sequence 452 

C CGCG GTGGCGGCCCGCCCGGGCAGGTTTTATATTTTTTTCCTCTTTAAAAAATAATTTG 

GTTTTGAATATTAAATTTACATATTTCTAAGTTAAATCAACATTCGTAGAGGAATTATCA 

AAAAAAACTAGTAAGTCTGAAAAAAAAACCATATTTTATATTCTGAGGTCCCGGACGCGT 

GGGTCGAAGCTTGACCT 

Sequence 453 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCGGCAGGTACCCAACACAAACTA 

TTCAATAAAGTAATCTGCTTTAAAAATAAAACACACTGAAAGGCCGAGGCAGGTGGATCA 

CCTGACATCATTAGTTCAAGACCAGTGTGGCCAAACTGGTGAAAATTAGTCTCGACTAAA 

AATCAACATTAGCTGGGCGTGGTGGCAGGCGCCTCTAATTCCAGCTACTCAGGAGGATGA 

GGCAGGAGAATCACTTGAAGCAAGGAGGTGGAAGTTGCAGTGAGCTGAGATCGTGCCATT 

GCACTGCAGCCTGGGCAACAGAGTGAGACTCCGTCTCAAAAACACCACCACCAACAAAAT 

AAACACAACAGAATTATTCTGCAAATACAGATATTGGAGTAGCTGAGTTCCATCTCAAAT 

TTGA CTATGCAGGTTGACAGGTGATCTTGGCAAACTACTTATCCTTTCTGAAGTTCAACT 

TTTTCACCAAATGGTATTGGGATACAACACTTGCTCTTGCCTATCTCACATGAATTATCC 

ATTTTGGACAACTTGGTAAACTATA 

Sequence 454 

CCGCGGTGGCGGCCCGAGGTCCCAAAAGATATCACTGTGAAGGTTTGGATACACTGACTG 

AGGAAAAAAGAAGGTCCTGAAAGCGTCTAGACAAAAAAAGACTACTTGTAAGTTGCAAGA 

ATCAGAATGGCATTGGACTTCTCAGCTTTCCTCATTAGAAGTTAAGATCTGAAGCAATCT 

TTAAACTCGTGAGGAMAT TAAG TCTAATAAATAATTTTCTTCTTAGCCAAACAATCAAA 

TGTGAAGCTAGAATAAGCATTTTCAGGTAAAAAAAAAAAAAAAAAAAGT 

Sequence 455 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCTACAGCATCCTGATAA 

CAGCCTCTGCCCTGGGAAACAGACTGTGACCATGCATTTCTAGTCCAGCATATCCTATCA 

GAAGACCAAATGGCTTCATCAAAAACAGAGTGACAACCTCTTCTCTTTGCCTCTTCTGTG 

CTTGTTAACAGGCAGCATTGGGGCAGGAGAGCCTGCAGGCCTTTCACGGCTGCTTGAGTT 

CTCACCTGTTTGTCTGAGCTCTGATTCCTCTGCCCTGTAAGCGTAAAGGAGATGTGCTGA 

GTGGAAAGACCTCTAAACAGGCAGCCAGGAAGCCAGATTTCAGGTCCATCTCTGCCTCTA 

ACTGGCAGCTTTGCCTTGGGTAAATCATCAAGTGGGCAATAGTTTCTCTCCTGTAAAAGG 

AAMGATTGGGTTTAAGATTGTTTCTGAAGTTCTCTCTAGATTTAACCTGGAAGGAGTTG 
AAATTGCT AACC 

Sequence 456 

CCGCGGTGGCGGCCGCCCGGCAGGTACAACATTTTACATTTCCAGGGACTGCAAAAATGT 

TAGTTCCTTCCCCCATCATTTAGTTTGAAMTTCTTAGATAATTCTTTGCTGGTAAATTC 

CAACAGAATAGTTAGCACACAGGTTCCACACACACAAGTTCTAGATAGGAATCTGAAGCA 

CCACAATGAAMGAACATTTAACATCTTTTAAAAATGTTTAATGTTATCAGAAAGATGTT 

TGGTATATGTGTTCCATGCATGCTCCTGCTGGTTCTATTTGAAAAAGAAGTTTTTACAGT 

TATCTGTTGTCACCATATTGTAAACGGACGCGTGGGTCGAAGACCT 

Sequence 457 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCGGGCAGGTACAATAAATAGCA 
TTCCCACGGTGACCACAAGTCTTGGAATCAGTTCCAGGTGTCGTCGTGGCCGTTGACACC 
GCTGCCTTCTGACGGTAAATGTATTGTAGAATTCATGTTGTATCAGGCTTCAGTTTCCTC 
ATTCTAAAATGAGAGGATTGGATAAGTTAGTAGTTTCTAATTTTTACTnTAATCAGTGG 
CATC TCCCATTTAI I I I ICAmGAAATAAAACTTTTGAATTTTATCTTCTACCTAAATA 

ACATATTTTGTTTTATGTTTCAAGATGAAGCTCACACTGAGTTGGAAAAAAGGAAAAAGC 

AAAGGATCAAAGCTGGGGGAAAATACTGGACCATGTGCTTCACCTCATGGTGCCAAATAA 

AGAGAAAATGGGGAGAAGATAGGGACAGATAAAGATCTATTTGCTCGGATTGNGCTCTCA 

TCCTTGGCAACATGTTGACAATGCCCTGGAAATA 

Sequence 458 

CCGCGGTGGCGGCCCGAGAGCTCCAGGACGAAGGTATAAACACAGCAGAGGGCAGAGCCT 

GATTTCAATCAGGGGCTACTCTAAGAAAGGCAGGAAACTAGATAAATACATTTAAAAGAA 

ATTCCTCAGTGGCAGGGACAGTAGAGCAGCAGGGGGAGATCCCAGCACGGACAGGTAACA 

GTGTGATGTGGCAGAAAGGCTTTGGTTGCAAGTGGAGAACAGATGTCTCTGGCTGCCTCT 

GGCAGCTGCCTCCTTCTGGGCCTTGACTTTTCAAAGCCAGGCCAGGCCTCCCCCACCCTG 

GACCACCTGTAGCTGGTTCAGAAGGCCCCAGGCTGGGCTTCATAGATGAAGACACAGCTG 

ACTCAAGTCTCTGGCTCTGTCGGCTCTTGGCCACCTTGCCGCTCCCATAACGGTGTTTCT 

CAGGTCAACCCCTCTTCTCCATTCCTACTTCAATGACCTCAGGTCAGGCCCTTGCCACTT 

CTCTTCTGGACAAAGATGACAGCCCTTCACTGGTATCCTCGTCTNCAACCTAATTTATNC 

TTCACAGTGCTGGCAGAAGTGACATCTTTTAAACACACAACGACCCCN 

Sequence 459 

GCGGCCGAGGTCCGCAC I I I 1 1 I 1 1 I I I I I I 1 1 I I ATTTTTACTCCAG AATTTTCCTTTA 

ATATTTAGG ACTC CMTCTTTACTTACAAAATAGCTTTTATTTACGTGCACATGATCGTG 

GmCMAATTTTCTAAGCACTATGCTAAATTTGTCATCAAAACATAACAGATTCCCATC 

TTACAAACATAGTTGCTAGTTGAATGAGTAAAAGAGATTTCAAATTTCAATTCAAGGAGG 

CATGTCTAAAAGACCAGACCATTCATTTGATGAAATTGTAAATGCCGATCATCCAACTTA 

ACAGGAACTGCACATTTGTTCCTTTCTAGTTAGAAAAAAATAA 

Sequence 460 

CCGCGGTGGCGGCCCGCCCGGGCAGGTCTTCGACCCACGCGTCCGTGATTGCCTATTGTT 

TGTTGATTGACTGATTTATGCCTCTAAGAGGAACTATCTTTTGATAATATTAAATAAGAT 

GTCCTAATACAAAACTGATAGAGTTCAGAAATAATAAGAATCTCCTGGCCAGGCGTGGTG 

GCTCACGCCTTTAATCCCAGCACTTTGGAAGGCTGAGGTGGGCGGATCACGAGATCAGGA 

GATTGAGACCATCCTGGCTAGCATGGTGAAACCCTGTCTCTACTAAAAATACAAAAAAAA 

TTAGCCCGGGTGTGATGGCGACCT 

Sequence 461 

CGCGGTGGCGGCCCGCCCGGGCAGGTACAGAAAGGACAAATACATCAGTAGAAAAGAAGA 

CAATATAAGGGCAGATTGAAATATATACGTGAACGTCACAAAGACCAATTACTGCCATTT 

CAATTCAATGAGGAAATAATGATGTATTTAATAAATAGTGCTAGAATGCTGCATTATCTG 

TCTAGGATGAAAAAAAAAAAAAAAAAAAAGT 

Sequence 462 

CCGCGGTGGCGGCCCGAGGTGGMTGTCTGTTTTCACAAMTTTTGTATTTTCTCCTAAT 

AGTATGAGGTNGAAGAAATCTACATCTTCTCAAGTGAGCTTATGATTAACTCGATGAGTT 

TTCTTGCTATTCTCAAATCGGAATNTCCAGACCTGGCTAGAAACTAAAGTCTAAGCCCAT 

TCATTAAAGTCTTGAATTTATTTACTTTNGCCAAGAACAGCTATATAAAATTAGATTCCT 

CCTGGTATMAATTGGGTGTTTTCTTAGATATTNGCTATCAAAAGTCATTTTTCTTGGAA 

ATCGGACGCGTGGGTNGAAGCTTGTACACCTGCCCGGGCGGGCGGCCGCA CI I I I I I I I I 

I I I I I I I I I I 

Sequence 463 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGAAAG 

CATACAACTGAGGGTTCACGGATTTCTAAGAAATGCATTTCCCTTGTCTATGTTCATCAG 

CCTTTAATACTTTGGCTACAAGGCATATCAGAGAAAGGGAGGTTAAATTGGGTMTGACA 

AAAGAACATATGTAACTCTGGAAATAGGAAAAATGTTCCAGAAATGGGATCAATGTGCCA 

GCAATAAGCATAGTTCATTTCATTTGAAAATTCAGTTAAAGAGCCCAATAAACAGTTCCA 

AACCGGACGCGTGGGTCGAAGACCT 

Sequence 464 
ACTATNGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGTCGACTCAAGCTTTCAG 

ATATAGGGCATTCCAGAATCTTCTCTTTACGAGTTCACCTGCTAGTATAATCTCCACAAC 

TTGAATGGCATTGGTTGTTCTGTAATTCCTGCCAAAAGCATCACAAGTTGTACCTGCCCG 

Sequence 465 

CCGCGGTGGCGGCCGAGGTAATAGCTAATGCATGCAGGGCTTAATACCGAGGTGATGGGA 

TACTTCATGCTAACTGGGTGTTCCTTAGTGTAATAATTGACTGCCTGCTCCCATGTGGCC 

TCACAGAGTCATAGCTTCTTGCCCCAMTGTTTTTCTGGTTTTCTGTGGAGCCATACAGA 

ACCATAAAG AGAAG TTGTTGAGAAACCGGTTTGACTTTCAAGTCTGATGAGGAGCCTGGT 

TTCCATGAGATTTTCAGCTAAGTTAGCATATTTTTAAAAATTTCAAAATGAACAGAGGAA 

AGGTTGACGATCTTTAGAGCCTTCTGCCAGGCTGAAGTGTTCTGCATCTTCCCTAGACCC 
GGTACCTGCCCG 

Sequence 466 

CCGCGGTGGCGGCCGCCCGGNCAGGTTACAAGCTTCGACCCACGCGTCCGGGAAATTTTA 

ATTA AAMT AGGTGAACATTTTAAATGACCTAATACATATTTAGTCCACATTGAAACTTT 

GGCATTTTGTCATTGCCATTAAAATTTTGATGGCATTAAMTTTGATGCCATTAAAATTT 
TGAT 

Sequence 467 

CCGCGGTGGCGGCCGCCCGGGCAGGTCAAGCTTCGACCCACGCGTCCGTGATAACTTCTC 

CTAAGTGCCAGGCATTGTATTACATGCTGGGAGCACAAAGATGAATAATAACAATAGGTT 

CACAGAAAAGATGAATTGATTGAGAGAAAAAGAACCCTCCAGGAGCCCTCAGCGTAGTAG 

GGGGTTGGTGTTGGAGGGTTGGAGGAATGGAAAAGGCCCTGAAATGCAGGCAGAGAAATG 

ATGAAACAATTCAGGGGCTGTGGTGAGGTTAAATGAATATCTTTACAGCAGCCTCGAAGA 

CTGATCAGGTTACTATACCCTCTCTTCTGTCCACGTGCATTTC 

Sequence 468 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCACAACATTCCCCCTT 

CCCC AAACAGTAA TATG GACACTGATTTAACAAGACTTATAAAAAAATAAGGCACATTTA 

TTTTGATATGGTAATTTTAAAATAGAAACCCCTTCTCAGAACACCTGTATTCAAATGAGC 

T GTGTA AAAAGACACCTTGTGGTACCTAAAATAGGTTTATGGTACCTATGGAATTGCTTC 

TATTTTAGTGAAGATGGAATAAATTGCACCCATCCCACATTGTCAAGTAATGAAAATATG 

CGGACGCGTGGGTCGAAGCTTGACCTGCCCG 

Sequence 469 

GACCTCTTTAAGTGAAACTTAGTATGCTAATAGTATGATACGCCCTTTTGCTGTAGCAGT 

TACCATAGTTACAGATAAGTATATATTGTTGTGATGATCTGATTTCCTGGTTCCCCCACC 

CCTGCAAAACAACAACAAAAACCTTTACCAGGCTCTATAACAGGGGGACCAAACTTGTTT 

TTGCTCATCATTGCCGGACGGTGGGTCGAAGCTTGACCTGCCCG 

Sequence 470 

GCTCC CCGCGGTGGCGGCCGC CCGG GCANGGTTACAAGCTTCGACCCACGCGTCCGGGAA 

ATTTTMTTAMMTAGGTGMCATTTTAAATGACCTAATACATATTTAGTCCACATTGA 

MCTTTGGCATTTTGTCATTGCCATTAAMTTTTGATGGCATTAAMTTTGATGCCATTA 
AAATTTTGAT 

Sequence 471 

GCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCAAGCTTCGACCCACGCGT 

CCGTGATAACTTCTCCTAAGTGCCAGGCATTGTATTACATGCTGGGAGCACAAAGATGAA 

TAATAACAATAGGTTCACAGAAAAGATGAATTGATTGAGAGAAAAAGAACCCTCCAGGAG 

CCCTCAGCGTAGTAGGGGGTTGGTGTTGGAGGGTNGGAGGAATGGANAAAGGCCCTGAAA 

TGCAGGCAGAGAAATGATGAAACAATTCAGGGGCTGTGGTGAGGTTAAATGAATATCTT 
Sequence 472 

ATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAGCTTCGACCCACGCGTCCG 

GAGCGTTGCTTGGATTTCTMTTACTTCTMGNGTAGTTTTATTTMTTTCAGTCCTTTA 

GAAAAAANAAAATAAAANNAAATGTGCGGGCCCGGCCTGCCCGGGCAGGTNCCACNCGTT 
CGMMMGAMGMAMMAAMCTTTCTCTTTGCCANTTCTTCTTCTTCTTTNTT 
Sequence 473 

CTA CTTA GGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCAC I I I I I I I I I I I I I I I 1 1 I 

TCCTTTTGAGACACAGTCTCACTCTTGCCCAGGTTGGTCTAAAACTCCTGGGCTCAAGCA 

ATC CTCCCGCTTT CAGCCTCCCAAAGTGCTGGGGTTACAGCCGTGTGCCACTGTGTCTGG 

CCCTTTTCTTTTTCATAGGAGAAGGGTTGTTGACTCCCAGGAAACGTCACCTGGAACCAA 

GAATGTGAACTCAAGGACCCCCGCCTGTTGGCAGCTGCATTTACTTGACTCCTGTTCACT 

GTTTCTTAGCCTTGTCCTTTCTCTCCTGCCAGTTCTAGGGGAGACTGCTTCTCGTGGTTG 
ACCTCATCAATGCCCAACC 

Sequence 474 

CCGCGGTGGCGGCCGCCCGGGCAGGTACACAMCCAGATGTATGCAATGATGCCAAAAGT 

CATCTCAAAATTCCAAGCTGACCTAGTGCAACACATTTACACTTGGATAAACTATCACCT 

TGMGATTTACTTTGTATACTTCAAAAGATCTGGTCATGAAATTTTTAGCTAATACATAA 

AGTGCCGAATTGAAATCCAGAATACAATAGCTATGAAGGGCCGCTAGAGTGCAGATAACC 

MCCCATTCTACCTMCTCAGGTTGAGATTGCTTTAGAACCTATCATTGGCTTTAAAATG 

GTCCACAAAAAAACACCTCATGGTCACACTTAAAATAGTTTGCATCCCACACTACAGGCT 

TCTCTGGAGGATTTAATACTTTGGAAGTAGCATCATAAG 

Sequence 475 

CTNCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGACCNGTGCAAATATCTACCCA 

GTTAGAAGAGTAAATACCATCtTAGTGTTATTATCAAAATATTCTGAACTCATGAACCTC 

CTCAGACTGTTGCTGGGACTCCCAGATATCAATACTCTGAGAACCACTGATCTAATGTTT 

CTTTAGTCAGTTTCTATTTGTTCTCTAGTATAACCAAGCATAAAAGTAATATACTTCCGC 

CTCTCTCTCCCCCAACTGACTTTAGTCAATAGTACCT 

Sequence 476 

TTAGGGCGATTGGAGCTCCC CGCG GTGGCGGCCGAGGTCAACGCTTCGACCCACGCGTCC 

GGTTTGGGTGGMTTATMTATTTTAGATAAGATTTAAGAGGATTGCTAGATNGGAATGC 

GMTGATGATMGGCTTTTAGAGTTAGATAAGAGAGAGGGCGCTCTAGAACTAGTGGNTC 

Sequence 477 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCACAACATTCCCCCTTCCC 

CAAACAGTAA TATGG ACACTGATTTAACAAGACTTATAAAAAAATAAGGCACATTTATTT 

TGATATGGTAATTTTAAAATAGAAACCCCTTCTCAGAACACCTGTATTCAAATGAGCTGT 

GTAAAAAGACACCTTGTGGTACCTAAAATAGGTTTATGGTACCTATGGAATTGCTTCTAT 

TTTAGTGAAGATGGAATAAATTGCACCCATCCCACATTGTCAAGTAATGAAAATATGCGG 

ACGCGTGGGTCGAAGCTTGACCTGCCCG 

Sequence 478 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTATACTAGAA 

GATGCTCCAAGGTTTCAGAAAGGMTTMTTACTTTCAATTTGCACAATTTAGAACAAAT 

A TCTGG CTTTTCCCTMGCTTAATGATTTTCCATTTCACACAACTAAAATATAATAGCAT 

TATTTTATMTCMGTTTAACTGATGGTCTATGATAGTAGAGCGATTTAGTATTTTGACA 

AAAATCTTATGAGACATGAAGTCATTCAATTTGCCGGACGCGTGGGTCGACTCAAGCTAG 
ACCTN 

Sequence 479 

TTAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGCTTT 

CTTTTGTTGTTMGGGCCCATGACCTGCAGTTTCCCTMCATTCATTTTTATACAGGGCA 

GAGGTATGTGTGCGAGCTCAGATACCTTAAATTCATATGCCTTTAATACAATCCAGGCAG 

ATTTCTAAATGAGGGATGCTTCCCCACAAATGGAGAGTGAAAGTGGGCCAGCCTAAAAGG 

ACCTCCATAGCACTGTGCATGGCCAGCTGTTTGTGGCTGTACCTGCCCG 
Sequence 480

ACTACTATAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTC
CGCmCTmGTTGTTMGGGCCCATGACCTGCAGTTTCCCTAACATTCATTTTTATAC
AGGGCAGAGGTATGTGTGCGAGCTCAGATACCTTAAATTCATATGCCTTTAATACAATCC
AGGCAGATTTCTAAATGAGGGATGCTTCCCCACAAATGGAGAGTGAAAGTGGGCCAGCCT 
AAAAGGACCTCCATAGCACTGTGCATGGCCAGCTGTTTGTGGCTGTACC 
Sequence 481 

GACCTCTTTAAGTGAAACTTAGTATGCTAATAGTATGATACGCCCTTTTGCTGTAGCAGT 

TACCATAGTTACAGATAAGTATATATTGTTGTGATGATCTGATTTCCTGGTTCCCCCACC 

CCTGCAAAACAACAACAAAAACCTTTACCAGGCTCTATAACAGGGGGACCAAACTTGTTT 

TTGCTCATCATTGCCGGACGGTGGGTCGMGCTTGACCTGCCCG 

Sequence 482 

ACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTATTGACTAAAGT 

CAGTTGGGGGAGAGAGAGGCGGAAGTATATTACTTTTATGCTTGGTTATACTAGAGAACA 

AATAGAAACTGACTAAAGAAACATTAGATCAGTGGTTCTCAGAGTATTGATATCTGGGAG 

TCCCAGCAACAGTCTGAGGAGGTTCATGAGTTCAGAATATTTTGATAATAACACTAAGAT 

GGTATTTACTCTTCTAACTGGGTAGATATTTGCACTGGT 

Sequence 483 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCAAGCTTCGA 

CCCACGCGTCCGTGATAACTTCTCCTAAGTGCCAGGCATTGTATTACATGCTGGGAGCAC 

AAAGATGAATAATAACAATAGGTTCACAGAAAAGATGAATTGATTGAGAGAAAAAGAACC 

CTCCAGGAGCCCTCAGCGTAGTAGGGGGTTGGTGTTGGAGGGTGGAGGAATGGAAAAGGC 

CCTGAAATGCANGCAGAGAAATGATGAAACAATTCAGGGGCTGTGGTGAGGTTAAATGAA 

TATCTTTACAGCAGCCTCGAAGACTGATCAGGTTACTATACCCTCTNTTCTGTCCACGTG 

CATTTNAAAAACNTTGGCCGNTCTAGAACTAGTG 

Sequence 484 

CCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGATGGTTTTTGCAAAAATTGAAAA

TGCATCGATATTACAGTTMTTTTTTCAGTGTGTATGTGGTATTAGGCTTAGAACTATAA

CACAGGMGTTTTTAGAGTATGTCCACTCTGGTTTACTCCTTTGTAAGTATTAATACCTG 

ATAATTTACATCCTACAGCCCTGCC \ I 1 1 1 I 1 1 1 1 1 1 1 CAAGTTTGTCCCAGCAAGTCTT 

GGCCCTTTGCATTTTCTTAATACATTTTAGTACCTGCCCG 

Sequence 485 

CCGCGGTGGCGGCCGCA C I I I I I I I I I I I I I I 1 1 1 I ACCTGAAAATGCTTATTCTAGCTT 

CACATTTGATTGTTTGGCTAAGAAGAAAATTATTTATTAGACTTAATrTTCCTCACGAGT 

TTAAAGATTGCTTCAGATCTTAAACTTCTAATGAGGAAAGCTGAGAAGTCCAATGCCATT 

CTGATTCTTGCMCTTACMGTAGTCTTTTTTTGTCTAGACGCTTTCAGGACCTTCTTTT 

TTCCTCAGTCAGTGTATCCAAACCTTCACAGTGATATCTTTTGGGTACCT 

Sequence 486 

ACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTGGAGGCCCAAG 

TGCTGAGACTCAGGGGACGCAGGCCTGAGGACAGTTATGCAGGGTGCAGCGGCCTACGGT 

AACCCATGTAGCAACAGAAACCCTTAGCTAACTGCCGTAATTTAAGGCAATTAGGAGCCA 

TTCATCATCCAGATGGCTATTGGCTCTAATCGTTACTGGCTGAAGGAACTATATATAGCA 

GCTACCTTTCCGCTCCATTCCCAGGCCTTGTTCCTGTCTTCTGGTTGCCAGTTCTGCACT 

CACTCATTCCGGACGCGTGGGTCGAAGACCT 

Sequence 487 

CTAT NGGGC GAATTNTTNCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACGCGTCCGA 

TCACi 1 1 1 i ICATTNATACCTTATTAGATAAAACATTAGCCCCCTAGAGTGTTTTGTGAA 

GGAAATATGCCTAATAAGAGATGATAGTTTTAGCAATAAATGAGCATTAGAACTATTATT 

TATTAATGAAATGAACTGGTGGTCTGAAAGTGATGATAAACAGACAACTGTGGAAAATGA 

ATTATTAAAATTCCATGGAATTCCTTTTGAAGTTTATGAAGT 

Sequence 488 

CCGGGCAGGTACAAATCAAGTCATTAACATTTTCAATGTCAAAAATACAGCACGCTGTTA 
AGAGTTCTGTCAGTGCTCATTATCCCACTAGATCCCACAAAGGGCAAACTCAAAAGATGA 
AACAAAGGCAACGCCATCAATAACCACCATATTCCACAGGCTTTCTCCCCTAGGACGTAC
CTN

Sequence 489 

CCGCGGTGGCGGCCGCCCGGGCAGGTGGAAAGGTGGGTGGGGAGAGGGAGGCTTATTTGT 

TGCTGCAGTGTAACTAAGTGAAACCTAATTCATATGACTCAAACTAAGGTATATTTGGTT 

AGATCTAGGTGAGTTCTACTTTAGAGGAMTCCTGGTMCTGTTGTTTGTTTGTAAGTTA

TAGCTGTMTTMTTTTCCCTGTATTCAAAGCCCCCAAACCCTGCATTCAGATACTATGC 

ATTTAGACTTCCTTAGGCAAAGTCAAGGCAACAAGCTGATGATTCTAAAGCTATTNTTCA 

AGGGAGNTNTTTACCCATCATMAGGNGGTTTTTAGTCATTATAGATAATATTCAATCAA 

TTAIMTACCGGGGGATGGCAAAAA 

Sequence 490 

CCGCGGTGGCGGCCGAGGTGTAATTTGGAGAATATTTAAAGCAAAAGAGCAAACAACAAA 

MCTMGTTAACACTTACCCAGTGCAGTAAGGGAATTGTAAGATACAGCCTGCTTAAGGA 

GGTCTGCAGACAGATGCACCTMGATTTCAGCTGTTTTAGGTCACTTTTCTCAAAATATT 

TATTATCTGGCAATGGGGATGGGAGTGGGGAACACCTNTCTGTGAGGCAAATGGTATCTC

MCAMTACCGACTTTTCAAGGAAGAAAGCTCTCCACTTCTCTCATAAACTTATATACTA 

CCTTAAACAGTATGCAGTATTCGCGGACGCGTGGGTCGAAGCTTGACCTGCCCGGGCGGC 

CCGCTCTAGAACTAGGTG 

Sequence 491 

CCGGGCAGGTACAGCCTCACATACACAGATGCAGGTGAAGTCACCAAAGCTGATCTCTCA 

TTCGTTCTGGGGACAGTTAGCAGCGTAGTGGTCCCACTGCAGCAAAAGTTTGAAATTCAT 

TTTCTTCAGGAAAATACCCAGCCAGTCCCTCTCAGTGGAAACCCTGGTTATGTCGTGGGG 

CTCCCATTAGCTGCTGGATTCCAGCCTCATAAGGGTGGAGCTCTCCCGTGTCAGCTCGTA 

GCACAGMGGTGAAGAGCCTGCTGTGGGGCCAGTGCTTCCCAGATTACGTGGCCCCTTTT 

GGAAATTCCCAGGCCCAGGGACATGCTGGACTGGGTGCCCATCCACTTNATCACCCAGTC 

ATTCAACAGGGA 

Sequence 492 

CCGCGGTGGCGGCCGAGGTACTATTGACTAAAGTCAGNTGGGGGAGAGAGAGGCGGAAGT

ATATTACTTTTATGCTTGGTTATACTAGAGAACAAATAGAAACTGACTAAAGAAACATTA 

NATCAGTGGNTCTCAGAGTATTGATATCTGGGAGTCCCAGCAACAGTCTGAGGAGGTTCA 

TGAGTTCAGMTATTTTGATMTAACACTMNATGGTATTTACTCTTCTAACTGGGTAGA

TATTTGCACTGGT 

Sequence 493 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGTTAAAGGAATAATCTGCAGA 

ACATCTTGATTTACAAGGGACAAAATGATGCAAATTATATGCTGTCCAACCTACTGGTGA 

ACTGGATCAGMTGGTCCAAGGACTGTTAMCAGAGGAAGTATTTACATTTTGAAAACTT 

GCGGACGCGTGGGTCGAAGCTTGTACACCT 

Sequence 494 

CCGCGGTGGCGGCCGTTAAAGGAATAATCTGCAGAACATCTTGATTTACAAGGGACAAAA 
TGATGCAAATTATATGCTGTCCAACCTACTGGTGAACTGGATCAGAATGGTCCAAGGACT

GTTMACAGAGGAAGTATTTACATTTTGAAAACTTGCGGACGCGTGGGTCGAAGCTTGTA 
CACCT 

Sequence 495 

CTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCCTAAAA 

CTTAMGTATAATAMAMAAGATTATTTCCTATACTTCAAMTCMCAAGATTTGATTG 

CATTTAMTTTTTCTGTCCATTTGTTCTTCTATGTMTACTTT^ 

AAATTCAATCAATTCATAAAAGTCCAAAGCAAAAAAAACAAATCTACTGACATNTCTTGA 

GGAAGAAATGATCAGGATTGACATTAATGAACCCTCTCACAGAGACCACTACACACACAC 

ACAAAAAGAAGGATGGGTGAATGGATGCAGAGAGAATTTAATAAGACTGAAATGATGCCA

TACATGCTTTTAAAAMTAAAAAGTATTAMTTTTAAATT1TACTTCAATATAAGAGAAA 

AAAAAAAA 

Sequence 496 



CGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCTTCGA

CCCACGCGTCCGTAGTAATAGGGAATTMCTGACCCCTTTTGGATGGGGGAGAGCATCAG 

GCTGGGGTCAGGTAAGTGTAAATGGCCTTCTGAGCATGCTCTTCTAGGCTGACTCCCAGC 

CCTGACTTGAMCCATTAGCGCTAACTTGCTCTGTTTTGAGAAAMCTTTCCAAACTTTT 

GCATGAGAAACTAGAAAAAGGAATGTATGCCACGTAACTGGATTACAGAAATGAGTTAAT 

TGTCTCTGTGATAAAAAAAAAATGAAATATTTTCTTATTGMTTAATATTTTTGTCTTGA 

AGCATTTTCTAGTGATAGMTGTATTTGTCTTTTTTCCTGGGGGGNACCTCGGCCGCTCT 

AGAACTAGTG 

Sequence 497 

CTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCCCCAGGTC 

AGCACTTTGAACACTTGAACATCTATGAAATCACATAGTAAAGTGATAGGAGATGGGGCT

MGCTTTTMTGGCCTTTAGACATAGCATTAGACATAACCTAAGCTGAAAGGCTTTGGGA 

AGTTGTTGTGTTAAATCCCCMCACACTCTCGTGTTTTCTTAGGACTTGCCTNTTATTTA 

AAAAAAAAAAAAAAAAGTTGCGGCCCGCTCTAGAACTAGTTGGATCCCCCGGGCTTGCAG 
GN 

Sequence 498 

CCGCGGTGGCGGCCGAGGTACCCTTTTATAAGGGTGTATCCCCTTTTGGTAACTTACTGT 
TTGTTAATTTGTAGTGTTCCCTGCCAGTAAGCTTGTAACACTCTAGTGACTCACCTTCGG 
GTGGGAGGGTAGGAAAGGGAGAGGCCTGCCTCCTAAACCTGGGAAGATGGGGAGAGAGTG 
GTAAACCTGAGAGCCCAAAAAACAAACCAAAACAAAAAAAAAAAAAAAAAAAAAGT 
Sequence 499 

AGGTACCCAAAAGATATCACTGTGAAGGTTTGGATACACTGACTGAGGAAAAAAGAAGGT 

CCTGAAAGCGTCTAGACAAAAAAAGACTACTTGTAAGTTGCAAGAATCAGAATGGCATTG 

GACTTCTCAGCTTTCCTCATTAGAAGTTTAAGATCTGAAGCAATCTTTAAACTCGTGAGG

AAMTTAAGTCTAATAAATAATTTTCTTCTTAGCCAAACAATCAAATGTGAAGCTAGAAT 

AAGCATTTTCAGGTAAAAAAAAAAAAAAAAAAAGT 

Sequence 500 

CCGCGGTGGCGGCCGAGGTACTATTGACTAAAGTCAGTTGGGGGAGAGAGAGGCGGAAGT 

ATATTACTTTTATGCTTGGTTATACTAGAGAACAAATAGAAACTGACTAAAGAAACATTA 

GATCAGTGGTTCTCAGAGTATTGATATCTGGGAGTCCCAGCAACAGTCTGAGGAGGTTCA

TGAGTTCAGMTATTTTGATAATAACACTAAGATGGTATTTACTCTTCTAACTGGGTAGA 

TATTTGCACTGGT 

Sequence 501 

CCGCGGTGGCGGCCGCCCGGGCAGGTCAGGAGTGTCCCAAAGATTTCCCAAGTCCAGCCC 

AGAGAAGCTGAAAGCCTTTCCCCCAGGTGTGGGGCTGAGTTAGATGTGGGTCATAAAGGA 

TGTGGCCTCGAGGCTGGGAGGCAGCTGGGCAAAGTGGGAAGCCTCCCTACTCCTGAGACA 

GTGATGGCTCAAATCCAGGCCAACCTGGAACATGATCCTCAACTTCTCTAAGTTCACCTT 

TCCCAGGTGTGAAATGGGTTGTTCTGGGAATTGAGTGAGCTAATGATACACTCCCTGGCA 

CACAGCGAGCCTNAAAACGC7TGTGTCCCCTCCCTACCTCACAGCCCATTTTAGAAGTTT 

GCTGTCACTTA 

Sequence 502 

GACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTTTTCTTA 

TGAGTGGGAGGTGACTGATCGTGGAGGTGGATTTC7TATGAGTGGGAGGTGACTGATCGT 

GGAGGTGGATTTCTTATGAGTGGGAGGTGACTGATCGTGGAGGTGGATTTCTTATGATTG 

GCTTATCACCATCCCTCCTTGGTGCTGTTTTTGCAACAGTGAGTGATTTCTTGTGAGATC 

CGGTTGTTTAAATCCANAGGCACCTNCCCCTACCCTCTAGCTCCCATTCCTGCCATGTAA

GACACCTGCTCCCCCTTTTTCTTACCCCATGATTGGMGCTTTTTTGAGGCCTCCCCAGA 

AGCTGATGCCAGCCCTATGCTTCCTGCACAGCCTG 

Sequence 503 

CTACTATAGGGCGATTGGAGCTCNCCGCGGTGGCGGCCGAGGTTTTTGAAATGCACGTGG 
ACACGAAGAGAGGGTATAGTAACCTGATCAGTCTTCGAGGCTGCTGTAAAGATATTCATT
TAACCTCACCACAGCCCCTGMTTGTTTCATCATTTCTCTGCCTGCATTTCAGGGCCTTT

TCCATTCCTCCACCCTCCAACACCAACCCCCTACTACGCTGAGGGCTCCTGGAGGGTTCT 

TTTTCTCTCMTCMTTCATCTTTTCTGTGMCCTATTGTTATTATTCATCTTTGTGCTC 

CCAGCATGTAATACAATGCCTGGCACTTAGGAGAAGTTATCACGGACGCGTGGGTCGAAG 

CTTGACCTGCCCGGGCGGC 

Sequence 504 

CTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAGCTTCGACCCACGCGTC 

CGAAATTCTTGGTTAAGGATTATTATAAAATAGAATGGTATTTCAGTAAATCCCTGAGGC 

TTAGGAGTCCAGGTACAATGTTGGTTCTCAATTAAAATATAAATCATGTCTAGGGACACT 

TAGGMCACAGAACATATATTTAGAGCTAGAAAATATACAGCTTCAGACCAGGCAAAGTG 

CTGGGATTACAGGCGTGAGCCACCACGCTCGTCCTCACATGGGGTTTTATTATTAGGATG 

GTAAGAGTATTATAAGGGATTNGGTACMGGCATAATGAGTCCTTTTGCTTTTTAGGCTT 

TTGACTTNT GGTTTTA AGACTTTTIvrrrTAGCTTTTTGTTNGTTAGACANCCATTTGGGCA 

AGGCTTNGGi I I I I I AATAAAGTTTGCTTGGGATNAAACNTGACCTTAATGGAAATTGTC 

CCCTNCCCCCAAAA 

Sequence 505 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTACTGATACAAATAGCATGGATGAAACTCAAA 

ATCATTATTCTAAGAGCCAGATACTATAGCCTGTATTTTATGATTCACTTTCAATGAAAT 

TCTACAATAGACAGAACTATCTATCAACAGAAAGCAGATCAGTGGTTTTCTGCAGCCAGA 

GGTATGAAAGGTTTGMACATGTGGCACCAGTAGGACATATGGAMCTTTTTTGGTGTGA

TGGMGTATTTTTTTATCTTGATTGTGTGGTGTTTGTTATACAGTGGTATACATTTGACC 
T 

Sequence 506 

GGGGCGGCCGCCCGTTCAGGTACACGTNTTTNCCAACCAATTTTATANGNATATATATAT 

TCTACTTCCAACACCCNTNTTCATCCTGGTNCAATCAAAGCCTGGTTNTGGCCAACAANA 

AACTCGTCAGGAGATCGAAGGNTGTAGATGTCTGCACGTGGCTTCCTTGGAGGTCCAGNG 

GNGACTCCCTCTTCCAAAATCCATTCTGTACCCGCTGGCTGCTCTAACGGGCAGGACAAC 

AGCGTATGAAGCCTGACTGCAACTAGGAGAAGTACCACACTCCCGGACGCGTGGGTCGAA 
GCTTGTACACCT 

Sequence 507 

GGCCGAGGNCAAGCTTTTACCCACGCGTTTGAANCCATCTGTTTGGNACCCNGAAAGGGG 

GCAGGAAAGGCTGGGGTCCCAGNCCACCCTAAGGGNATCTGAGTGGCCCAGGGCTNCAAG 

NNNNCCCACCTGNCCAATGGGACCCTTTCTGNCCTCACCCTACAAGGGGCACAAAGGGAA 

GACACCAAACCTGGCAGGAACTTTTCACGCAATCAAGGGAAGGAAAGGCANTCCTGGCAG 

AGGGAACAGCANGCCAAGCGGGAGAAGGCTCAAAGTAAGGAGGGTAAG 

Sequence 508 

CCGCGGTGGCGGCCGAGGTCMGCTTCGACCCACGCGTCCGCTTTTCAATTTTATTGTAT

AGTTTTTGATAATGTTAATGTCTGAGATCTTTATGGGTGAGTCTGCTGTCATTTCTGCTA 

TTTCTCGTAGTGATTTGCTTGTATGGTTTATGATTTTTTAAAAACTGAATGTGTATTAGA 

ATTGTGTCTGGTMTTCTTTAGGGACCCATTGTAGATGTATTTCTTCAAAGAGCATTTGT 

GGTTATTATATTTGGGTGCTTGGGGCACTGCCAGTACCTGCCCG 

Sequence 509 

CCGCGGTGGCGGCCGAGGTATTGAACCAGGTCAAAACATTGTTGAATATCAAACCCAATC 

TATTTAATCTGTAAGAAACAAGGACCCTGAGAAAGATTCTGACCAAGGGTATGTGATCGG 

AAACTTGACAGATAAATGTAGTATACTTGTAAAGCCATACTGTGAAAAACTTGGGGATTA 

TTTGAACACAAATTATCACCTGGAAAAAGACAGAAAACAAGGCAGAAGACTGTGCAAAGA 

GGTTGGAATATTCAAAACTTCAGATTAGAAG 

Sequence 510 

CCGCGGTGGCGGCCGAGGTACCCAANNGATATCACTGTGAAGGTTTGGATACACTGACTG 

AGGAAAAAAGAAGGTCCTGAAAGCGTCTAGACAAAAAAAGACTACTTGTAAGTTGCAAGA 

ATCAGAATGGCATTGGACTTCTCAGCTTTCCTCATTAGAAGTTTAAGATCTGAAGCAATC
TTTAAACTCGTGAGGAAMTTAAGTCTAATAAATAATTTTCTTCTTAGCCAAACAATCAA
ATGTGMGCTAGMTAAGCATTTTCAGGTAAAAAAAAAAAAAAAAAAAGT 
Sequence 511

CCGCGGTGGCGGCCGCGACCACGCTTGGCCGCCCGGGCAGGTCAAGCTTCGACCCACGCG

TCCGAAATTAATGAMTGTTTTACATTCTTTTAAAAACCTTTGAAATATGGTGTGTATTT 

TATGCTTTAGCAAATCTCAGTTTGGACCATTTCAGGTGGTCAGCAATTACACATGGCTAG 

MCTAAGAGCAATCAGTTTTNTTCCACAGTTTTTCTAAMTTTTCTTGTCAAAAATCTTG 

ATGGTATGAATTACTCTTTTAAAAAGTGCACTTNACCAGCAACAGAAAANAACCCTGGAG

GGGTATGGGTTTTAAAGCTGGTACCTNGGCCCGNTCTAGAACTAGGTG 

Sequence 512 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTCCCAAAGTGCTGGAATCA 

CAGGMTGAGCCACCACACCCAGCCAAATTGGGCACAMTTTAAAATTTGACTTTTATTA 

ATGATATGGTAAAGAGATCTAGCTTGGTCATGACACCCTTGTGTTATACGGTGACAGGCA 

MTCATTTAAAAATATCTAAACTATAATTTCCTGTAGTTCACATGAATTGGATATTCTGA 

AGCGGACGCGTGGGTCGACTTGTAACCTGCCCGGGCGGCN 

Sequence 513

CCGCGGTGGCGGCCGCCCGGGCAGGTACAGAAAGGACAAATACATCAGTAGAAAAGAAGA 

CAATATMGGGCAGATTGAAATATATACGTGAACGTCACAAAGACCAATTACTGCCATTT 

CAATTCAATGAGGAAATAATGATGTATTTAATAAATAGTGCTAGAATGCTGCATTATCTG 

TCTAGGATGAAAAAAAAAAAAAAAAAAAAGT 

Sequence 514 

CACTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTCACTATAGGGATCT 
AGATCACGAGCG GCCG GCCGCAC I I I I I I I I I I II 1 1 I I I I I ATTTCTGCCACCTCTTTC 
ACTTGGGMTCTATTTTCACTGCTCTCCAAAGTTTTGAGAAGGCAATAGTCCTGGAAAAT 
GGGTCCTGAGCTCCTCTCAGCAGTCCTGCTTTCTTTCCACCTGCACTGTAAGGNGACCCT 
AACTGGGTCTAAGACAAAAAGTCGGACGCGTGGGTCGAAGCTTGTACACCTT 
Sequence 515 

CACTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGTTAGTTACCACTTCA 

TTACTGGAGGGCACTGTCACAAACTTCTGACTATCCAGACTTGAAGCTGGAAGCAAATAC 

AAGTCTGAGGGGCTCTAAGCTGGGAGGTTCTGGCGTCTCCCTAGCTCTCTATGGCTCTAC 

CTCTCTGCTTGAAGCTCCCTGCACTGCACTCCCATTACTCTGACTGGGGATAGGACCACT 

GCTGACAGGGCCCCACCTCAACTTCTTTCATTGCCCTCTTCCAGGAAATCCCACCCTGGG 

ATACTTCAAAGACCTCATATGCTACAAAGATCAAGGCCACCTAATGAGTGCTCTAGAGAT 

CAGCACCAAAGATGCTTGCCAGAGTCTTCTCTA 

Sequence 516 

ACACTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCAC 

GCGTCCGGGGAACCCTTGTTCAGGTTCTTTTTAGGCAACCCAAGCCAAGACAACAAAGTA 

AGATAGAGCCCCAAATGTGGTCGTATAAGGTTTTTCAAAGAAAGTAACACTTGAGTTAGG 

TCTTAAAGTTTNACCTAAGAAACTGCCCAGGTGGACAAGAAGAAAGGGTGTTCCAAGTAG 

AAATAATANCATGGACAAAGGCAATGTAGCAGGAAAAGTNTTCGTAAATTCAGGGAATTT 

CAAGTGTTTCACGATGGAAGGAGCAATAGAGTCATTTACTTGCGGTGGCAGGGGATGTTG 

GAAATGTAAACAAGAGTGAGATACAGAAGATTTTATGTGGCATGCCAACTGGGACTTTTT 

TTTGTAAACAA 

Sequence 517 

TCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTCCGGAGAAATACCATTTGCACA 

GTCAATCACTTCTGACCAAGCTTATCAGAAAAAGGAGAAAAGAATGTCTCCCCACTAAAT 

GTTCTAGGGNGGGNGAGGAAANCTAGGGTGGNTATCTAAATCAACAAATATTCTAGATAT 

TCCAATATCTAAATTATTGTTGGAAATACTCNTCCTGAAGNGNTCATTTGAACNCTAAAG 

CAGGAGNACAGCNTTTGTTGTATCAANATGGGCAGGGGTTTTTAAAGGGTNTCCATTTTT 

TNTTANTTTCCNCATTATTAAATTCCNTTNTAMTNNTTTTTAGGACCAAAAATTTTTCC 

CNTTTCTTNGAGGTNTTTAAAGGGGGATTTAANAAAATGGGNNANNTGGGGGGTn" 



WO 01/070979 



PCT/US01/09126



TABLE 1 

86/467 



Sequence 518 

CTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATGGTAGAGAAAGATTCAA 

AATGCTGCTGTTCTACCTGAGATGGGAAAAATGAAAGCAAAATAACATCAACAAAAACAA 

ACAAACAGCCTTGTAGTTCCATGTCACTAGCCAGGGATTTAAGACCAGCCTAGAGAACAT 

GGTGAGACCCCTTCTCTACAAAAATAAAAATAAAAATACAAAATAAGCTGGACATGGTGG 

TGTGTACCTGCCCG 

Sequence 519 

ATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAATGCTTCGACCCACGCGTCCGCTCACT 

TCATCCTCCCAGCAACCTATTATGATCCATTGCCACACCAACTTGCTGATGAGGAAAGTG 

GGGCTTMGGAAATTMAGAGCTGTTGTGGGACTTCCAAAGCAGAAGACAGTAGGCTTTC 

AGAAATTTGATAAAAATAGCACTTTGCATTTNTTGMTCTTGAGCTAMTGGAMTTAAT 

ACTAAACATTCTNCACTGGTAAAATAGAGAATAAGGATATTAACAGTAAAAGAAAAGAAG 

AAGMMGGAAATGTGCTTCCACAGATTTAGAAACATAAGTAACAATCTAAGGTTAAGGC 

TTTTGGCACCTGCCCGGGCGGCCCGCTCTAGAACTAGTGGGAT 

Sequence 520 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCTTCGACC 

CACGCGTCCGAGCTATGGACCTAAGGCAGCGAGTGGATTCATTAGTCCTCTTTCAGCTGA 

ATGCATGCTACAGTATMGAAAAAAGCTGCTGCCTATATGMGTCTTTGAGAAAGGTTTG 

TTA GCTGC TGTTAATATTTAAATCAGAGGAAACATCAGGAGTCA7TCTAGAGAATGGCAA 

GAGTTTTTCTGCAGTTTATATTGTTGACTTTTTATACGATATTGGGGTACCTCGGCCGCT 

CTAGAACTAGT 

Sequence 521 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTTCGACCCACGCGTCCGC 

TAGGAACTATGTTMAAAMATTCAAGAAAGAATTTAAGGGAGATTACAGTGTTACTGTG 

ACACCAGGAAAACTTAGAACTTTGTGTGAAATAAGACTGGCCAGCATTAGAGGTGGGTTG 

GCCATCAGAAGGAAGCCTGGACAGGTCCCTTGTTTCAAAGGTATGACACAAGGTAACCCG 

TAAGCCAAGGCACCCAGACCAGTTTCCATACATAGAACCTGCCCG 

Sequence 522 

CGACTACTATAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCA C I II 1 1 1 I I I I I I I J I 

TTTTAAGTAAAGAAGGTTTTATAATATAGTGAAAACAATACGGAGATGAAAACCAGGAGA 

CCTGGGTCCCGCCTTTGTTACAAATGCCTTTCCTAAAAGCTCCAGAATGGTGCGAGGTCA 

AAACAGATGGGCAGAAAGGAAGTGGTCATCAGAGCAAGAGAAAGAGCAGGTGCCAGGCAC 

TCACGTGTGCGGTCATATCAGGTAGAGATGATGAGTAGAGATCTGCCCTAGAAGACACTG 

MTTCTGAGATTCAMGGGGAAAAGTTGATTTTATAGCCAGTGATTTTATAGCCCACTTT 

CCTGCCCCACCCCTACTNTAAGAATTGCGGACGCGTGGGTCGAAGCTTGTACCTGCCCG 

Sequence 523 

CTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCCCAGAATA 

ACTAAATAAATAAAAGGCTAAAGAAAACTGGAACAGTACTGCGTCTCCATCTGAGACGCA 

NTCTTCTACTTCCAGCATCGNAGAGAAGGGCTAGGGACA A I M I N I \ CAAAAGATTTAT 

ATACAGGCTTGMTCCAGMATTAAGGNTAAAAGCATAAATATTGATAATTTCAACTAAA 

TTCAGMTGGNTTCAGAAAGATATGATACAACAATTTAGAATAAAACAAAGCAGAAGAGC 

ATNATATTTTGCGGACGCGTGGGTCGAAGACCT 

Sequence 524 

CCGCGGTGGCGGCCGAGGTGTACMGCTTCGACCCACGCGTCCGCTTAMAGATTTTTTT 

TTTATGTMACTGTTGMTATTTGAAATAGTCCACTTCACCTTAATGGGTCTTGTCTATC 

TTCATTAGTCTTCAMGAAAMCCATTTGCTACCAMGTAAATCAGTATTTTGAATGTGC 

TTCTCnGTTTTTTGTTTATTAGCTAGTTCCTGTMGCAmCCACC 

MTCGTMGGAAGCTGTTTCTTTTAAMCACAAACCACCACCAAAAATTTAAATGTACCT 

GCCCG 

Sequence 525 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTTTCAAGACCCGCCTGGC
CAACATGGTGAAACCCCATCTCTACTAAAAATATAAAAATCAGCCGGGCATGGTGGCATG

TGCCTGTMTCCCAGCTACTCAGGAGTCTGAGGGAGGAGAATCACTTGMCCTGGAGGCA 

GAGGTTGCAGTGAGTCGAGGTTGCGCTACTGCACTCCAGCCTGGACAACAGAGGGAGACT 

CTGTCTCAAAAAAAAAAACCTACAGCTGTTCAAGGACCAGCTGACAGGTCAAGTGTGGCC

TTTTCTGGTCTTTGAACACATCATAGAAAGTGACAAATGCTGCAAAGCCATGMGAACAT 

GAACTATAAACGGGTAGACTAACTGCCCAGCTTAGACACTTATCTATGCCACAAAACAGC 

TGAATT 

Sequence 526 

CCGCGGTGGCGGCCGAGGTACCAAAACCCGGCTTTTTTCGAAATACCTGCAAAAAAAAGT 

GGATGATTCCAAATCCAACTGAAGTGTCTGCTCTCTCTCCAATTCAGAACAACCAGAAGG 

GCCTGTCTTGMTTAGGTMTGCATTAMGAAAAGTAGGATTATTATATTCCAATTCTTT 

CCATCAGATGTAAAACATTATTGGTAGATCACATCTGTTTTAATAAATCTGTAAGAAAGA 

CGTGTMTTATMTTATGTTACCATTGTATGTAAATGGCATTTTAACAAGACATATTAAT 

ACATTTTTATAGAGTACCTGCCCG 

Sequence 527 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGT 

CCGGTTAATTTGGGAGAAGGAAGAGAGAGTGACATATTTGGCTACCTTCAGGGAACAAAA 

TCTAACAGCACAGATGGTAGTAGAGGAGATACCAATTTACATATTAAGGAGCTAGAGTTG 

ATGATGGTATGACTCAGCCCTCTGAGATTAAATTCTACTTACTAGGGCTATGAATGGAGA 

TMGTAGGTATCCCACCTTTTATTAGAAGGTTCCTTAAAATAAATATGGGACTCTGGTCA 

GAGAGTAGGGCCATTAATTTGCTCCTGGTTTTTACCTGGCATCCACCCACCAGTACCTGC 

CCGGGGGGCGGNCGGCCCGCCCCGGGCAGGTCCCGCAC I I I I I 1 1 I I 1 1 1 1 I I I ICCTTT 

Sequence 528 

CCGCGGTGGCGGCCGCCCGGGCAGGTATTGCCCTTTGATGTCCCCATGAGGGCCAGGCCC 

AGGCAGMCCCATCCCATTTTATCCTTAAACTCAGAAGGAAATTTGTCTAAATATTAAAG 

GATTAATATGGGGAATAAAAAATGAACCTTAAACCCTGCCACTGATACACAAGCTGTCTC 

TCTTAGAGTTCAATGAACACTTCAGGAGAGTATTTCCAACAATATTTAGATATTGGAATA 

TCTAMTATTGTTGATTTAGATAACCACCCTAGATTTCTCACCACCCTAGAACATTTAGT

GGGGAGACATTCTTTTCTCCTTTTTCTGATAACTTGGTCAGAAGTGATTGACTGTGCAAA 

TGGTATTTCTCAGCTAAAATCTCCCTTATGAACCCTTCTCGAAATCC 

Sequence 529 

ACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCCCATTTCCCTGAAACA 

AGCAGCCAGCAACTATCTCAGAAATGTGTCATTTTTACTGGTTATAATTCTTAAAAAGCT 

TGTTTTCCTAAGATATGAAATGCCTGCCAGTATACAAACTGTTGTAACTACTTCCCTTTT

TGCTTTTAGCGGGGAAAAAATAGCTTAATGACAGCATAGAATCATGTAGTAAATATAATT

CAI I I I I IGAAAGGTTCAGCTATATCCTCTTCCATTTGTTTATTTTAAATGATCTAATTG 

CMACATGTCATC ACTCCCTTG ATGTTTACCTCCTTGTTATGCATTTTTAGCAGGCTTTA 

TTGTCACCTGAGAI I 1 1 1 I I I 1 1 CTTTGACAGGCCGGAGTCTAGATGAAGGAAAATGTGT 

TAGAAGCACCTTATCCACAGATGGGG 

Sequence 530 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGTAGCAGAGCAGCTCCCTC 

GCTGCGATCTATTGAAAAGCTTGAGTCGACCCACGCGTCCGCTGCATAAAAGTTATGCAA

AMGCATTTTATGATATACCAGCAAAAAACATGGAAAATGAAATTTTGAAAAGCAATGCC 

ACTTCAAAGATCCCTCAAGTGCCTAGAGGGAGAAAATGAGTTAATATGCTTTGAAGAACT 

GTATCCAGAAAATAAAATTACAAAGGAGGAGAGGGATAGGATTCCAGGACAATCTCAAAA

CTATTGCTTTTTCCTAAATTCATTGCAACCTTAAAATCCTAGCAAGTTCTTTAATGTAAA 

TTAACAAGCTAATTCTAGAATTCATATGCATATTCAAAAGTCGAATAATTGTCAAGGCTA 

TCCTGTAGAATGGGACAGAGAGGATTGAAATTT 

Sequence 531 

CTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTAATCCCGTCTTAC 





AGAAGAGAAAACTGAGATTTAGCAACATAAAAGTATTTCCCGTAAGTAAACAGTAGAGCC 
AAGATCTTGACCTACGCCATCTGATACCCTGAGCCCATGCTATAAAAGAGGAGCATTAGA 
MTATTTGAMGATAGAAATGAGMCTAGTCAATATTTATTTTGCTTAGCACTGTATTCA 

GTATTATGGCATCTTAMGTAGTTMGACTCMTAmCATCAAAAAAGTTTAAAATCTA 

ATCAGAGAAT 

Sequence 532 

CGCTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAATAAGCCCACCC 

CACTAGGAACTATGTTAAAAAAAAATTCAAGAAAGAATTTAAGGGAGATTACAGTGTTAC 

TGTGACACCAGGAAAACTTAGAACTTTGTGTGAAATAGACTGGCCAGCATTAGAGGTGGG 

TTGGCCATCAGAAGGAAGCCTGGACAGGTCCCTTGTTTCAAAGGTATGACACAAGGTAAC 

CCGTAAGCCAAGGCACCCAGACCAGTTTCCATACATAGAAAGTTACAGCTGCTTTTATAC 

CCCCTTGCCCCGCCAACGTAGT 

Sequence 533 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCGCACTTTTT

I I i i M i 1 1 GTAGAGACAGGATCTCCTTATGTTGCCCAGGCTAGACTTGAACTCCTGGGC 
TCAAGGGATCCTCCTGCCTTGGCCTCCAAAAGTGCTGGGATTATAGGTGTAAACCAGTGT
GCCTAGCCTACAGTTTTTTMTTTrATMAATGTTATTTCTAATTTTTCTCCAAAAGTAA 
AAGTGGCATTCCAATGGCAATATTAATTCAGGTATCCAGAACTCTTAACCTAAATTTGGG 
TGAGATGAGGAAAAGTGTATTGTTAATTTTATGTGTCAACTT 

Sequence 534 

CTACTATAGGGCGMTTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTCTTCGACCC 

ACGCGTCCGTAGTAATAGGAATTAACTGACCCCTTTTGGATGGGGGAGAGCATCAGGCTG 

GGGTCAGGTAAGTGTAAATGGCCTTCTGAGCATGCTCTTCTAGGCTGACTCCCAGCCCTG 

ACTTGAMCCATTAGCGCTMCTTGCTCTGTTTTGAGAAAAACTTTCCAAACTTTTGCAT 

GAGAAACTAGAAAAAGGAATGTATGCCACGTAACTGGATTACAGAAATGAGTTAATTGTC 

TCTGTGATAAAAAAAAAAAMTGAAATATTTTCTTATTGAATTAATATTTTTGTCTTGAA

GCATTTTCTAGTGATAGMTGTATTTGTCTTTTTTCCTGGTGGTACCT 

Sequence 535 

CCGCGGTGGCGGCCGCCCGGGCAGGTGTCCCCATGAGGGCCACGGCCCAGGCAGAACCCA 

TCCCATTTTATCCTTAAACTCAGAAGGAAATTTGTCTAAATATTAAAGGATTAATATGGG 
AATAAAAAATGAACCTTAAA 

Sequence 536 

GAANTGGAGCTCCCCGCGGTGGCGGCCGAGGTCCAGTAGATTTGGAGAGTAATACAAATC 

CTTTCTTTCTGGTTAGAACACACTGCCAAAAGCCACCTCTTTCATCTAAGGAAAAGATTA 

AAAATGCATGTTGATATCTCCTAACTATCACACAACTTCCACTATTACAATGAAAAATCT 

G GTCCC CTTTCATTGCCTTTGA MACC NTTTTGCCGAGGTGGNTTTCAAAAAAACNCGNG 

ANTTTTNAAAAANTTG GNTTTGGTTTTACCNGGGGAMGGGGACNTTTNNCNNTTTTTTT 

I I I i i i i 1 1 I I I I I I I INAAA NGGNGATTNGGTTNNGGTTNTNCCTGGGGCCAAAATNCC 
NTTT TGNGGAACCI I 1 1 I I GGGGTCCNAAAANNNACAAAANAAAGGGNTTGGGACNATNT 
TTTTGNATNCNCNCNAAAAAAA I I I 1 1 I 1 1 I I I 

Sequence 537 

ACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTTCGACCCACGCGTCCGCTA 

GGAACTATGTTAAAAAAAATTCAAGAAAGAATTTAAGGGAGATTACAGTGTTACTGTGAC 

ACCAGGAAAACTTAGAACTTTGTGTGAAATAGACTGGCCAGCATTAGAGGTGGGTTGGCC 

ATCAGAAGGAAGCCTGGACAGGTCCCTTGTTTCAAAGGTATGACACAAGGTAACCCGTAA 

GCCAAGGCACCCAGACCAGTTTCCATACATAGAACCTGCCCG 

Sequence 538 

CACTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCAAGCTTCG 
ACCCACGCGTCCGTGATAACTTCTCCTAAGTGCCAGGCATTGTATTACATGCTGGGAGCA 
CAAAGATGAATAATAACAATAGGTTCACAGAAAAGATGAATTGATTGAGAGAAAAAGAAC 
CCTCCAGGAGCCCTCAGCGTAGTAGGGGGTTGGTGTTGGAGNGGTGGGAGGGAATGGAAA
AGGCCCTGA
Sequence 539 

AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGACAAGCTTCGACCCACGCGTCCGCAAGT 

TTTCAAAATGTAMTACTTCCTCTGTTTAACAGTCCTTGGACCATTCTGATCCAGTTCAC 

CAGTAGGTTGGACAGCATATMTTTGCATCATTTTGTCCCTTGTAAATCAAGATGTTCTG 

CAGATTATTCCTTTAA 

Sequence 540 

TACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCCCATTTCCCTGAAAC 

AAGCAGCCAGCMCTATCTCAGAAATGTGTCATTTTTACTGGTTATAATTCTTAAAAAGC

TTGTTTTCCTAAGATATGAAATGCCTGCCAGTATACAAACTGTTGTAACTACTTCCCTTT 

TTGCTTTTAGCGGGGAAAAAATAGCTTAATGACAGCATAGAATCATGTAGTAAATATAAT

TCATTTTTTGAMGGTTCAGCTATATCCTCTTCCATTTGTTTATTTTAAATGATCTAATT 

GCAAACATGTCATCACTCCCTT 

Sequence 541 

ACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATTGGA 

TTGTTAAAAGAGGAGTCTAGAAAAATTAATCCTGAACCCTAAAGAATAAATCTTAAGTGG 

TGGATACATGGGTTGAATAGTGTGCTCCAAAATTCACATCCACTTGAAACTTCAGAGAGT 

GGCCATATTTGTAAATAAGGTATTTGCGGGTGTAATCAGTTAAGGATCTCAAGATAAATT 

CATCCTGAATTATAAGTTGTCCTTAAATCCAATTACTGGTATCCTTACAAGAAGGTGAGA 

GGAGACAGAATAGAGCCATCTGAAAAGGGTCAGAAA 

Sequence 542 

CTAACACTACTATAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGAC 
CCACGCGTCCGCAAAAATCAATCAAGGGTTCCTACTCAAGTAAAAAGCAACTTGTAGGAA
AATAATAGGGGATATATTTTGCTCATTMGGATCTTTTTATAGTGGCTCTTGGTGCACTG 
CCTGTGAGTTAGCCCTTATCCTCAAGGAGCAGCTTAAAAAAAAAAAAAAAAAAANGT 
Sequence 543 

CTACTATAGGGCGATTGGANCCTCCCCGCGGTGGCGGCCGAGGTACTTCCTGGAAATCAA 
TTAACTGAGTCTTTTGAAACCCCTAGAGAAGATAGGAGAAAATTGGTTCAGANCGAGCAT 
TTAMTTMGTCAGCAMGTCAGMTTTAAMTTGGGCAATTCCTTGTCTACATTTTCTT 
TACACTCAA 

Sequence 544 

CTNACTATAGGGCGANTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTGCCAAAGCC 
T TAAC T TAGA TCGTTACTNATGTTTCTMATCANGTGGMGCACATTTCCTTTTCTTCTT 
CTTTTCTTTTACTGNNMTATCCTNATTCTNTATTTTACCAGTGGNGMTGTTTAGAATT 
AATTTCCATTTAGCTCMGATTCAAGAAATGCAAAGTGCTATTTTTATCAAATTTCTGAA 

AGCCTACTGTCTTCTGCTTTGGAAGTCCCACAACAGCTCTTTAATTTCCTTAAGCC 
Sequence 545 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCATCACACAGAAGGAGG 

AGGGAGCTAATCCAGTAACAAACATTCAAAGATTAAATTGTAGATATGCACCTCTGTATT 

TGGCACTGTTGATTAATATTATAACACCTTCCTCTCAAAGACAGGCATTCTTAAGCGTTA 

GTCACAATATACCAGMTTTGCTATTCATATTAAAACCACCTTTTAAACTTTATAACAGT

AACCAATTATTATAGTnTAAGAAACAAAACGCAATGAGAACTGGGAATGGAATTCAAAT 

CCTpCAMTTCTTGCTATGCTCCAAGCTGCCATCCATAAAACAGGTTTAATTTGGGTAAT 

TTTTCCATTGTGGGGAAGGGTCAACAAGAAACAATTTAAAGACAATATTTTCCAATACAA 

ATAAAGACATACACTTTTTGTT 

Sequence 546 

TACTTAGGGCGATTGGNANNTCNCCGCGGNGGCGGCCGCCCGGGCAGGTACAAGCTTCGA 

CCCACGCGTCCGMATMTAAAGCTAGMGTMTATTTTTCCTTTTGTCTATTTTCCAAA 

TTGACTCGATATTGATGGCTACTTTTGTMGTTTTTATTTAAGNTTAAAGGGAATATTTA 

TTGATCACCTCTATGTGCTCANTACCT 

Sequence 547

TACTATAGGGCNATTGGAGCTCCCCGCGGTGGCGGCCGCA CI I I 1 1 I I I I 1 1 1 1 1 I 1 1 I I 

TTCCAGTGCTAAAACATCAGATAAGAGCCTACCTGACATTTTGGAGAATTTGCTGNGCTG 

GGATTGATATTCCGCATTGCCTAAGAGTAAAAAATAAGACGGACGCGTGGGTCGAAGCTT 

GACCTGCCCGGGCGGCCGGCCGCCCGGGCAGGTACCACAGGAGGCAGAAGGAAATCCTCA 

ACCTTCCGAAGAAGGCGTAAGCCAGGAAGCAGAAGGAAACCCCAGAGGAGGGCCGAATCA 

GCCTGGCCAGGGATTTAAAGAGGACACACCCGTTAGGCATTTGGACCCTGAAGAAATGAT 

AAGAGGAGTAGATGAGCTTGA 

Sequence 548 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGNAGGGTCGCGGTGGGTCGA 

CTNANGCTAGAGAATTGTAATACGACTACTATAGGGATCTAGATCACGAGC 

Sequence 549 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGG I I 1 1 I I M 1 1 I I I I I I 1 1 

TACCAGMCATCACATMGTTTATTTCAGATGTAACAGCAATGTTAAAATTGACAAGTTT 

MTTCTTAACTGCACCMGTAMCTTAGCCATTTAAGTAT7TTTTTAAGTTATTCCCTCC 

AAMMCTGA GGGA GCTTTTCTTTTCCACCACCACACCATGGTTTCCCAATAGTTCTCTT 

TTTGGAGGACTTTTCMTTGATGAGTAMCTGCTTTAGATATTTCAGAACTTCATTCCCC 

AAATGAAAGCTAATCTGGACAAACTATATATTGCATAGATTTCTCTACAGATTCTTTGCT 

TTAAAA 

Sequence 550 

CACTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGGATGAGGCGGGGA 
GGTGGGACCCCCAAACATATATCAGCCCAACAGCCCTAAGTCTCCTTCTTTATTATTAGG 
AAAACAACAACAACAACAAACAAAAAAATGGCGTCATGAATATGAACAGCATTGTCAGAT
GAATTAGTTGAAGTGGN I I I 1 1 I I I G I I I I I I I I I 1 1 1 1 I I IGAACCTGCCCG 
Sequence 551 

CTATAGGGCGAATNGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCANGTGCTTCGCCCCAC 

GCGTCCGNAATAATTGGAAANGGCCTATAGATTAAAAAGCTGAGAAAGTATATGGTAGGG 

AGCACACTCCCCACAAGTATGAACTCTGNGATTACGACATCTCATAAATNCATGAGCACT 

CATGTTGGCTTGCTTTGTAGCTATGAACTTACCCTGTATTATTGAAACGTCAGCATAATG 

ACTGGAAGGAGAAATTGGTCCATTTTAGAGCATTACTATTATGCTATCTGTCCATTTAAA 

TTAATAATTGCATTAAATTCATTTTAGAAGGNGCTATTACATTNGTAGTAAGAAAGTAAA 

TTCATATATAAATATTTGATTATCAGATGGTTTACTTACAGATACTTATTTTCCTGTAAA 

ATAGGAGAGTTTACCTGAAGAAAAATAAAACTTTTNACTTTTCTGGGAAAAAA 

Sequence 552 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGNGGCGGCCGCCCGGGCAGGTACCAAGTGAA 
TTTAAATMTTG GTGTG GATT GGCCAG TAGCTAAGAAGTGGGCTTTTAAAGAGTNTTGAA 
NATNGAANGGG I I I 1 1 NrrTTCTTTTTTAAAAAAGAAAAACAAACTATTGATTGTCTATAA 
TGA AAAGC TAGGNNTTGCCCTNTTCATGTNTACTCTCCTTCCAAATAGTTATATCCAAAA 
CTGi I l 1 1 CCCTCTCCCCTACCTTGTCCCCCCTATTAAAATAN AAACNG GG ATTG ATTAA 

TGTCCCGCTCCTGAATACATGTAAAATTTGTACCTCGGCCGNTCTAAAACTAG 
Sequence 553 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTC 

CGCTTGTCAATAAAAAACAAACAATTGCTAAATAAAACAACCTGAGAAAATCTCCAGAGA 

ACTATACTGAGTGAAGGAAGAAAAATCCCCAAAGATTACACACTGTATGTCATTTATATA 

ACATTCTTGAAATGACACAATCACAGAAATAGAGAATACTGGTCACTANTGCATTAAGGA 

AGGTGTGGAAGGATGTAGTGATGGGAGGAAATGTGTATGGCTGTAACAGGGCAACAGAGG 

CNTCATTGTGATGATGGAAGTGTTCTGTNTCTTGGGTTTTTTGAATGTCA 

Sequence 554 

CACTACTTAGGGCGAATTGGAGCTCCCCGCX3GTGGCGGCCGTTAAAGGAATAATCTGCAG 
AACATCTTGATTTACAAGGGACAAAATGATGCAAATTATATGCTGNCCAACCTACTGGTG 
AACTGGATCAGAATGGTCCAAGGACTGTTAAACAGAGGAAGTNTTTACATTTTGAAAAC 
Sequence 555

CTACTTAGGGCGAATTGNANCTCCCCGCGGGGGCGGCCGCTAAAGGAATAATCTGCAGAA

CATCTTGATTTACAAGGGACAAAATGATGCAAATTATATGCTGTCCAACCTACTGGTGAA 

CTGGATCAGMTGGTCCMGGACTGTTAMCAGAGGAAGTNTTTACATTTTGAAMCTTG 

CGGACGCGTGGGTCGAAGCTTGTACACCTT 

Sequence 556 

TACT TAGGG CGAATTG GAGCTCCCCGCGGTGGCGGCCGCACI I 1 1 I I I I I 1 1 II I I I I I G 

TTTTGGTTTGTTTTTTGGGCTCTCAGGTTTACCACTCTCTCCCCATCTTCCCAGGTTTAG 

GAGGCAGGCCTCTNCCTTTCCTACCCTCCCACCCGAAGGTGAGTCACTAGAGTGTTACM 

GCTTACTGGCAGGGAACACTACAAATTAACAAACAGTAAGTTACCAAAAGGGGATACACC 

CTTATAAAAGGGTACCTN 

Sequence 557 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCACGCGT 

CGGCTTTCAATAGATCGCAGCGAGGGAGCTGCTCTGCTACGTCACAATCTTTCAAAAAAA

TGAACATGTAAGAAAAAGCAGTTTTCATTGTGCTAATTATTGCAGGCCTTCATGCACGTA 

AACCTCAACAAAATGTGTGCCAACAATATACAAATTTCCATATAAACAAAGTCATTGATC 

ACTAACAAAATATAAACATGGNTTCTTTTATATTAG A I I I I I I I I A AAAAAAAGCTATTT 

ACCAGCAAGAAAAAACAAGTACCTGCCCG 

Sequence 558 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGC 

ATTTATTAAGGCTTGTATATGTTCAAGATCCAGTGAAGACTGTCTTGGGCGTGTATAATT 

GATCTTAACCACAAGGCTGAGAAGTTATGTGCAGGGCTTATGATGCTACTTCCAAAGTAT 

TAAATCCTCCAGAGAAGCCTGTAGTGTGGGATGCAAACTATTTTAAGTGTGACCATGAGG

TGTTTTTTTGTGGACCATTTTAAANCCAATGATAGGTTCTAAAGCAATCTCAACCTGAGT 

TAGG 

Sequence 559 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGGCTTCGACCCACGCGT 

CCGCTCCAGGAGACTTCTGCTTACCTCTCAGTGATCAAAAACCGTTTCACCACAGTTACT 

TACCACGTCCTACCGATCCGCATTCTCGCAAGTGTCCTTCACTCCATTTACTCTACTGCA

I I I 1 1 CACTGTATTTCTCATGCCAAAACTTGGGCTTCTCCACCAGTCTGCACACGTTCAT 

GCTCTCAATTCTCACAGCCATCTATTTCATTCTCCACTAAACTGTTAGAGGGATTTCTGN 

AGAAATTAAAGAAATTCCTATCACTCCTAAAAAAAAAAAAAAAAAGTGCGGCCGCTCTAG 

AACTAG 

Sequence 560 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGTACAAGCTTCGACCCACG

CGTCCGCAAGTTTTCAAAATGTAAATACTTCCTCTGTTTAACAGTCCTTGGACCATTCTG 

ATCCAGTTCACCAGTAGGTTGGACAGCATATMTTTGCATCATTTTGTCCCTTGTAAATC 

AAGATGTTCTGCAGATTATTCCTTTAA 

Sequence 561 

CTATAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGCTTTATTTATTTCTT 

TTAGGAATTGCAGGTTCCTAACAAAGTAGGGGTGAGGGGGGTGTTACAAACCAGTCACTA 

GGCAGGAACATTAGACTCCAAAAGCAGAGAMTGCTTAATTTTTCTTCTACCTGTTTCAC 

CACATTCATGTANAACTGTAGTAAAAAAGATGGNGAATCAGGCTGAATCAATCTAAATAA 

CAACTTAAGGCTCCCAAATCACATGAACCTAGGACCACTAAATCCAATGTCAGACGTGTT 

TAAATGGNGCACTGCTCTACATTTTTCTATTATGCAAAGAGCTAGAAAATAATGGTAGTG 

TCATTATGACATTCCATGAAAATGAAGAAAATCTTTCANGAAAAATTTAGAAAATAAAAA 

TGTTTACT 

Sequence 562 

TTAGGGCGA7TGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTGCCAAAGCCTTAACT 
TAGATTGTTACTTATGTTTCTAMTCTGNGGAAGCACATTTCCTTTTCTTCTTCTTTTCT 
TTTACTGNTAATATCCTTATTCTCTATTTTACCAGTGGAGMTGTTTAGTATTAATTTCC 
ATTTAGCTCAAGATTCAAGAMTGCAAAGTGCTATTTTTATCAAATTTCTGAAAGCCTAC 





TGTCTTCTGCTTTGGMGTCCCACMCAGCTCTTTMTTTCCTTAAGCCCCACTTTCCTC 
ATCAGCMGTTGGTGTGGCMTGGATCATAATAGGTTGCTGGGAGGATGAAGTGAGCGGA 
CGCGTGGGTCGAAGCTTGTACCT 
Sequence 563 

CTACTATAGGGCGAATNGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCAGTAATCACATA 
AATTCTGCAATCATCT GTTTA TTTAGCTTAACTGI I I 1 1 I I I 1 1 1 ATTTGTTGAAGTTGT 

TGTTGTTATTNCAGTCTTTTTCTTATTGGGTTGACCAGACTTGGTAAAATCTGTAAGAAA 

GTTCCATAAT 

Sequence 564 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGTTCTTCAGATTCAAAT 

GCCMCACTMTTTGMCTTCTTTGGGTCTATGACAGTTTGCAAGCCATACAAACCCAAA 

GAGCTAATCTGTGATTTCTTAACTTGAGAAAATAATAATNATAACCACCACTGGAACCTA 

CATAGGTTTGTNGNTTATTTMCATGACTTMCCTTTTGTTTGTATTTTTTTGAAAAAM 

AAAAAAA AAAGT GACCTGCCCGGGGCGGCCGGACGGCCGGGCAGGTCGCNCTCAAATTNT 

TNMTTTNTTTTGGAAAGACANGNATTTTTTATTTTTGCCAANGCTAAAACTTNACNCTG 

GGCCTTTAAAGGGGATTCCNTNCTGGCCTTTGGGCCNCCAAAAAGTGCTTGGGATTNTTN 

GGGTNNAAACCCCGGNGGGGCCCTAGCCTACCAGTTTTTTTAAA 
Sequence 565 

CTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTCC 

GTGCAAATTTGACTTTGTAAATGGCCCTTGGGCTCTGGGAGGAAAGCAACTGTTGGGCCA

TGTGGTTGTATCTTTAGTTTTGTAAAGAATTGCCAAACTGTTTTATAATGTGGGTATATC 

TTTCCACACTTCCAGCACAATGTATGAGTGATCCAGTTTCTTAGCACCATAGTCAGAATT

TACTGTTGCTACTATTTTTTAGCTATCCTGATAGATGTGTAGTGATATTTTATTCTGGTT 

TTGAAGCAGTGTCATTGTCTGGGGTAAATCCTTGAGGTTTGTTGTCTCAGTCAAGGGGAA 

TCAAGGGACATGGACACACAAGTAGTGAATTTAAGAGTGGAAGTTTAATAGGTGA 

Sequence 566 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCAAGCTTCGAC

CCACGCGTCCGTCTTATTTTTTACTCTTAGGCAATGCGGAATATCAATCCCAGCACAGCA 

AATTCTCCAAAATGTCAGGTAGGCTCTTATCTGATGTTTTAGCACTGGAAAAAAAAAAAA 
AAAAAAAGT 

Sequence 567 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGGCAGGTCAAGCTTCGAC

CCACGCGTCCGCATATTTTCATTACTTGACAATGTGGGATGGGTGCAATTTATTCCATCT 

TCACTAAAATAGAAGCAATTCCATAGGTACCATAAACCTATTTTAGGTACCACAAGGTGT

CTTTTTACACAGCTCATTTGMTACAGGTGTTCTGAGMGGGGTTTCTATTTTAAAATTA 

CCATATCAAAATAAATGTGCCTTATTTTTTTATAAGNCTTGTTAAATCAGTGTCCATATT 

ACTGTTTGGGGAAGG 

Sequence 568 

CTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCAGGTACAAATAAGCCCACCCCACT 

AGGMCTATGTTAAAAAAAAATTCAAGAAAGAATTTAAGGGAGATTACAGTGTTACTGTG 

ACACCAGGAAMCTTAGAACTTTGTGTGAAATAGACTGGCCAGCATTAGAGGTGGGTTGG 

CCATCAGAAGGAAGCCTGGACAGGTCCCTTGTTTCAAAGGTATGACACAAGGTAACCCGT 

AAGCCAAGGCACCCAGACCAGTTTCCATACATAGAAAGTTACAGCTGCTTTTATACCCCC 

TTGCCCCGCCAACGTAGTTAAGAGAACAGCAGCATAAGCGGCTGGCAGAGGCAAGGAAAG 

ACCAGTNNAGAGAAAAAAAGGCCATCTATACCAATTCTAAG 

Sequence 569 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGAGCGGCCGNC 

CGGGCAGGTACACAAACCAGATGTATGCANTGATGCCAAAAGTCATCTNAAAATTCCAAG 

CTGACCTAGTGCAACACATTTACA CTTGG ATAAACTATCACCTTGAAGATTTACTTTGTA 

TACTTCAAAAGATCTGGTCATGAAATTTTTAGCTAATACATAAAGNGCCGAATTGAAATC 

CAGAATACAATAGCTNTGAAGGGCCGCTAGAGTGCAGATAACCAACCCATTCTACCTAAC 
TCAGGTTGAGATTGCTTTAGAACCTATCATTGGGCTTTAAAATGGTCCACAAAAAAACAC
CTCATGGGCACACTTAAAATAGTTTGCATCCCACACTACAGGCTTCTCTGGAGGGATTTA 
ATACTTTGGG 
Sequence 570 

CTACTTAGGGCGA ATTGGA GCTCCCCGCGGTGGCGGCCGAGGTTATTTTCCCTCAGTAAC 

ATGTMTTGCTACATTTTTTATAAGAAGGTATGGTTAGAAAAAAATGTGAAAGATCACTT 

AAACCAAAGCCAGTTACAAGGAGTAATCTCTCCTGTTGGTTTACCTTCACCTCANAACTA 

CMGMTATTACAATACATAGTGAATAGTTGTCTGTAACATTTCTACCAGTTGTTTCANT 

AGCATATTGGTCT TGGCATTTCTTGG CACTGTGGTTCTGCTGTATTATTTGTGATGTCTT 

ATTGTTTGTGAGCTTTTGTTTTTTTTTAAAGAAAAAACAAAAACTAAGTG

Sequence 571 

TAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCCAGTAATCACATA 
AATTCTGCAATCATCTGTTTATTTAGCTTAACTGNTTTTTTTTATTTGNTGAAGTTGTTG
TGTGTTATTTCAGNCTTTTTCTTATTGGGTTGACCAGACTTGGTAAAATCTGTAAGAAAG 
NTCCATAATTATGGGGAAGATTTCCTCTGAATTGGCTAAATTCCTGTAGCTGAAAAAAAA 
AAAAAAAAAAAAGT 

Sequence 572 

TTAGGGC NATT GGAGCTCCCCGCGGTGGCGGCPGAGGTGTACAAGCTTCGACCCACGCGT 
CCGCMGTTTTCAAAATGTAAATACTTCCTCTGTTTAACAGTCCTTGGACCATTCTGATC 
CAGTTCACCAGTAGGTTGGACAGCATATAATTTGCATCATTTTGTCCCTTGTAAATCMG 
ATGTTCTGCAGATTATTCCTTTAA
Sequence 573 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTTACGGC 
TTCTGACAGGCCTCAGAAG AACATTCC CTACCCCAAAATTATAAAAATAATCTTGTATAT 
ATTCT TCTCAAMCTTTATACTTTTTTTMGGCTTGGATTTTTAATCTATCTGGAATGTA 
TTTTTAAATACTGAGTGAGTCACTTTTCTCCCGGACGCGTGGGTCGAAGACCT
Sequence 574 

GAATTGGAGCTCCCCGNGGTGGGGCCGCTCGAGGCCGCTCTGACCTCTTTAAGGNANACT 

TA7TATGCTAATAGTTGATGCGCCCTTTTGCTGNANCAGTTACCATAGGTTACATGATAA 

NTATATATTGTTGNGATGATCTGATTNCCTGNNTNCCCCACCCNTGCAAAACAACAACAA 

AAACCTTTACCAGGCTNTATMCANGGGGACCAAACTTG^ITTTTGCTCATCATTGCCGGA 
CG 

Sequence 575 

AGGTA CTTACCCCAGACAACGACGCCGCTTCACCATGATGATGGACAACAGGCAACTTTT 

I M I iGGAGTTTCAGCTTGCTTCCAACAGGGACGGTGAGTGTGAGGTTTATTCCCATTTC 

TAAGACGATAGAAGTTTTCAGCCTAAGCCGTATTCCTAGGTAAGCAGCTGGATTGCAAGT 

TTTGTCTTGGAAATTCTCCTTAATGACTAAAAGTTAAAGAATTAAACAACCTAGCTGGGC 

TTAAATTTCTTNCTTACCCATTAGAAGTACCCTTGCCC 

Sequence 576 

TAGGGCNATTGGAGCTCCCCGCGGTGGCGGCCGNGGTCTAAGAGACAGGGTCTCACTACA 

TTGCCTAGATTGGNCTCACACTCCTGGGCTCAAGCAATCTTCCTCTCTTGGCCTCCCAAA 

GTGTTGGGATTGCAGGTGTGCGCCACTAGGCCCAGCTTGAAAAATTTTTAATGCATGTGG 

TAATCCACAGGAGATCACATTTAGTATATGACCAAGTTMTTAAGAAGNCAAAAAACACG 

TTAMTTTAAGCAGAATMGGCTGGGTTCGGTGGCTCATGTTAGTTTTTATCCTTAAAAT 

TGTCTGAGTTCTTAGAACACAGAAAAAACAAATTTGAATGCATTTCTAACAGCTTAATAA 

TTTATATGTCCCATTATG 

Sequence 577 

GGGGNGGCGGCCGCCCGGGCAGGTCAATGCTTCGACCCACGCGTCCGATTTCAGGTTGAC 
TTTTCTCACCTTTAACCTCTTTATATAGCACAGTGCAATCTGGCCCTACTGCCACTTCAT 
CTGGGTTATCTGTAGCTTGAGTTGTAAAAAAAAAAAAAAAAAAAGTGCGGCCGACCT 
Sequence 578 
AGGTGTCGACTCAAGCTTTCAGATATAGGCATTCCAGAATCTTCTCTTTACGAGTTCACC
TGCTAGTATAATCTCCACAACTTGAATGGCATTGGTTGTTCTGTAATTCCTGCCAAAAGC 
ATCACAAGTTGTACCTGCCCG 
Sequence 579 

CCGGGCAGGTTACAAGTCG ACCCAC GCGTCCGCTTCAGAATATCCAATTCATGTGMCTA 
CAGGAAATTATAGTTTAGATATTTTTAAATGATTTGCCTGTCACCGTATAACACAAGGGT 
GTCATGACCAAGCTAGATCTCTTTACCATATCATTMTAAAAGTCAMTTTTAAATTTGT 
GCCCAATTTGGCTGGGTGTGGTGGCTCATTCCTGTGATTCCAGCACTTTGGGAGACCT 
Sequence 580 

GTGAACCTGCCCGGGCGGCCGGCCGCACTTTTTTTTTTTTTTTTTTTTCCTTTTGAGACA

CAGTCTCACTCTTGCCCAGGTTGGTCTAAAACTCCTGGGCTCAAGCAATCCTCCCGCTTT 

CAGCCTCCCAMGTGCTGGGGTTACAGCCGTGTGCCACTGTGTCTGGCCCTTTTCTTTTT 

CATAGGAGAAGGGTTGTTGACTCCCAGGAAACGTCACCTGGAACCAAGAATGTGAACTCA 

AGGACCCCCGCCTGTTGGCAGCTGCATTTACTTGACTCCTGTTCACTGTTTCTTAGCCTT 

GTCCTTTCTCTCCTGCCAGTTCTAGGGGACACTGCTTCTCCTGGTTGACCTC 

Sequence 581 

CCGGGCAGGTACCCTAAAACTTAAAGTATAATAAAAAAAAGATTATTTCCTATACTTCAA 

MTCMCAAGATTTGATTGCATTTAMTTTTTCTGTCCATTTGTTCTTCTATGTAATACT 

TTAAAAATAATTGGCATAAAAATTCAATCAATTCATAAAAGTCCAAAGCAAAAAAAACAA 

ATCTACTGACATCTCTTGAGGAAGAAATGATCAGGATTGACATTAATGAACCCTCTCACA 

GAGACCACTACACACACACACACAAAAAGAAGGATGGGTGAATGGATGCAGAGAGAATTT 

AATMGACTGAAATGATGCCATACATGCTnTAAAAAATAAAAAGTATTAAATTTTAA 

Sequence 582 

CCGGGCAGGTACTTATACTAGAAGATGCTCCAAGGTTTCAGAAAGGAATTAATTACTTTC 

AATTTGCACMTTTAGAACAAATATCTGGCTTTTCCCTAAGCTTAATGATTTTCCATTTC 

ACACAACTAAAATATAATAGCATTATTTTATAATCAAGTTTAACTGATGGTCTATGATAG 

TAGAGCGATTTAGTATTTTGACAAAAATCTTATGAGACATGAAGTCATTCAATTTGCCGG 

ACGCGTGGGTCGACTCAAGCTAGACCT 

Sequence 583 

AGGTCGCCATCACACCCGGCTAATTTTTTTTGTATTTTTAGTAGAGACAGGGTTTCACCA

TGCTAGCCAGGATGGTCTCAATCTCCTGATCTCGTGATCCGCCCACCTCAGCCTTCCAAA 

GTGCTGGGA TTAAA GGCGTGAGCCACCACGCCTGGCCAGGAGATTCTTAATTATTTCTGA 

ACTCTATCAGTTTTGTATTAGGACATCTTATTTAATATTATCAAAAGATAGTTCCTCTTA 

GAGGCATAAATCAGTCAATCAACAAACAATAGGCAATCACGGACGCGTGGGTCGAAGACC 

TGCCCG 

Sequence 584 

AGGTGTACMGCTTCGACCCACGCGTCCGCATTTTTCTGGTGTTCCCTCTTACGTGCACA 

CCCCTTGCTCCCCTTTGGGTTGACTTATAATCTGACTTTTGTGACAGATGTTAGGAGGTG 

GAGCAAAGGAATTTCAGACCAATCAGTTAAGAGACTGCTGTGGGGTAAGAAAAAAAATTA 

GCCTCTTAAAATTACTCTTATCAAAGGAAAAAAAGTTGGAAGCACATGATAGTATAACCA 

GAAACATGACACAGAAGAATTAAGGGAAGAACCTGCCCG 

Sequence 585 

CCGGGCAGGTGGAAAGGTGGGTGGGGAGAGGGAGGCTTATTTGTTGCTGCAGTGTAACTA 

AGTGAAACCTAATTCATATGACTCAAACTAAGGTATATTTGGTTAGATCTAGGTGAGTTC 

TACTTTAGAGGAMTCCTGGTMCTGTTGTTTGTTTGTAAGTTATAGCTGTAATTAATTT 

TCCCTGTATTCAAAGCCCCCAAACCCTGCATTCAGATACTATGCATTTAGACTTCCTTAG 

GCAAAGTCAAGGCAACAAGCTGATGATTCTAAGCTATTATTCAAGGAGTATCTACCATCA 

TAAAGGTGGTTTAGTCATATAGATAATATCAATCAATAAT 

Sequence 586 

CCGGGCAGGTCTTCGACCCACGCGTCCGTGATTGCCTATTGTTTGTTGATTGACTGATTT 
ATGCCTCTAAGAGGAACTATCTTTTGATAATATTAAATAAGATGTCCTAATACAAAACTG 
ATAGAGTTCAGAAATAATTAAGAATCTCCTGGCCAGGCGTGGTGGCTCACGCCTTTAATC

CCAGCACTTTGGAAGGCTGAGGTGGGCGGATCACGAGATCAGGAGATTGAGACCATCCTG 

GCTAGCATGGTGAAACCCTGTCTCTACTAAAAATACAAAAAAAATTAGCCGGGTGTGATG 
GCGACCT 

Sequence 587 

AGGTACATTGTTAGACAAGTGTTTATCACTAATCTGGAATACATCATCTTCAATAAGGCT 

CTTGTTTTCCTCCAAGCTGCACTGCTCACACTGCTCAGTTTTCTGTTAAGCAACCTGCTC 

ATTATAGTAGAGCACCMGGTGATCTGTTCTTCTGTTCTTCAGAAGTTCACTATTTCTTG 

TTGCMCAGGGCTACATGATTTTAAGATTCCTCAAAGTCAATACGAATTAACATTATTTT 

CCATTTCCATTCTGTATATCTTCACATTCCATAAATATAATACTCATGTATACGTTAAAT 

TTCCTTATAAGTTCAACACATTGAAAGCTAAAATAAAGACTTCCTACTAG 
Sequence 588 

CCGGGCAGGTACACAAACCAGATGTATGCAATGATGCCAAAAGTCATCTCAAAATTCCAA 

GCTGACCTAGTGCAACACATTTACACTTGGATAMCTATCACCTTGAAGATTTACTTTGT 

ATACTTCAMAGATCTGGTCATGAAATTTTTAGCTAATACATAAAGTGCCGAATTGAAAT 

CCAGAATACAATAGCTATGAAGGGCCGCTAGAGTGCAGATAACCAACCCATTCTACCTAA 

CTCAGGTTGAGATTGCTTTAGAACCTATCATTGGCTTTAAAATGGTCCACAAAAAAACAC 

CTCATGGTCACACTTAAAATAGTTTGCATCCCACACTA 

Sequence 589 

CCGGGCAGGTGACTTGGCTGTGAAAAGTGCTAAAACAGATAAAAGACTATACTGACAGGC 

AAATGGAGCCTGTTATGACACTGACATTGAAGGTGAAAGGAGAATCCAGTTCACATTAGC 

CAGGGTCTCAGGGACCAGGTTTTGAGGCAGTATTTCTGCCTCTTGAGGACAGGGCAGAGC 

AGGTGGGTAAAAAGCAAAGAGACCAGGGAAGGGGGACTAAAAGTAAGGGAAACAGCATCT 

GAGGMAGGCTCCTCTGACTGGATTTTCACAAACATTATTTATTAACCTCACTAAACAAG 

GATAATGGGACAAAACAGGGGCAAGCTGGAAAACCAGCAGGGGTATTTGGCAG 
Sequence 590 

CCGGGCAGGTACATCACCCTGCTGAGGGACATCCAGGACAAGGTCACCACACTCTACAAA 

GGCAGTCAACTACATGACACATTCCGCTTCTGCCTGGTCACCAACTTGACGATGGACTCC 

GTGTTGGTCACTGNCAAGGCATTGTTCTCCTCCAATTTGGACCCCAGCCTGTTGGAGCAA 

GTCTTTCTAGATAAGACCCTGAATGCCTCATTCCATTGGCTGGGCTCCACCTACCAGTTG 

GTGGACATCCATGTGACAGAAATGGAGTCATCAGTTTATCAACCAACAAGCAGCTCCAGC 

ACCCAGCACTTCTACCTGAATTTCACCATCACCAACCTACCATATTCCCAGGAe 
Sequence 591 

CCGGGCAGGTGGAAAGGTGGGTGGGGAGAGGGAGGCTTATTTGTTGCTGCAGTGTAACTA 

AGTGAAACCTAATTCATATGACTCAAACTAAGGTATATTTGGTTAGATCTAGGTGAGTTC 

TACTTTAGAGGAMTCCTGGTMCTGTTGTTTGTTTGTAAGTTATAGCTGTAATTAATTT 

TCCCTGTATTCAAAGCCCCCAAACCCTGCATTCAGATACTATGCATTTAGACTTCCTTAG 

GCAAAGTCAAGGCAACAAGCTGATGATTCTAAGCTATTATTCAAGGAGTATCTACCATCA 
TAAAGGTGGTTTAG

Sequence 592 

AGGTCAAGCTTCGACCCACGCGTCCGCAGCCTGGGTGATAGTGAGATCCTGTCTTAAAAT 

GAAGAAAGAAAGAAAAAAAGAATGAGAAGGAAGGATATTAATTGAAGTAAGAGCACATTT 

GATTACAAAATAGAAGAGGAGTAAGTGAGAACTAAACGGGGAATACAGATAGCAGAGATT 

AAATAGGCTATAAGAAAAAAAAGGGATGATAATAAGACCATGGTAGTACCTGCCCG 
Sequence 593 

AGGGCCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGTTTA 

CAATATGGTGACAACAGATAACTGTAAAAACTTCTTTTTCAAATAGAACCAGCAGGAGCA 

TGCATGGMCACATATACCAMCATCTTTCTGATMCATTAMCATmTAAMGATGTT 

AAATGTTCTTTTCATTGNGGTGCTTCAGATTCCTGATTCTAGAACTTGTGTGTGTGGAAC 

CTGTGTGCTAACTATTCTGTTGGAATTTACCAGCAAAGAATTATCTAAGAATTTTCAAAC 

TAAATGATGGGGGAAGGAACTAACATTTTTGCAGNCCCTGGAAATGTAAAATGTTGTACC 






Sequence 594 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAATAAATAGC 

ATTCCCACGGTGACCACAAGTCTTGGAATCAGTTCCAGGTGTCGTCGTGGCCGTTGACAC 

CGCTG<XTTCTGACGGTAAATGTATTGTAGAATTCATGTTGTTATCAGGCTTCAGT7TCC 

TCATCTCTAAAATGAG AGGAT TGGATAAGTTAGTAGTTTCTAATTTTTACTTTTAATCAG 

TGGCATC TCCCA TTTATTTTTCATTTGAMTAAAACTTTTGAATTTTATCTTCTACCTAA 

ATAACATATTTTGTTTTATGTTTCAAGATGAAGCTCACACTGAGTTGGAAAAAAGGAAAA 

AGCAAAGGATCAAAGCTG 

Sequence 595 

CTATAGGGCCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTGCCAAAGCCTT 

AACTTAGATTGTTACTTATGTTTCTAAATCTGTGGAAGCACATTTCCTTTTCTTCTTCTT 

TTCTTTTACTGTTAATATCCTTATTCTCTATTTTACCAGTGGAGAATGTTTAGTATTAAT 

TTCCATTTAGCTCMGATTCAAGAAATGCANAGTGCTATTTTTATCAAATTTCTGAAAGC 

CTACTGTCTTCTGCTTTGGAAGTCCCACAACAGCTCTTTAATTTCCTTAAGCCCCACTTT 

CCTCATCAGCAAGTTGGTGTGGCANTGGATCATANTAGGTTGCTGGGAGGATGAAGTGA 
Sequence 596 

ATGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTCCGA 
TTATTCTCTCCATTTAGGCTATAAATCTTTCAGTGTAGGGTGTTTCTAATGTCATATTCT 
TCCAAAAAAAAAAAAAAAAAAAGT 
Sequence 597 

CCGGGCAGGTCTTCGACCCACGCGTCCGGCTCTCCAGTTTATACATGAAGAAACTTTCCG 

AAAGTCTTGCAGCTTGTGGAGAGCAGAGCTGGAGAGCAGGCTAGTCTGATTTTAGAAGGG 

AGTTA ACCAT TACATAACCTGCAGGTGGCTTCTCCCCATACCTGCCGTGGGATMTATGG 

CTCAC TTTTTACTTCATTTACMTATTTAATAAGTGCGATTTTAGACTTGAGAAGAGAAT 

ATTTTCTGCTAAAATTAT CCCC ACTAGAGATAATCACCAGTGAATTAATACACTGCAGCA 

ACGGAACCAGTCAGCTTTTTTGGTAATCATTCCCTTCCT 

Sequence 598 

CCGGGCAGGTCTTCNACCCACGCGTCCGGCTCTCCAGTTTATACATGAAGAAACTTTCCG 
AAAGTCTTGCAGCTTGTGGAGAGCAGAGCTGGAGAGCAGGCTAGTCTGATTTTAGAAGGG 
AGTTAA CCATT ACATAACCTGCAGGTGGCTTCTCCCCATACCTGCCGTAGGGATAATATG 
GC TCACT TTTTACTNCATTTACMTATTCMTMAGTGCGATTTTAGACTTGAGANGAGA 

ATATTTTCTNCTNAAATTTATCCCCNCTAGAGATAATNNACCAGTGAATTNATACACTGC 

ACCNNACGGAAACCAGTCA 

Sequence 599 

AGGTGCTTCG ACCCAC GCGTCCGGTATTTCTCTTAAAGTTAATTTTGATAGATATTTATC 
TAGATGCTTTCTTTTTTCCCTTGCCATAATAGCTGGCTTGTAGAGAGAGTTATGTTTGAA 
AAGGCTTGCCTTTTTTCCGTCGCTCTG
Sequence 600 

AGGTCAAGCTTCGACCCACGCGTCCGTGATGCTGGCTTCCCGGTCAAAGCTGAGGAGTTT 

TGTGGTGCTTTCTCAGGAACCTTCTGTCACGGAAACCATTGCACCCAAAATTGCAAGACC 

TTTCATAGAGACTTTCCTCAGGCCCTCAAGAGTATTTGAGTATCTGGAGGAGGATGCCCA 

GAAGTCCNCACAGGAGGGGGTGCTTGGGACCACACACTGATGCTCTTGNCATTCAGACTC 

TGAGAAACATGCCGCGTGATGAANAAACCATCCCAATTANANGAAGCTAGCTGNNTTNCA 

TTGNAGCAGCTTTACCCCAATTT 

Sequence 601 

CCGGGCAGGTCAAGCTTCGACCCACGCGTCCGTGATAACTTCTCCTAAGTGCCAGGCATT 

GTATTACATGCTGGGAGCACAAAGATGAATAATAACAATAGGTTCACAGAAAAGATGAAT 

TGATTGAGAGAAAAAGAACCCTCCAGGAGCCCTCAGCGTAGTAGGGGGTTGGTGTTGGAG 

GGTGGAGGAATGGAAAAGGNCCTGAAATGCANGCAGAGAAATGATGAAACAATTCCNGGG 

GCTGCGGNGAGGTTANATGAATATCTTTACAGCAGCCTNGAAGACTGATCANGTTACTAT 



ACCCTCTCTTCTGTCCACGTGCATTTNA
Sequence 602 

CCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGCATTTATTAAGGCTTGTATATGT 

TCAAGATCCAGTGAAGACTGTCTTGGGCGTGTATAATTGATCTTAACCACAAGGCTGAGA 

AGTTATGTGCAGGGCTTA TGAT GCTACTTCCAAAGTATTAAATCCTCCAGAGAAGCCTGT 

AGTGTGGGATGCAMCTATTTTMGTGTGACCATGAGGTGTTTTTTTGTGGACCATTTTA 

AAGCCAATGATAGGTTCTAAAGCAATCTCAACCTGAGTTAGGTAGAATGGGTTGGTTATC 

TGCACTCTAGCGGCCCTTCATAGCTATTGTATTCTGGATTTCAATTCGGCACTTTATGTA 



Sequence 603 

ATTGGAGCCTCCCCGCGGTGGCGGCCGAGGTAGCTTGAGTCGACCCACGCGTCCGTTCAG 

ATCCGTTTCAGAAACGTGAGTCTCTAGCTCAGGAGATTTCCACAACTGTCCTTAGTAACC 

TGATCJTATTCTCATGTTTAACCTTGGCAGTGGGAAGTTCTTCCTGGTATCCTGCCTAAT 

TTACTGGAGTTGGCATTAATGCCATTTCCCCCTAAGGCGTGGCTCTTGGACCAGTATCAC 

CTGAGMTTTGATAGACATAGACCCAGAGTTACTGAGGGCAGGTGCTCTGTTTTGGGGACx 

CAGCAATCGGTGCTTTAGCAAGTNCTTTGGGTGATAGGGGTTNTGGAAACTACTGCTCTA 

AAGCATNATCTGTTTTTGAC 

Sequence 604 

GGTACCCAAAAGATATCACTGTGAAGGTTTGGATACACTGACTGAGGAAAAAAGAAGGTC 

CTGAAAGCGTNTAGACAAAAAAAGACTACTTGTMGTTGCAAGAATCAGAATGGCATTGG 

ACTTCTCAGCTTTCCTCATTAGAAGTTTAAGATCTGAAGCAATCTTTAAACTCGTGAGGA 

AAATT AAGT CTAATAAATANTTTTCTTCTTAGCCCAAACAATCAAATGTGAAGCTAGAAT 

AAGCATTTTCAGGTAAAAAAAAAAAAAAAAAAGTGCGGC 

Sequence 605 

CCGGGCAGGTACANNTTGTGATGATTTTGGCAGCAATTACAGAACAACCAAGGCCATTCA 

AGTTGTGGAGATTATACTAGCAGGTGAACTCGTAAAGAGAAGATTCTGGAATGCCTATAT 

CTGAAATCAGAATCCTAGTAGTTTGTAGTTTGCCTCTTCCTAGAAGTTCAAGAGACTCAA 

GTCATAGGCTACAGATGTACCTN 

Sequence 606 

AGGTCTTCGACCCACGCGTCCGCAACTGTTGATCTAACTTTTCCACTTGAATGTCTAATT 

GGCAAATCAAACCTAACATGTTCCAAACGAGTTCTGAAGCACCCCCTCTGCCAAATCTAC 

GTCTCCCACAGCCTTCCCTATTTCTCTACCTGGTACCTGCCCGGGCGGCCGCTCGACCTG 
CCCG 

Sequence 607 

AGGTCTTGAGTCGACCCACGCGTCCGGAGATGTATACGCCACTATAGGAACTATAAGAAA 

AAGTCAAATGGAAATCTTATAAATAAAAACCACAGTCACTATAATGAGGAAATACTTTGA 

TAAGGTGTCAGTGAACTCAAAAATCAATCAATAGAAACTACTCAAACTAAAACTCAAAGA 

GAAAAAAAAAGATGGGAGATMTTATTTTTTAAGAATTGGTCATCAAAATGTAGCAACAA 

GTTT GCCT TATCCTATATCATTTGAATTTTCAAAAAATAAGCTCATTATACAATCTTTAA 

AATATTTTGAATAGAACTGTTTCATGTGTTATTTGT 

Sequence 608 

AGGTCMGCTTCGACCCACGCGTCCGGGGAACCCTTGTTCAGGTTCTTTTTAGGCAACCC 

AAGCCAAGACAACAAAGTAAGATAGAGCCCCAAATGTGGTCGTATAAGGTTTTTCAAAGA 

AAGTAACACTTGAGTTAGGTCTTAAAGTTTCACCTAAGAAACTGCCCAGGTGGACAAGAA 

GAAAGGGTGTTCCAAGTAGAAATAATAGCATGGACAAAGGCAATGTAGCAGGAAAAGTCT 

TCGTAMTTCAGGGAATTTCAAGTGTTTCACGATGGAAGGAGCAATAGAGTCATTTACTT 

GCGGTGGCAGGGGATGTTGGAAATGTAAACAAGAGTGAGATAC 

Sequence 609 

AGGTCAAGCTTCGACCCACGCGTCCGTGATGCTGGCTTCCCGGTCAAAGCTGAGGAGTTT 

GTGGTACTTTCTCAGGAACCTTCTGTCACGGAAACCATTGCACCCAAAATTGCAAGACCT 

TTCATAGAGACTTTCCTCAGGCCCTCAAGAGTATTGAGTATCTGGAGGAGGATGCCCAGA 
AGTCCGCACAGGAGGGGGTGCTGGGACCACACACTGATGCTCTGTCATCAGACTCTGAGA
ACATGCCGTGTGATGAAGAACCATCCCAATTAGAGGAGCTAGCTGACTTCATGGAGCAGC 
TTACACCAATTGAA 
Sequence 610 

ACTTTTTTTTTTTTTTTAGCTTGAGTCGACCCACGCGTCCGGGGATCTAGATCACGAGCG

GCCG G CCGCC CGGGCAGGTACGGAAGCCATGCACTTGCCTCTCCTTCAGAGCTGGGATTT 

TTTT TCATT TTGCTGGCTGTGAGCACACACACGCCCACAGGTGCCTAAGCCTCTTGTATG 

TGTGTTTTGAACTGTGTCCTCTGAGTTCTGTGTCTGGGTGCATGCTCTCCTCTTAGCGTG 

GGTCTGCTTCCCCTGTGTAGCACTTCACAATGTTAGGCATTTGTCTGTGATAGCAGCTGT 

TCAGTAATTTCCTACTT 

Sequence 611

AGGTTTCGACCCACGCGTCCGGMTTTATCTGGCCAGGCATTGGTAGTTTACAGAAGTCT 

ACCAGATGATTCTAA TGTGT GGTCAAGACTGAGMCTATGTGTTTMTTGGGTTCATTTC 

AAGAATAC TGTAAA AATTTTATCTAAATACTAAATATCCATAAAAGAAACCTCGGTAATC 

AGGCCAGGTTTTTGAGTTTTTCCAGATTAGCCCMCTACAGGGGAAAGAGACTTTCGCAC 

TATATCCCAGAGTCTCTGCTCCTGCTTCCAGCCTCAATGCACTGGGCCTTTCTGCTGCCT 

TGGAGCACTTAGAGGGATTACAGGAGGAGTGATCTGTGGAGTT 

Sequence 612

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCAGCGCCCGGGCAGGTACATACAGCCTGAAG 

TTMCCTTTCTATGTTAAATGAAAAAACTTTGTCATTCATCAGGTCACAGAAACCAAAAA 

CTAMCAATGCAAAAAAAAAAAAATCTAAAAATAAMGAATTTTATATTTGAAGTTATTC 

T GGATA TTCGCACCATTTTAGCTTCTGAAAAAAATGCAACTATGAAATGAAGACCTCATA 

TATTTTCATTTATCMTATMTGTTAAAAGTTTCATTCCACCGGGTGTGGNGGCTCACAC 

TTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGCGGATCATGAGGTCAGGAGATCGAGA 

GTATCCTGGCTAACATGGTGAAACCCCGTCTCTCTAAAAAAATTCNNNAAAANAAAAAAA 

AAGGAA 

Sequence 613 

CGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATTGTAGTTTTCTTAGCCATTTTTC 

AATAGT CTACACAGTGTTTATGTTTCCTTTTATTTGTGTATAGTGGAGTAGAGGGGAGGT 

TTTTTTATTCAAATAGAAGAAGCTAAAACTCAAATGCAATGTCAGATCTCANAATAAACT 

GACCCMTTTCTGAMCCCMTAAACACATTTCATTTGTMTATTCTTTATTATATAGCT 

CTATGAAAAGTAATTTGTGACTTTCGATCTTAAMGAGAGTTTTAAAAATACACAGTAAA 

TTGM AGAA AAACTACTACATTTAAAACAGTATITTCCTGAAAACATAGAATGAAAATGC 

AAGTATTTTGTGCATGGCAGCTGTTTTTMGGMCCAATGTTATATATGGNGMTTTTGT 

GGMGACTATGTCTCTTMMTATTTCTTATAAMTANCATGGCTTTTTAATAGCTGGGA 

ATCTGANGNNGGATTTCCCATGAAGACCTTAAATGGCTNNGCAGGAATTATAAAAAAG 

Sequence 614

ACTTAGGGCGAATTGGAGCTGCCCGCGGTGGCGGCCGAGGTACATGCCTGTAATCCCAGC 

TGTTGGGGAAGCTGAGATGGGAGGATTGCTTGAGCCTGGGATACTGATATTGAGGCTGCA 

GCGGGCCGAGAATACCACTGCACTCCAGCCTGGATGACAGAGTGAGACACTGGCTCAAAA 

MAAAMGAAAAAGGAAGAMAMGTTTAAATCAATGAATGTTCTCATTTCTAATGAAAT 

AATGAM CATTATT GGGAGAGTTATAGTCATAATCATCTTACTGCACTATCAATTAAAAA 

ATACATCATTTTTTAGAGCACAATATATACCATAAAGAATTATTCAAATAGTCTAAATAT

TACGATCAAATTT1TMTAGACTTTGTTACTTMACTAAMCTGTATTAGTCTGTATTAG 

TCAGCTCAAGTTGGGATTCACACCTGTAATCCCAACACCTAGGGGGGC 

Sequence 615 

CCGCGGTGGCGGCCGAGGTACACTGTGTMGTGGTCAMGATAGACATGGTTTTATTTAC 

AAGGAAATTTGCTGAAGTGTAAATTATAACACGAAGAGATGGGAGGGAGGGGTAAACACC 

TAAATGTCTAACAACAGAGAATGGTTCTCTGTTGATACAAAATTATGATACATCAAAAAA 

GAACAACAATCAAATTCTCTGAGAATTCCCATTACAGTTAAAAGGAGCTCCCAGCCAGGT 

GCGGTGGCTCACGCCTGTAATCACAGCACTTTGGGAGGCCGAGTCGGGTGGATCACGAAG 
TCAAGAGATTGAGACTACCCTGGCCAACACTGTAAAACCCCGTNTCTACTAAAATACAAA

AATTAGCTGGGCCGTGG 

Sequence 616 

TTAGGGCGAATTGGAGCTCACCGCGGTGGCGGCCGAGGTACTGAATAACTGCCAATGCCA 

TCTGCCTGTGGCCTTCTCAAGTTTGTCTGCACCTGTGGTTATCCTGACTTCAAACCCGGG 

GAGACAGAGGCTAGAAGAGGCAGACAGCTCTTGTGTATTCTCCTGTCCAGTGCAAAGAAC 

ATCT GGAACTCTGAGCCCTAACCTTAAATGCAAGACCTNATCTGCAGGTGTTCCTNATCC 

TTTTAGCCCCTCAGTGATGTAAGCAACAAACGTCACCCANCTCCTGGGGCACACTTNACT 

CCCAGATGAGCTTGTCCTGGATTTGCAGGGAGCCTGGCTCCC 

Sequence 617

TAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCGGGGCAGGTACATCACCCTGCTGA 

GGGACATCCAGGACAAGGTCACCACACTCTACAAAGGCAGTCAACTACATGACACATTCC 

GCTTCTGCCTGGTCACCAACTTGACGATGGACTCCGTGTTGGTCACTGTCAAGGCATTGT 

TCTCCTCCAATTTGGACCCCAGCCTGGTGGAGCAAGTCTTTCTAGATAAGACCCTGAATG 

CCTCATTCCATTGGCTGGGCTCCACCTACCAGTTGGTGGACATCCATGTGACAGAAATGG 

AGTCATCAGTTTATCAACCAACAAGCAGCTCCAGCACCCAGCACTThrrACCTGAATTTCA 

CCATCACCAACCTACCATATTCCCAGGACAAAGCCCAGCCAGGCACCACCAATTACCAGA 

GGAACAAAAGGAATATTGAGGATGCGCTCAACCAACTCTTCCGAAACAGC 

Sequence 618 

GCCGGGCAGGTACAGATGGGGTTNCACCGTGTTAGCCAGGATGGTCTCGANTTCCTGACC 

TCATNANGCATCCANCTCGGCCTCCCAAANTGCTGGAAATTACAAGGGCGTTGAGCCCAC 

CCG CACCTGGGC CAGMTCTTACATATTTCTTAMCATCATTAMTATATATTGTATTTT 

ttaci mrrn igmtaggggtcttgctatgttgcccaggcttggttttgaactcctgg 

CCTTNANGAGATCCTCCCGCTCTCAAACCTCTCAAAAGCAATGGGTA 
Sequence 619 

AGGTACCCCATTTTATGCCATAAGTCAGGTTTCTCCCTCMTAGCCCTTTGGAACTCTCA 

AGGTGCAGAGTGGCATCAAACCAACTGACACATGAGTTGATACATCATGTGCTGCCAACA 

GAGAMTTAGTCTGTGCCAMCTCAGCACMTCCTGCAGTTCAAACCAGAATTTCAAAAA 

Sequence 620 

ACCAAGATT TGAA TCATGCTTTCAAMGCTAATGTGAAGTTAGACATATTTGGTTTCATA 

ATCACAGMTTTTAAAAACACCAGGTCTGCAATATTCAGAAATCACCATTAACGCTCTCT 

TGACACATACAATCMTTTCACTTTAGATCGCTGATTTTCTTMCMCTGATTTAGTTAT 

TTCTGAATACTGCTAGAAAATTCAAAATCTACAATTAAT 

Sequence 621 

AGGTACATCACCCTGCTGAGGGACATCCAGGACAAGGTCACCACACTCTACAAAGGCAGT 

CAACTACATGACACATTCCGCTTCTGCCTGGTCACCAACTTGACGATGGACTCCGTGTTG 

GTCACTGTCMGGCATTGTTCTCCTCCMTTTGGACCCCAGCCTGGTGGAGCAAGTNTTT 

CTAGATAAGACCCTGAATGCCTCATTCCATTGGCTGGGCTCCACCTACCAGTTGGTGGAC 

ATCCATGTGACAGAAATGGAGTCATNAGTTTATCAACCAACAAGCAGCTCCAGCACCCAG 

CACTTCTACCTGAATTTCACCATCACCAACCTACCATATTCCCAGGACAAAAGCCCAGCC 

AGGCACCACCAATTACCAGAGGAACAAAAGGMT 

Sequence 622 

NCCGGGCAGGTACTGGATGACAGCAAGTGCACACATCAAGAGAAAGTTACCATTCAGAGG 

TGCAGTGAGTTCCCTTGTCCACAGNGGAAATCTGGAGACTGGTCAGAGGTAAGATGGGAG 

GGCTGTTATTTCCCCTAGGTCATCTCTTACATTCTAGTTCTGGTGCTCTCTATCTGTTTA 

AGACAMCCCTTGNGCACCTTTCTCCCACCCCTCCCTTTCTCCCTTGTCTCCCTTGAGAA 

AACAACTNCAGTTCTCTGCCTGCACCATGACTGTCGATACGCGGGGGCAGTTCGGCGGTC 

CCGCGGGTCTGTCTCTTGCTTCA 

Sequence 623 

AGGTACAAGCTGTGCACTGCAAGGTAACCACGTGGCCAGAGGCACATCCCTCCCTCACAT 
ATACTGAGTGGTGTAATGCAGTCACCTTGTCATCTGGCAAGAGGTGATCGATGGACACAA

ACTCCTCCCGGAACTGCCCCTCCAGCGAGCTCACTCTGAGGTTATCTGAACTCACATAGC 

TTGGGAAACCCAGCTGGGCACGGGCAACATTTGCGTAGTGACCCTTCCAGTCATCGGAGC 

ACATGGTCTTCCACGAAGCAGCTGTGAACACCTGGAGCACGGCATTCTGACCACTCACCC 

GGACACAGCGGTACCTGCCCGG 

Sequence 624 

CGGGCAGGTACTTTGCAAGACACGCCTGGCTACGAACAACATGGGACAATGGGCAGCCTC 

GCTGCACTGNACAGAGGAAAGGAAAGAGGCCTTGCAGCCACTGCCTGGGAAGGAGCAGCA 

CATTCTGCATTAACCAGGCATGCCTCACTCACTGCAATCCCCAAACAAGCCCAACTCTCC 

GTGTTGATTATTCTTACCATACTCCACCAGAMGCAGCATGATTTTCTGTCCTCAAATAC 

TTCAGATTCCAAGAGAACTGCACCTTCTAGAGTCTCTACTGATAACCTCAGNCACTTACC 

CACTTGAAGCATNAGCACACACTTAAAAAGGAAA 

Sequence 625 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATATTCTTCCAACCTTTCT 

TCTGTGCATAATCATCTAGGTGTGGTGCTTACATTTTCTTTTGGCAGTGTTATCTTAGTA 

TCTTCCAGCATGGTTTTCTCACCTGATACTGTAACCATACTTCCATATCCTCAAATGTGT 

TGTTTTCTAAAATAACTTTTTTTTTCTTTTTTTAGAGACGGAGTCTCACTTGGCCAGGTG

CGGTGGTTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGTGGATCACGAGGT 

CAGGAGTTCGATGAAACCCCGTCTCTACTAAAAATACAAAAATTAGCCAGGTGTGGTGGC 

GCACGCCTGTGATCCCAGCTACTCGGGAGGCTGACGCAGGAGAATCTCTTGAACTTGGGA 

GGTGGAGGTCGCAGTGAGCCGTGATCGCGCCGCTGCACTGCAGCCTGGGTGACAGAGTGA 

GACTCTGTCTCAAAAAAAAAAAAAA 

Sequence 626 

AGGTACGCGGGACATACTCCCTAGGTGTCTGTGAGGATGGTGGAGGGGATTTTCTCCATG 

CCGGGAGGCTTCCTGGAGCAGGTGGTGCCTCTCGTGACTCTTGAAAGATGCTTGTGAATA 

AAGCATACTGGGAGCTGAGCTGCTGTTTAGTMTTAMMTCCTTTCCATTGTTTAGAGC 

TCAGCACCTTTGTGCATTCATATTACGCATTCATTTTCGTATCATTGTTGAATTTCTCAC 

TTCTGCTACTGCAATGTATGTCTACAGCTGACAAGTCTTCCTTGGGAGCCCTACGTAGCT 

CTTTTTTTTCTTTTCTTTCTTTTTTTTTTTGAGACGGAATCTTGCTCTGTCACCCAGGCT

GGAGTGCAGTGGCGCAATCTCGGCTCACTGTAAGCTCCACCTCCCGGGTTCACGCCATTC 

TCCTGCCTCAGCCTCTTGAGTAGCTGGGACTACAGGGTGCCCACCACCACGCCTGGNTAC 

TTTTTTTTGGTAT

Sequence 627 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACTCTGCCATGAAGGT 

TCTGGGGTGGAGAGGGAAGCAATGTATATCCTACCCATGGTGATTGGTCCGATGGAAGTC 

ACATCCTGATGGGAAAAAAGGACTGAGCCAGAGTGGACTGTCTAAACCAAAATGGGATAA 

ACAAGCATGGCATGGAGCCAAAACAAATGGCTAAGTCAGAGGTCCTAATGCAGAAGGCTG 

GACAACTAGGATGGTGGGGAAAGACATGAGCTTGAAGGACTTCCCAAGATAAAGCAGAAC 

TMCCAGAAGAGCCTGTTATAGATTCATATTGGGGGAGTTTGGGGGGTTTGTGCAGGGTG 

CATCAAAAAGCACTCGCATGGAATAAACATATCTTGCACAGGAACATATGACAGGTAATT 

GAATAGTTTGATTTGAACTATGTAAAGACATGATCCTGATGGTAGAAGGATGGTACCTGC 

CCG 

Sequence 628 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTTTTTTTTTTTTTTTTT

TTGGGACCGAGTCTCACTCTGTCGCCAGGCTAGAGTGCCATGGCGCAATCTTGGTTCACT 

GCAACCTCCACCTGCTGGGTTCAAGCGATTCTCCTGCCTCAGCCTCCTGAGTAGCTGGGA 

TTACAGGCCCTNACCACCACGCCCAGCTMTTTTTGTATTTAGNAAAGATGGGGTTTCAC 

TGTGTTGGCCAGGCTGGTCTCGATCT1TTGACCTTGAMCTTTNACATAAACTTTCACAT 

TTCCATGACAAAGTTTTAGCAGTAACTTCCAAATTGGTCTTATTCAACTCCAACATTAAA 

CTTTGTATGTACCTGCCCG 

Sequence 629 
CTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATGATCC

AGGATGTGATGGGATCTTAGGGCTTGGCTGGAAGGTTTCTCCAGTCAGCCATCTAGCAGA 

GCTGCAGATCTGGGCTGGGCTGTTGGCTAAAGTGCTCTTCACAGACACCTCATTCGGCTC 

TTCCTTCAGCTTCTTCACTTATTTCTTACTCAGTCACTACTCAGCTCCTTGTCCATGTGT 

CCTTGMGCCATCCTAGGTCTTATTCTGATTCTGAATTCTTCAGTCACCCATAAGCTTCT 

CCTTACCCCGGGAGTCAGTGGGTGTGTGTTCCCAGGTGGACTTAACCATTCTTCTCCTTT 

ATGATCCTTTCCCTTGGGTGGACAAGTGTGATTTGGTTGTMGGCCATTTTTCAAGTTGC 

CTATACATTGATAAAAGAAATCCCACTAACGGAAGTAGACTGCATGCCAAATTTCAGTGT 

CTTTCTCCAGGGGCCAAGGTTGGACCCANAAGTGCATGGG 

Sequence 630 

CCCCGCGGTGGCGGCCGCCCGGGCAGGTACATTATTGCTTCCTGGGAGAGCTGACCATGA 

GTCAATTGGCCCACAATAANTTATNAAATGAAAACCGGCCATCATCTGCATCTTATGAGT 

GCACGTCATCAGAGATGTCCACTCCAGTTACAAGAAAGTCCTGAGGGCTTTCTTGGAGCC 

TGANGGGCGCTGGAGGTGAGACCTGGAGGTGAGCAGGAGTTAACTAGGATGAGGGACNGG 

CGCAGCATACAGGAAAAGCTGCCTGGGGGAGAAAGGACCAACAGCAAAGACTGAGAAAAA 

AATGCTGTTGTGACCAGGGTTCAGAGCGGGCATGGAGGACTGAGGGTTCAGAGCGGGCAT 

GGAGGACTGAGGGTTCAGAGCGGGCATGGAGGACTGAGGGTTCAGAGCGGGCATGGAGG 

Sequence 631 

CCGGGCAGGGTACTAAGGACAAAMGACATTTATTCTCTTTGACCCTTGCTGCCAGNACA 

GAAAATGACTTCACCCAAGGACACAGCACTTGCGGGTGGCCTTCTCCACCTCCAGCTATT 

GCTTGGTTTCAGGTGACCACTCCCTTTCTCTTCTCAGGCCTATGGGTGGTAACAAGCTCC 

CATC CACT GCTAGTCTTAGACATCTTTACTTTCCTTGATTGATNCCCTTGACTCTGCCCA 

CATCTTn'AAAATATCCCATATTAAACTTTTTACACCCCTTTGAATGTGTCCTGCTTCCT 

GCTGGGACCATGACTAGTCTCTTCTAGTNGGAATCCATATCACCTTCTGTGATGTAGTCT 

CCMGTCAGGCAGNCTCATTTCAACTACAGNCTTTCTTTATGCTTCTCTTTTTCCTTT 

GGACTCTTACCTTTCTTTTATTTCTTACTCAGCMCAGTGTCTGCCCATTAAAATGCACC 

TTTGCGGNGGNGGTTNGGGTATCmATCTCTCTTATTTCTTCCTTNCTGGTGCATm 

ACTGGCATTGCATGGGAATTTTGGTT 

Sequence 632 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGACATACTCCC 

TAGGTGTCTGTGAGGATGGTGGAGGGGATTTTCTCCATGCCGGGAGGCTTCCTGGAGCAG 

GTGCTGCCTCTCGTGACTCTTGAAAGATGCTTGTGAATAAAGCATACTGGGAGCTGAGCT 

GCTGTTTAGTAATTAAAAATCCTTTCCATTGTTTAGAGCTCAGCACCTTTGTGCATTCAT 

ATTACGCATTCATTTTCGTATCATTGTTGMTTTCTCACTTCTGCTACTGCAATGTATGT 

CTACAGCTGACAAGTCTTCCTTGGGAGCCCTACGTAGCTCTTTTTTTTCTTTTCTTTCTT

TTTTTTTTTGAGACGGAATCTTGCTCTGTCACCCAGGCTGGAGTGCAGTGGCGCAATCTC

GGCTCACTGTAAGCTCCACCTCCCGGGTTCACGCCATTCTCCTGCCTCAGCCTCTTGAGT 

AGCTGGGACTACAGGTGCCCACCACCACCCCTGGCTACTTTTTT 

Sequence 633 

GCCGAGGTACTTCCCTGAGCAGTCGAAGTGGATGCCCAGACCAATGGCCAGNGCTAATAT 

NCAANGCAATGATCCCAATGACGATGATTGGAAAAAACTTCAATGGCAGCAGTGACAGGA 

TCTGTGCAGCAACAGCATCTGCATCTGGTGCMCAGGACTTATTTTCAAATCATCAAGGC 

CAMAAGCGATCGGAATGAGAAGGGGGCTTCAACAGCAGGCGGATCATTTTCCCCCATGG 

TGACTATTTCAGGACCTCTGACATCCGGCTCCGCCTCCACCTCTACCTCATAATTCCCGA 

GTCCCAAAAATGTAGATGGCACCACGGAAGAGATAGTAGGCCACAGTGTTACTGGCTTCC 

CATAAACACAGCCCTTTCCTGGCTCACACGGGCATGACCTAATTAAGAACCCCCGCGTAC 

CTGCCCGGGCGGC 

Sequence 634 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAAAGTGA 

MTCCC TMG TCAMCTGTGGCTTATAAGCAGAAATCCTGGTTAGTATTTCAAAGTTCTC 

TTAGCGTTTTCTCCTGCGACTTAAAAGACTTAAAAACGTGAAAAGACATGGACGTAAGAC 
TCCAAACMAAATACATTTCTTTGAAACTAAATAGCTCTTAAGTAAGAAAAATTTCTATA

GATCTTCAAATCATCCCCTAAGCAAAATATTCTCTAATTAAGTATTTCTGTATTTCCATC 

TATGTTCTTCCCAGGCTTGGGGCTGTTGATCAGACCTATTTTTAGGGGTAAGTTTCTAGG 

GGTCATAGAAGATACAGATTTTGACCTGCTTAATGTCAAGAGGTTGCACX3GTTGATTTGT 

CCAGTTGTGMTTCTATGMTGMGCTTTTTGCTTAAATAAAACGATATTCCCCTCTGGC 

TGCTGTGAGCACCGGGAGACTTGTTTCGGCAGTGCCTGGGTGCTGGGGCAGGGCCCG 
Sequence 635 

TG GGCG AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGGGAGTAAAATAAAAGGGTCG 

GATTTTGTAGGATTCTAAGGAAGAGGCAGTGTGCTCTGTCCACAGGCTGCAAGGTGAGAA 

CCTAAAAGAATGAGATATATTCCCATTTTGGAATGGCAATCAAAAAGAGGATCTCTCTGT 

CAAGTCTTTACATTAATGCTGAGTAACAATCTCAAAAGCCTGCCCATTCCCCTTTAGACA 

CATGTGGCAAAAGCAGAACTGAAGGAATGGCCAAGGGGCTTGAACAAGTAGAGAGACCGA 

CAGTCTTTCAMTTTCAGGGAACCCCAGATACATTTTGGGGGAGCCACTGTTTTCCCATT 

TTCCTGAAAAGTTCTTGCAGGTATAAGAAATAGGAATAGAAATTGAATAGGTTCTGGAGC 

CAGGGCTACAAAGGCCCCAGCTCTGATCTGTTAGACTGAAAACCACACATCAGATGAAAT 

TATATNCACAAAAAGGAGAGTCCTTAAAAACAGCCATTTCGGTCCCT 

Sequence 636 

AAGAGCACGTATAGCATGGGGGAAAGAACCTAAATGTCTCTCTGTCCTGTGAGCTGGTGA 

AAAACCCAGCATGAGAACGCAGTGTCAGGTGTGGGACTCCTTCTGCCCCTGCAGTGGGTG 

TTACGGGCGGTGTGCCCTGGCGAGCAAGCTTTGATTCTTGGTTCTTTGAGCTCGTTTCAG 

AGGCTGAGTCCCCACATCAGCTTTAGTTCTTGGACTTCCCTGTATTAAGCAAGAATTAGG 

AGAATGGCTG TCCC TGCAGGCGCCTCCCGTAAATCCTGAGCTCTCTGGCGCAATCTGAAA 

CTTCTCTTCTGTTTTCTTTGGCTGTATCAGCCGAACCAGGAGAGGCC 

Sequence 637 

CCGGGCAGGTACCAGGAGAGATCTGAGACANGGTATGAAGTAAAAGAT7TAAGATTGGAA 

GTGGAGAGTGTCATGGACCAGTGCCTTTCGGATGGGTGACTTCTGGAATTCTTGTTAGGC 

ACAGCGGAGGTTGGT CCTGT GGGAAAGGAAGAATATTTCCGGGGTGAGGAGACTTCGGGG 

TGTGGGCCGGGTGCCTTTTTAAATTTGGAATGGTGTATACAATAGGGAAAGGATGTTAAC 

TTTGCAGCAGCGGGGATGGTGAATATAACCTGATAGGGACCCTTCCATTTTGTTGGAAAG 

GGGAGGAGGGGTGTGCTACCCAGACCCAGTCTCCTGGNTGTAAGGGTAAGAAAGTGAATT 

GGGAAGAATCCTCAGG 

Sequence 638 

CCGGGCAGGTACCTGGACTCCTAAGCCTCAGGGATTTACTGAAACACCATTCTATTTTAT 

MTMTCCTTMCCMGAATTTTMGGATCTTAMTTTTTCTGTGGTTCTATTGTTATCT 

GATATATAGATGATCTGCTGCCATATCCTAAAGAGCAGATGAGGCCGGGTGTAGTGGCTC 

ACGCCTGTAATCCCAGCACTTTGGGAGGCAGACGAAGGTGGATCACCTAAGGTCAGGAGT 

TTGAGACCAGCCTGGCCAACATGGTGAAACCCCATCTCTACTAAAAATACAAAAATTAGC 

TGGGTGTGGTGGTGGGCACCTGTAATCCCAGCTACTAGGAAGGATGAGGCAGGAGAATCA 

CTTGAACCCAGGAGGCGGAGGTTGCAGTGAGCTGA 

Sequence 639 

AGGTACCACTTAACAAGGGTTCTCAGCTGTGNGGNCACTGGACCACTGGGATATGCTGAG 

CTATTGCTTAAACACTGACTTAAATAAAACAAATATTTTAAATAATGAGAATGCTACTGT 

AATTAGAAGGCAATCATTTCAAAGTCTANATGGAGGCCAGGGGCGGTGGCTCATGCCTGT 

AATCCCAGCACTTTGGGAGGCCGAGGTGGGTGGATCACATGAGGTCAGGAGTTTGAGACC 

AGCCTGGCCAGTATGGTGAAACTCCATCTCTACTAAAMTACAAAAATTAGCCAGGCGTG 

GTGGTGTGCACCTGTAATCCACTGAGGCAGGAGAATCACTTGAACCCGGGAAGTGGAGGT 

TACAGTTGAGCTGAGATAGCACCACT 

Sequence 640 

AGGTACAAAGGTTCAGTGGTGAGAAGAGGGAGCAAGGCCTTTGGAAtAATGAACTCCAGT 
TGTTCCTCATAGGTGC AGCA GAAATAGCGAGAGGTCAGGATTATGGAGATTGGTAAGGCG 
AGATCATCCAAGGGCCTTTTGCTTGGTAAGCCATTTTACTTTAATCTTGAGTGCCATAGG 
GATTCATTGACGGATTGATACAGGGAAATGAAATGATTTTTTTTTTTTTTGGTTGGGGGA

GACAAGAGTCTTGCTCTGTTGCCCAGGCTGGAGTGCAGTGGCACAACGTCGGTTCACTGC 

AGTGTCTGCCTCCCAGGTTCAAGCAATTCTCATGCCTCAGCCTACCTTGTAGCTGGGATT 

ACAGGTGCACACCACCACACCCAGCTATTTTTTA 

Sequence 641 

AGGTACAAAGGTTCAGTGGTGAGAAGAGGGAGCANGGCCTTTGGAATAATGAACTCCAGT 

TGTTCCTCATAGGTGCAGCAGAAATAGCGAGAGGTCAGGATTATGGAGATTGGTAAGGCG 

AGATCATCCMGGGCCTTTTGCTTGGTMGCCATTTTACTTTAATCTTGAGTGCCATAGG 

GATTCATTGACGGATTGATACAGGGAAATGAAATGATTTTTTTTTTTTTTGGTTGGGGGA

GACAAGAGTCTTGCTCTGTTGCCCAGGCTGGAGTGCAGTGGCACAACGTCGGTTCACTGC 

AGTGTCTGCCTCCCAGGTTCAAGCAATTCTCATGCCTCAGCCTACCTTGTAGCTGGGATT 

ACAGGTGCACACCACCACACCCAGCTATTTTTTATAT 

Sequence 642 

AGGTACCTCGTTTCTGAGGATCAANACCTNAGNGACCGNGTGTGTGTGTGTGTATTTGTG 

TGTGTGTGAGTCCTATTTGGGCCCCGCCTTTCAGCCCTGTCTTGCAGC 

Sequence 643 

AGGTACTTTCAATTTCTGTGGGATAAACTCCAGCTCCAGTTTCAGAACCCACTCTAATTG 

GTTTAAGCCAGGAAAGGGAGAGGGACATGTTGCTGGGAGGCCCCCATCTGGGGCCTGAGC 

TTGGAATCAAATCAGAGGAAGGCAACACATGTAAAGTGCTGAGAGTGGAAGGATGAAGAG 

AGCTAGGGCTTTGTGCCATCACTCGTGCTCTGGACATAAGTGGAGCTGGGATTCAGCATT 

ACCTGCCCCTGTACCTGCCC 

Sequence 644 

CGGGCAGGTACTAGTCCAGGTGTGAGATGAAGGGGGCCTGGATGAAGCAGAGGGTGAGAG 

ACAAGGAAGATTCTGAGGACCTTGTGGCTAGATGTGGGGGTTAAGTCAGGTTCAACTCCT 

AGGCTGGATGAATTGGCAGATGGCACATGAACTACAAGAGAATGGMGGCAGAACCTATT 

TTGTGGGCAAAAMTAMTTACATTTTGCMTACTGAATTGAGGGGCTTCTTGGAAGTCC 

AGGTGTAGATGTCTTACAAAAATAGAATATTCTGGGCTGGGTGCAGTGGCTCACCCCTGT 

AATCCCAGCACTTTGGGAGGCCAAGGTAGGGGGATCACCTGAGGTCAGGAGTTCGAGACC 
AGCCTG 

Sequence 645 

GGNCACCACACTCTACAAAGGCAGTCAACTACATGACACATTCCGCTTCTGCCTGGTCAC 

CMCTTGACGATGGACTCCGTGTTGGTCACTGTCAAGGCATTGTTCTCCTCCAATTTGGA 

CCCCAGCCTGGTGGAGCAAGTCTTTCTAGATAAGACCCTGAATGCCTCATTCCATTGGCT 

GGGCTCCACCTACCAGTTGGTGGACATCCATGTGACAGAAATGGAGTCATCAGTTTATCA 

ACCAACAAGCAGCTCCAGCACCCAGCACTTCTACCTGAATTTCACCATCACCAACCTACC 

ATATTCCCAGGACAAAGCCCAGCCAGGCACC 

Sequence 646 

CCGGGCAGGTACAGGGGCTTGGGGGCTTGGCCAGGCTCTTCTCCATCCATGCCACGGGGC 

TGACAGCCACAGATCTGGAAGCTCAGGCCTAGGAGTGCAGGCTCCGTTAAGCCCTGTGTC 

CAACATCCTGACTCCTAGGGGTGCCCMGATTTGAGTGGCCACTTTCACCTCTGGAGGAA 

GTAATACCTAAGGCGCTGATAGMATAGAACTTCCGCTGCCAGGCAAGGTGGCTCACACC 

TGTAATCCTAGCACTTTGGGCAGCCTCAACGCAGGTGGATCACTTGAGGTCAGGAGTTCG 

AGACTAGCCTGGCCCAACATGGTGAAACCCTGTCTCTACTAAAAATACAAAAATTA 

Sequence 647 

AGGTACNTGTTTCAGTCACTGGGCTGANNTGGNNCACAGCACAACCTTCATAGCCACTGT 

ATGMGMGTANMGACCCAGACTCTTGCTTTATGTTGGTATCAAAAGTCATTTCAGAGT 

CAGGCTGATCACTCCCAAGTAACCCACTGACTTCTTTACTCCAGCTCTCTGTCTGCTGNT 

GACTCANAANGTNACACTTNATTTTCTCCATTGCTGATATAATCATATCTGCAACATAAA 

AGTGGGCATTTTCCTTTTCTACATCAACAGGCAGCACAAATACCTCTGGTGAGAAGGAAT 

TCNAAGAAATGGTTNTTCTACTGACTTGAACAGCACCTTCATCAGCAGCAGATGTCAGAT 

GGGAAGGC 
CCGGGCAGGTACCACTAGATTGCCTCCCTGTGCCTGGGCAATTTCAGAAAATGGTGQTTT

TCCTTTCGTTTCCATCTTTTTTAAGACTTAAAAAGTATCTGCTCTCATTTTCTCCTAGCG 

GCCTCCATGCCTTGACTCAAAAAATGCTGTCTTAGTTGACAGCCTTGAAATGAGTATGAC 

CCTAGCTCTAGTTGGGTGGAAATCACCTCGCATAGAAATAGACCTGGAGGGCCGGGCACG 

GTGGCTCACTCCTATAATCCCAGCACTTTGGGAGGCCCAGGTGGGTGGATCCCGAGGTCA 

GGAGTTCAAGACCAGCCTGGCCAACATGGTGAAACCCCGTCTCTACTAAAAAAAAAAAAA 
AAA 

Sequence 649 

AGGTACTAGTATGMGGAA ATAA TATCCACACACTGATACTGGTCCAGCNGAAACCAAGA 

CCGCTCCTGGTGCATTAACTTTTAACAGAGCANGGACTCANTTCTCTGAAAATAGTGCCA 

TAAAACATGTGCTCCCAGAAGAATAMTATTTGGCTTGCTAGMTTTCTGCNGCTTTTNT 

GTAAAAGTTGATTATTCGGTATTAAAGAGGAGTATCAAATATGNGTNNATGNANNAAAAA 

CTTGGAAANAGTANNGGACCNNGGCTTATCTCNTCATTTTCATTCTGCACACTNCAANTC 

ANTCNTTTTCCCATCTTNNTTCCCNTCTCTGNAATTTATCACCCTCCCCCTCT 

Sequence 650 

GGCGATGGACTCCACCGCGGTGGCGGCCGAGGTACTTACCACAGAGAAAAGCCAATAATC 

ACAATATATGTTGTCTTACTGACCCATAACCCATTTTCTGAGGGTGGCAGGCATTGTGCC 

CCCACTGTGAGGTGGGACTATATACATATACAAAGGAGGTGTTTAACTGGGTGGCATGTC 

TCAGGGAGATGTMNGACTTACCTGCATATCCTGGCAGTNTTGAAATGATAGTGAACTGT 

TCNTANGGCNTCCNCTTGATGGCATAGNCNAAACACCAGNATTTTCTTGGNAGAATGATT 

CGGNAAATGCTACATAGAAGAAATGGNTGGTGAGCTNTTACTGTGACTGTGCCCATAGTA 

AGTCATCCTGGACCCTCTGAATCTTATCCCAC 

Sequence 651 

GCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTGTTTTTTTTTTTTTTTTTTTT

l I 1 1 l ATGATATTCATCTATGMCCAATTTAATTTTAATTTAAAAATGACTTCTGATCTG 

GCAGATGATTTGGGTTCTAGAAAGAAATTGTGCCAGGCATGGNGGCTCATGCCTGTAATC 

CCAGCACTTTGGGAGGCTGAGGNGGGAGAAACCTGTNAGCTCAGGAATTGGAGACCAACC

CTGGNMCGTMCAAGACTTNTTTTTTACAATATTAATAATAAAAAAAGGCCAGGTGCCG 

GNGGTTCACACCTGTAATCCCAGCACTTTGGGAGGCCACCAGACAGGTGGATCATNAGGT 

CAGGAGTTCGAGACCAGCCTGGCCAACGTGGGGAAACCCTGCCTNTACTAAAAACACAAA 

AATTATCTTTGCTTTGNTGGCGGGAGGCTNTAATCCAGCTTCTAGGGAGGTTGAGGCAGG 

AAAATCNCTTGAATCTTGAAAGCAAAANTTNCAATNAGCCCNGGTCACACCATTTGCCTT 

CCMNCTGGGCAACMGAGCCAAAT1TTNTTNAAAAAAAAAAAAAAAGGGCCGGCCTTGG 

GGGTNNTCCCNGGAACCCCACNCNTTNTGAGGCCCCACCGGTGGTNNTAGGGNCNGAGTT

CAAAACNNCCTTCCACTTTTNGAACCCGTTTTTTAAATTCCAAAAATTNCTGGCTGGATA 

AATCCGNAGCCCCTTCTTGNGGGTGNGGNNGGANTNTTTCCCNCAANNGGGGGTGGGTNA 

NNNCCGAAANCCCTTATTTCCCCTNTGNAAAGGGNCTTTTCNAAAAAAAAAAAANA 
Sequence 652 

CCCGCGGTGGCGGCCCGCCCGGGCAGGTACAGGTMGGCAGAAGGAAGGAAGGGGAMGA 
AACAATTCCAGGGCCCTGGTTTCTGGGATGACAGGCTTCCCAACACTCATGCCAGGACTA
TTTTCCACCTCGGTTCACTATGGG I I I I I I I I CI I I 1 1 1 1 I I AATATAATG AATTTTTAA 

AATGTGTGTTTGTTGCCCAGATTATTCCANAAAGAGTTGAAGGGAGGAAAGGNGTGCNTG

GGGTGCNTGGGANTTTTANCCCTCTNTCCACCCNGATTTCTAAGTTGGGGGGGGCATCCA 

AACAGCTTCACCCANGTGCCCAGGCTNTTTTTTGNTNTCCAAAGCCAACCCTTCCAGGGC 

ANGGANGGGTGAAGNTTAGGAGGGCAAAGGTTAGCCTGGAGGCTGCAATTAACAAGAATC 

AAANTGGGGTTTAAGGATTCTCACACCCAGTTTGCTAATTTCAGCTGGTCCTTGTAGAGG 

TGACACCTAGTAGGACAACATGGNT1TTNGGGCAGGGCTGGGGTGCTGGTCTCTGCTTTC 

TAGGGTAGAAAGGAATCATACATTGAAAATGCTTAAATCGATGGAATGATTTATGTTCCT

NATCTTTCATCTTTTTCTGNGTGGCTGGTTTTCTGCCANCCTTACTTGGACAAGCACCAT 

TCTANACCTTTTCTNTAGGCATNTCCNAGAANGNGAAGTNGAAAGGAAGAAAAAAAACTT 
ATTTTTTNNGNCCAAAAATTNGGCAAAAAAAAA
Sequence 653 

TCCACCGCGGTGGCGGCCCGAGGTACGCGGGGGCCCTTCTATCTCAGGATGTTTGCACTT

GCTATTTCCTTTTCTTAMGGCTCATCCCTAGATATTTGCATGACTGGCTTCCTMTTTN

NTGTMGCTTTTGCTGAGMGTTACTTTACCAACTGTCATTGAGGTTTTCCCTGAACATC 

TTAGGTMGATAACMGCTCCCCTCCTTTTCTTTCCTCACTTCTTGGTATTCCTTATCTC

GTAACI I 1 1 I I I I INGGGGGGATNGANACTTNCGTNTTGNTTTTGTTGGCCAAGCTTGGA 

GTGCANNGGGTGCANTCTTGGCTTNACTGAAACCTCCACCTCCCGGGTTTAAAGCGATTC 

TTCTGCCTTNAGCCTNCCGAGTAGCTGGGACTACGGGCAAGTGCCACCACACCCAGCTAA

i i 1 1 1 iGTATTTTTAAGTAGAGGTGGGGTTCACTGTGTTAGGATGGTCTCTATCTCCTGA 

CCTTTTGGGCCACCCACCTCGGCCTCCAAAGTGCTGGGATTATAGGTGTGAGCCAGTGCN 

CCCGGCCTCTCATAATTTTCTTAAATT 

Sequence 654 

CNAATTGGAGCTCCCCGCGGTGGCGGTCAGTTNGTCTTAGAGATACCCATGAGGTCACCT 

ACTCAAAATGGGGCTCAGAGTAGCCTTGTCCCATTCTTGTCCAGTGGGCGCAGCTACAGT 

CTNNCTGGNNNGGAGTGACTGGAGGCTGTCCCCACGTCCCACTTCAGTGAGGCATTCATG 

TGCACCCAGCACACTTTCTAGCTTTATTTGCCTGGAGGGGAAGATTCTCCAGAACCTTGT 

TAAGATGCACAGNGNGGGCCCTTGGACTGGCAAGTGTGGCCTTNGGCAGTCCCTNGGAGC 

TTGTTAGGMTGCAAMTNTTMGCTTCTTCCTACTGNATCTAMGGTTGANTTTAAACA 

AGATCCAGCTTGTTTCGTTTCACATGAAAGTTGAGGCACACTGCTCTAGAAAGTTCTTTT 

ATCTTTACTGGCCCACCAAAGTAATCAAACTTTGNGAAGTACCCTCGGNCCGCTCTAGAA 
CTAGTG 

Sequence 655 

GCTCCCCGCGGTGGCGGCCGCCGGGCAGGTACGCGGGATATGAAGTGAGGTTAAGTCAGA 

TGGMTGGCAGTGGACTACTGTTTTTGGTTAATAAATCGAGATACCCTTAAGAGTTGTGN 

NCTGAACATACTGTCTTTCTTTCCCCAGTTCCATGTCACAGCACCTGCCTAATAATAGGT 

GCTCGAAAAACATCTGTTGAATGAAATGAATTCTTTTGTTTGCAGTAGGGCAAAGAAGGG 

TAGAGAGGAACNACTTTGCCAAGCTGATNTGTAAATGTTGCAAAAGGGTTTNGGCCAGAA 

AATTCCNANAACCCATTNGAGAGGCAATACATGTTAAGGGACCTNTAAGATGTTTCACAA 

CCnGGAMTMTTAAAAAGMCTTTCTACTGmTTACTTATTTCCCACTCCTGGCTGCC 

CCTCTTGGGTGGACTGCCTNCTGTTGGAGGGAATACTGNGTGAGACACATCTTTTAGTAA 

AACAGAAATGTGAAACCAACTTGCAGAAATCACAAGCACACTGTTACCAAATAGGTCTTG 

ACTGGCTCCCTTNCTGGGGGACAAATGTTTTGATAATGTCTGTCAGTAGATTCAGTTCCC

TATTTCTTTTTMGACTTGATATTTAAGAATACTGGTTCTTTTTTGGCCAGCATCGCAAN

GAAG I I I l l CTTTAACTTTTGGGCCAAAAAAAAAAAAAAA 

Sequence 656 

CGMTTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACAGGATGTTTTCCTAAATTTTAAGT 

CAMTTCTTCCTTGACACATACCTATTTTTATTTTGTTTTGGTTCTCATCTCTGTGAACA 

GAGCAMGCATGCMCCATTGTMCACTTtCATTTGTTTTTATAAACTCAAGTTCTAGAG 

TTGGATTTCATGATTTGCATAACTCGGCATAGTGTAAGTGCTTGTAGTTTTAAACAGAAA 

AAGAGGGAAGAAATGACNATTCCANAAAAAAAGATCAAATCTTATGACTGTAATTTATTA 

AGGNATCCMTGGMTTCTTTCCCTTTTTCTTT CI I I I I I I II I 1 1 I I A AAG AGACAAGC 

TCAAGTTCCATAAGCTGGGAATGCAGTATCATGATCCATAGTTCACAGCAGCCTTCAACT 

CCCTGGGGTTCAAGGNGATCCTAAGAACTTGNGGGCCTCAAGCAGTCCTCCTGCCTCAGC 

CTGCCAAAGTGCTGGGGATTACAAAGCATGAGCCACTGCTCCTAATTCTTAAGAGATA 
Sequence 657 

GAATTGGAGCTCCCCGCGGTGGCGGCCGGGCGCGGTGGCTCATGCCTGTGGTCCCAGAAC 
TTTGGGAAGCCGAGGCGGGCGGATCACGAGGTCAGGAGATCAGGACCATCCTGGCTAACA 
CGGTGAAGCCCCGTCTCTACTGAAAATGGAAAAMTTGGCCGGACCGTGGTGGCGGGCGC 
CTGTGGTCCCAGGTGGCTGGATACACGGGTGTGCACCACCATACTGGCTGATTCTTGTAT 
i i I iGGTAGAGATGGGGNTTTGGCCNNGTTGGTCCAGCTGATCTTGAACTCCTGCACCTG 




CCTNGGCCTTCCAAAGTGTTGGGATTACCGGTGTGAGACACTGGCCCCTGGCTATATTTT 

ACTATTTGGAMTCACMTGCATCTTAAAATTGATGGCTTCTTGCAACCACTTTCAACCA 

GGTGCCTGTCATGATTTAAGTGCTAGCATCAAGGCAGGTTAGTTATGAAGAAATAGAGTG 

TGTGTTTATATACTCACACAGTTAGAAATCGACCCTTTTAAAMTTATTTCTTTTTGAAA 
A 

Sequence 658 

AGCTCCCCGCGGTGGCGGCCCGCCCGGGCNGGTACCGCGGGATATGAAGTGAGGTTAAGT 

CAGATGGMTGGCAGTGGACTACTGTTTTTGGTTAATAAATCGAGATACCCTTAAGAGTT 

GTGTTCTGAACATACTGTCTTTCTTTCCCCAGTTCCATGTCACAGCACCTGCCTAATAAT 

AGGTGCTCGAAAAACATCTGTTGAATGAAATGAATTCTTTTGTTTGCAGTAGGGCAAAGA 

AGGGTAGAGAGAAANCAACCTNGCACAAGCTGNTTGTTNAATGTTGCAAAAGGTTTAGGC 

CAAGAAAANTTCNAAAACCCATTNGAAAAGCATACATGTTTAGTGGAACCTTGAAAATGT 

TTTCACAACCTTGGGAMTMTTTAAAAGTAACTTCTACTGGTTTTACTTATTTCCCACT 

CCTGGCTGCCCCTCTTGGGGTGGGACTGCCTCCTGTTGGGAGGGGAATACTGTGTGAGGA

CACATCTTTTAGTAAAACAGAAATGTGAAACCNACTTTGCAGAAATCACAAGCACACTGT 

TNCCMTTAGCTTGACTGGCTTCCTTNCTGGGGGGACAAATGTTTNGATAATGTCTGTCA 

GTAGATTCAGTTCCCCTATTTCTTTTTMGACTGATATTTMNMTACTGTTTCTTTTTT

GCCACCTCGCANTGGMGTTTTNTTTACTTTTGGCCAAAAAAAAAA 

Sequence 659 

CCGCGGTGGCGGCCGAGGTACTGGTAAAGGGATAGTCACATAGATCAATGAAAAAGAACA 

GAGAATCTGTGAAGAGACCATGCAAATATGCCTGCCTGGTTTTTCACAACAGTGCAAAAG 

CAACTCAGCCAACAAAAGACAGCTTTTGGCCAGGCCGCAGTGGCTCACTCCTGTAATCCC 

AGCACTTTGGGAGGCCCGAGGCGGGTGGATCAACGAGGTCAGGAGATCAAAGACCATCCT 

GGCTAATATGATAAAACCCCCGTCTCTACTAAAAAAACACACACCCCAAATTGCCCGGTG 

TGGTGGCAGGTGCCTTTNGTCCCAACTNCTTTGGGANGGTTAAGCAAGGGANAATGGCNT 

TGAACCCGGAANGGAAANCTTTGCCNTGGGGCCCANATTTNNNCCCNTTNNNCTTCANCT 

TTGGGTGNANAAAAANCNGGACTTGGTTTCAAAAAAAAAAAAA 

Sequence 660 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTTTGAAGGTGAGCTTTGAA 

GATGCAACATGAATTTGACAGTANAGATGTAGGGAGGAAGGAAGGCAGGACAGGTCAGAC 

AGAAGTGCAGGAACAGCCCAGGCCTTTGCAGCCTTCCCACACCCCCTACAAGACCTGCC 
Sequence 661 

ACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCANGGTACGCGGGGACT 

TGACTTAAACTCTGGGGCCCGGGAGGCCGCCGGTnTCTCCCCGCTTGCCGGGGTGGTCC 

TCTTCCCTTTGTCGGACCAAAGAAGTAAACACTGTGTGGAGAGGGACTGACGTGTTTGGA 

GGGAAATGGGAATGTACCT 

Sequence 662 

AGGTACTCCAAGCTCTGAGACCACCTCTTCTGCAAAGCCTTCCTGATTCTGCAAAGAACA 

GGTAGGCATTTCATCCTTGGGACCTCACAGCAATTCAGGACACATTTGTGTCCCAGCCCT 

GCTTGGCTTGGCTGTCTCCATGAATATACACTTTGTAACTTCTGCACCAGGCATCATACC 

MGCACACAGTAGGCACTCCTGTGTTTTTTGAATAAGTGACTATATCATCACCACATTTC 

AAATGCGGAATATATGAGCTACTAGAAAAGACATAAGGGTAGATTTTACATCTTTATTGT 

ATCCTAGATATACAAGTCTATTACTGCCTTTTCCCATGTTCTGTCAACATAGCATAAAGA 

ATGTGGATTTACCTGTTAGAAATTGAATAAGCGGCCGCTCTAGAACTAGTG 

Sequence 663 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAGTTACACGGCAATTT 

TNTTAATAACTCCCTTTACTATGTCAGACAAGCTATGTCAAGCGTCTTCTGTATCCTNTA 

CGGGGAAAAAAAAAAAGTTAACCAGAGCCAAATGCTTGCTTTCAAAGATAACTTGCCATC

CTGAAMTATAATTTTTACMTTCAATAACACCTTTTTGAAAAATAAAATAATNTGGCAA 

AAATGCCCATGCATTMCAAACCATTTTTTCAGTTTAATCTCTTTATATGTTCAACTTTG 

ATGTATTTTAAATAAACAAAGCAAAATTCAACTAAAAATACAATCTGGATTCCATAGCCA 
ANGGTTTTATTTACMTTCCTANTAGGMGGCTTTAT1TTTAGCTNTCAAATGGGGNNGG

ACCTATMGGGMAmAAACCGTTTNCNTTGAGTTTTT^NTTTNAAGGGGAANGGGGG 

AGGANTTCCCAAAATGGGGAAAGGGGAAAAAAAGGGGGNAANNCCNTTTTGGCCTTTTNN

GGNANTTTTAMAAAAMNTTTNCCCCCCCGNGNCCCCCAAAAAAAAAANNAANNTTTTT 
TTNAAAAAAAAAAAA 

Sequence 664 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGGCAGGTACTGCTGGTTTTCTGGT 

GTTCCTACAAGCTGCCTAGGTCTCTTTTGTTCTCAGCAGTTCCCAGGCATGCAAAAGTTG 

CCAGTTCTGTCAGCATTCCAAGTCAGGTAAGACAGAAAGCCATCTCTTAGGCAGTCCCCA 

GAAAAGCTGAAAGGTTGGATATACTTTCTACTCTTCTCTTTTCTTCATGAGAGAAAGGCC

ATGTGGGCATTTTCTCCCAATAACACTGAGTTCTGTTGTCTTCTGTCGGCTGTGCTGCAG 

GTTCTCAGGTGCTGCAGTTAGCTGCT 

Sequence 665 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGTAAGATTTCCTGAAGGGATCCATAGCC 

AAAACATGTTTGAAGGCCACTGGGCTCGCTAACTTCTAAAAGCACCCCAGTTCTAGCAGA 

CATCCTAAGGAACATTCCCAGGAAAATTCCAGCCTAGAACCTCCTGGGGTCTGACAACCT 

TAGAGAACAGTGCTGGCTTTGAATGGGCTTGGGGGCAGCCTCGAAACCCTCTTTCCAGTC 

TCCATGCAGGCAGGGGAGCTCCTTAAGCAACACATAGGACATTTCTGGGAGAAATGGGAT 

CCCCAACACAATGAACACTATAGATTTTAATGGTCTATATGGTTAAATACACAAGGCCCC 

TCATTTCCAACCCCGCCTGTTCATCTGATTCATCTGTACCTCGGC 

Sequence 666 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTCAGACATTCCCCCAGTGGCT 

GAAGTGGCATATGAATTATGAAGTTGGATCATTTGGAATGAATGTAAGAGAATTGCCAAG 

GGCTCCTCCTACTCCAGAGAGGAAACCTCATCCAGGGCCATGAAGCCACTTCCTCACCAT 

CTGTGTGCTGCTTAAGCTAATGCTGCGGGAACCATGGTTCCTTGGGAGGAATCAAGCTGA 

CTCTTGGCATGAGATTCCTGCCTTCCTAGGGTTGAGAGCGGCACTGCCATGGCTTCTCTG 

GACGACCCAGGGGAAGTGAGGGAGGGCTTCCTCTGCCCTCTGTGCCTGAAGGATCTGCAG 

TCTTTCTATCAGCTTCACTCACATTACGAGGAAGAACACTCAGGGGAAGGACCGTGATGT 

CAAAGGGCAAATTAAAAGTAAGAGGCGAGGACCTTGCCTACCCCCTGCTTGTCGTTGAGA 

GCTTTAACTCACNGGATAGTTCTTATCACTTTTGGTGGTGGCACAGGNATATGATAATTA 

GTAGTAGCCAACAGATGACTAGTNGTTGTCATGTGCCAAGCGTTTTAAAAGTTNCCTGTT 

ATTAATTTCATTA 

Sequence 667 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGAAGAG 

AAAGCGTGAGGGCTGGGCCTGCGGCGGGCTTTAGGGAGTGGTCCCTGGCTGTGGATAGAT 

CTGCTGATGAGTCCAGGCCCCGGTCCATTCTCCTCGCGCTGCAAGGATGCTCCTGGGATT 

TCGGAGAGGCCGCAGGAGTCATTTCAAACACATCATCCATGGCCTTTTACCTGCAGCCAG 

CGTTGCTCCGAAGGCAGCTGTGCCACGCACACCTCCTCCCCGCAGCCCCMCCCATCTCC 

AGAGAGACCAAGATCTGCTCTGGCAGCAGCCATTCTGGCGACAACATTGAC 

Sequence 668 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGGATAGGAGAGGGAGGAGGG 

CTGGCAGCTGAGTAGCCAGAATCAGGCAGACGGTGGTAGCAGAAGTCAGAGGCCGAGGGA 

ATCAGGGAAAGGAGTTCAACCATGGAGGACCTTGCTGGCCAGGTTAGAGACTGTGGACTT 

TTGTCTGGGTGAGACAGGAAGTCACTGGAGGGCTGTGACAGAGCTCTGAGGCTGTGAGGC 

ACTGCTCTGTCAGTGCCATGAGTGGGGAAAACAGGAGCTTGCTGCACTGGTAGAGACCAC 

AGATAATGATGACTTGGACAGAGCAGCTGGGAGAGAACTAGTTCAATAACCCTAACACGC 

CTCTCCATTCTGCATTTTCCCTAAAAATGTACCTGCCCG 

Sequence 669

CCGCGGTGGCGGCCGAGGTACCTGCCCTATCTTGCTGAATGTTTTATAATCTAATAAAAC 
TCAGATAAAGACCCAGATGTCACACCTGAACAGGAAAAGCTGAAAGGAAAAGATAATTAA 
AATATAAATCAACAGAATCAAGATTTTGAAAAGACCTAGAAAACTTGAAGGATTACTGAA 
GCCAA GCAAGAGGAAATGATAGGATTAAAAGTAAAATCTTGTGTTTAGA I I I 1 1 1 I 1 1 1 1

TTTTTCAAACGGAGTCTCGCTCTGTCACCAGGCTGGAGTGCAATGGCGCAATCTTGGCTC 

ACTGCAATCTCCATCTCCGGGCTCAAGCAATCCTCCCACCTCAGCCTCCCTAGTAGCTGG 

GACCACAGGCATGCGCCACACGCCTGGATMTTTAAAATATATACATATTTTTGTAGAGA 

CAGGGTGCTGCTTTATTGCCCAGGCTAGTCTCAAACTCCTGGCTTCAAGGCATCCTCCTG 

CCCCAGCTTTCCAAAGTGCTGGGATTACTGGTGTGAGCCACTGTGCCGGGACATAAATAG 

TTATGCTGTATTGGTTAAGGAATAATGACA 

Sequence 670 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAGGTCATGCCCGTGTGAGCCAGGAAA 

GGGCTGTGTTTATGGGAAGCCAGTAACACTGTGGCCTACTATCTCTTCCGTGGTGCCATC

TACATTTTTGGGACTCGGGAATTATGAGGTAGAGGTGGAGGCGGAGCCGGATGTCAGAGG 

TCCTGAAATAGTCACCATGGGGGAAAATGATCCGCCTGCTGTTGAAGCCCCCTTCTCATT

CCGATCGCTTTTTGGCCTTGATGATTTGAAAATAAGTCCTGTTGCACCAGATGCAGATGC 

TGTTGCTGCACAGATCCTGTCACTGCTGCCATTGAAGTTTTTTCCAATCATCGTCAT 

Sequence 671 

CCGCGGTGGCGGCCGAGGTACAAGGAAGGCCTTAAAGACTGCCCCACTCTCCTTGTTTCC 

CATCCCCTGTCCCTTCTACTTCTCACATTCACCACTATGTGCCCTAGGACAAAATCAAAT 

GTGGMACATTTGGTCATGTCTACTTTGTCCAAGGGTGGGAGTTCTTGAGGAATTCAAGT 

GGGAACTAGAACAACTTTCTACCCTTTCCTTCCCTTCCTTCCTCCCCACCTCTACCTAGA 

AGCCCATCAATCACTTTGAACTTCTTGGAGAAAAAAGGAAACAAAAGAAAAAGAAAAGGA 

GAGGCTGGGTGCGGTGGCTCATGCCTATAATCCCAGCACATTGGGAGGCCAAGGTGGGTG 

GATCACTTGAGGTCAGGAGTCGAGACCAGCCTGGCCAAAATGGTGAAAACATGTCTCTAC 

TAAAAATACAAAAATTCGCTGGGTGTGGTGGTGGGTG 

Sequence 672 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGACACTGGTGGGG 

GAGAGTCCGACGCGCCTGGCTAGGAGCGCCGACCGCAGGGCCTCTACGGACCTTACTAGA 

AAAATGAAACCTGATGAAACTCCTATGTTTGACCCAAGTCTACTCAAAGAAGTGGACTGG

AGTCAGAATACAGCTACATTTTCTCCAGCCATTTCCCCAACACATCCTGGAGAAGGCTTG

GTTTTGAGGCTTCATGCCAGAAAGGGGAATGGGGAATGGCTGCTTAACGGCATGTNTTTT 
TT 

Sequence 673 

CGCGGTGGCGGCCGGCCGGGCAGGTACACGATGAAACGGGGGTAAGGAAGGAGMGAAAA 
ACATTGAAAGGCATTTGACAGGGTAAGGTTGTATTCCCCAGACAACCCTGTCAAGCAGCT 
CTGAAGGGATGATGAGCCTGGACTCTCTGGACTCCTAGATTATGAACTCCTGCAGTGGAC
CATGTCCTA I till IGGAGGCGTTGGGGGGAATTGTCTTACGCAGCACCCAAGCACACTG 
CTATGCMTGGACCACAGATAGGMGCMGCACTGCATTTGGCTCCCCCGCGTACCT 
Sequence 674

CCGCGGTGGCGGCCGAGGTACTAAATCATTAATTCATCCTGAGCTAGTGGCTTTATTAAT 

GAGTATCTCACAAATACCACAAAAATTCAACCTGGCCATGTGGAGCAATATAAAATTATG

GCATTTCTTGGTATGTTTTTCTCTTTGGCGAGGAGACAACTTGATCTTGTGTTTCCAGAA 

GCATGTTAATTTGCCCTGCTTGCAGAATCTCTCTGGCTTGAAAGGAGATTATATTCATGG 

CAGTCTGTGAATTTTCATTTTATTTCATTTATTTATTTGAAGACAAGAGTCTCACTCCAG 

CCTGGGTGACAAGAGCAAGACTCCCGTCTCAAAT 

Sequence 675 

ATANGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCGCGGGGAGGTACCATGAGGGGAGA 
ACCGGCAAGGGGTGCCATTCTAGCATCTGGGTGGGAGAGAGGAGGCTGAATGCCAGGGGA 
AACTTCTTGGAAAAAGTGATGCTGAGTTAGGACAATTTAGTCMTGAGAAGGGATCTGGC 
TGTTCCTGGCAGTGGAGACAACATNTTTAAAGGCATGGGAGAATATCTAAAATTTACCCT 

Sequence 676 

AGATAATAACATCTGATATCCACATGGGGTCTGGAGGNGCAAGCCACCTTCCTTTCATCC 
CACGGTCTCACAGCAGCCCTGGAMGAGGCTGCTCTCTGTTGGAQGCTAAGGGCCAGTGT
TGGAAGGAGCTCGGGTGGAAAGTGTGGTCTGCATGAGGGGCTCCCGTGAATAGAGGAGAG 
GGGTGGCNGGTACCTGCCCG 
Sequence 677 

TACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCANGGTACCTGCCTC 

TGCCAGATACCCCTGAGGGAAGAGGATGTTCTATAACCAGGCCGACAGGTTAGCATTTGT 

GAACACAGTTCTGACGTTGTTGGGAGGGTTTGTTTGCCAGAAACATCCCCATGCGCTACT 

CTTTCMCCAGAGGTCMGMGTCCTTTACTTTTGTGTCTTTTTTGTTTGTTTGTTTGAG 

ACGGAGTTTCACTCTTGTTGCCCAGGCTGGAGTGCAATGGCGCAATCTCGGCTTACCACA 

ACCTCTGCCTCCCAGGTTCAAGCAATTCTCCTGCCTCAGCCTCCCGAGTAGCTGGGATCA 

CAGGTGCCCACCACCACGCCTGCTAATTTTCATACCCGCGTACCTCGGCC 

Sequence 678 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCTGTGGTATTTCA 

CATAATATTAACCAGCTGTTAGCAATGACTGATATATACTTCCCATTGAAAATGATGTAA

GGTCTGMAGGATTCATTTTGACAATTTTATATCACATATTTATATTTACCTTAGGTGGT

TCI I I I IMTGTTTTAATTTGGGACCACACTAATTTCTAACTTGGTAACTCATCTCTTAC 

CAMATTMTACCMGCCMGAAAAATGGTTTCATGAATAGAATCTACTAGTCTTTTATA 

TCTTATAATGGTAGATCACTGATGAGGTAGAACTCCATAAGAGCTTCNCTCTCACAGTNA

MGGTTTTGGTTGTGCATGGATTACACCTGGTGAAAGTTGGTTAGTATTTGTCTAAGTGG 

CTTAAGACAAATTTATTTTGATTTGTATTGTGAATGACTTTGCGAANCACCCAGAATTTT 

TNCCGCTTCGTGTGTNGTGTGTGTGTGTGTT 

Sequence 679 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGCAGGTACCCACATAATAGCTC 

AGTGCATGCAATTTACAAAGTAATAAGTGAAATGCTCCCCATAGTTGACTATAACATTTC

CTCAl 1 1 1 iCTCTGAATTTGCTTTTTAAAAAACTCTTCCCCTTGCCATTCCCTTCCCCAT 

TCCAGATTGTAACTGCTTCTTTCCAGCTGCATCAGAAGAAGGGGACTTTCCATGTAGGTG 

TTATTCTCAGAAAAGGCCAGAAAAGACCAGGTCATGGTGGGGATGATTTGCTCCAAGCAT 

AAAAGAGAATTGTGATGGTTCAGGAAGACTGGAAAATAACGAGACTGGAAAGAAATGAGA 

AGGGCTTCAGAGGAATGGCACATTGAAATAAAAGGGAAGTGGTAAGAACAGGAACCCAAG 

NGGAATGAANGGGCNCACAGTGGCAGGGATGATTGGATAGACTGTGGAATAAAAATAATT 
TG 

Sequence 680 

AGGTACAAACTGGCTTCTTCTCTTTGTCACCAGCACCTGCTTCATAGTCTCTCTGGAGTG 

CCAGGAACGGGTCATTTAGATTAAATCTCCCATACCGTTCCTGGATAAATACCTCCTTCC 

TGCGAGCCCGCAGGGCCTCGATGACAAGGTCTCTGGCCTCCAGCTCCCCTTCCATCACGC 

TGAGGAGCATCCGCAGCTCGGATTTACTGAGAGTATCCACATCMACTCTTTTTTCAGTT 

TTACAAGTGGAAATTAAGCAGTCCTCCTCCCCGTTTCTCCTTCCATTGCCAGGCTCAGCT 

CCTCTCACCCCAAGTACCTGCCCG 

Sequence 681 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGGTGGAAAGTGAGGTGGTTT 

TATTTCTAGTTCACCATTGTCCTTAGGCTGTATAGACCTCTGGAATCCCAGCTTATGTGG 

AGAAGGTATCCTGTTAGACTTCCCTCCTTTGGTCAGCACTGGGCCTTAACTTCTGGCCCC 

TCAAAGCTGCTAAAACTGAAGGCCAGGCTTGCCTGGCTTGGCAAAGGACGTCGGGCAGAA 

GCAGCTTCTCCTCTCCTCTTGTTTCTCTGTTTCCCCTCACCATAGGCTTTGGCCTGGGAG 

TTTTCCTACA 

Sequence 682 

ccgcccgggcaggtaccttcttggttgctgtgactgtctgctagcactaagactgtctta 

agcagatagagggcaatggtctttgaaggcaaatgacaaagcgtggccctgagctccctg 

actgagttcatttggactctcaagggatgccctggagctagactcgatctgagtggttgg 

actaactcctctttgtttttgtattgaagagccagcttaccccgccatttntaaacctca 

ggccaggaaaaccaaaaaacaaaaaaacccaacxjcaaacaaaaaacaaaccccaccttct 
TNAGAANTNAGTAANCTTAAGGCTTNAAGAATCAACAGNGCCCCTTTGGGNATTAAGGGC
CATT 

Sequence 683 

CCGCGGTGGCGGCCCGAGGTACTATAAMTACTATCCTAACTTTTTATGTGTTTTTTTAA

CTTGTTTTTTAGAAGTTTTGTAGCGGTTTTTAAAAATGATGTATTTATAACTGGTTAGGA 

TGCTAATATCTGTATCTTTTACTCTATMCCTAATTTTTACATTTTCAGAAAAAAATTTT 

TACAACAATGTAAAAAATACATGGCCCGGGTGCGGTGGCTCACGCCTGTAATCCCAGCAC 

TTTGGGAGGCCGAGGCGGGTGGATCACCTGAGGTAAGGAGTTAGAGACCAGCCTGGCCAA 

CATGGTGAAACCCCGTCTCTACTAAAAGTATAAAAATTAGCTGGGCATGGTGGCAGGCGC 

CTGTAATCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCAGGAGGCAAA 

GGTTGCAGTGAGCCCAAGATCCGCGCCATTGCACTTCTAGCCAGGGAGAGAAGAGCCGAG 

ACTTCATCTTAAAAAAAAAAAAAAAAAAAAAAGGTC 

Sequence 684 

CGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACACTCTGGCCCTGGCTTTATTT

TTAGATTTTCCTTTCCCGGTTGATATCGGAAGGCACAGAGGCAGGAGGTGGGGTGGATAG 

TAATGTGTGCCCCCTTGGGGGTNANAGTGAGGTGGAGGGGGATGTTAATNACCATGAGAG 

GCAGAGGGTCAGNCNANTTTCCANNGCTTCNNGCTTCCTTTAAATGANGGAAAACACGTG 

CANGTNTTAGGAGACAAAGGAAGGGAANTGACTGTTTCCTGGCCTGGTNTGTGGGCCCAG 

TNGNCTGNTNCNTTCAGTGNCTNCGTGCANTTNGACTNTACACNTANGNNGGCAGGCATA 

GGTGTNCGGTTNTGAAAGACNGNNNTCTTTNCACATTCTCTNCTGCTCTAGGGACTGAC 
Sequence 685 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTAGTAGAACTCTA

GGAACTGACCAAACCCTTTTAACAACCAGGGGAGTGCTTGATGAAGGAAGAGGCTACCGA 

TCTTTCATAAGTATGAATAAGCAGCATGCATAAACCAATTACCTTCCCCTATTCCTCACA 

ACCACCACCACCACACCCACCACCTCTTGTGGCAGTGGGGATAGCAGCCCATGTTCCTGG 

AGTTGCTAACCGGTGCCAGGAGGGACAGTAGGGATCATGTCCTTCAAAATTTAGGGTTGT 

ACCT 

Sequence 686 

CCGCGGTGGCATGCATCAAGGTGACAGGTGACGGCATGGTTATGGATTAACTCACCAAGG 

AAATGAGTGTGGAAAGAAGAATGCANAAATCTGAGGACTAGAGCCTGGAGATGGGGAGCT 

TCGAGCTCAGAGGAAGAAGAGGATCTTCATCACGGGGAGACATCAGCCTTCTGAGTATCT 

GGGACTGCAGGTTATGTGCCACCACACTCGGCTAATTAAAAAAAATTTTCTTAGAGACAG 

GGTCTCTCTACGTTGCCCAGGCTGGTCTCAAACTCCTGGGCTCAAGTGATCCTCCTGCCT 

CAGCCTTCCAATGCCTTGGGCTT 

Sequence 687 

ATGCATCAAGGTGACAGGTGACGGCATGGTTATGGATTAACTCACCAAGGAAATGAGTGT 

GGAAAGAAGAATGCAAAANTCTGAGGACTAGAGCCTGGAGATGGGGAGCTTCGAGCTCAG 

AGGAAGAAGAGGATCTTCATCACGGGGAGACATCAGCCCTTCTGAGTATCTGGGACTGCA 

GGTTATGTGCCACCACACTCGGCTAATNAAAAAAAATTTTCTTAGAGACCAGGGTCTCTC 

TACCGTTGCCCAGGCTGGTCTCAAACTCCCTGGGGCTTCAAAGTGAATCCCTCCCTNGCC 

CTCAGCCCTTCCAAATGCCCTTGGGGGCTTACAGGCCATTGGAGCCCCACCATGTGCAAA

NGAAAAGAANAGCAA I I I I I GGACATCCTGCCCAAAACAAACAAAGGTTTGGGCAATGGG 

TCCTGGTCAAGCAAAAACAAGTGGGGTTTGGNGGAATAAACCAACCCTTGGGTTAAAAAT 

M 

Sequence 688 

NGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGGGGCCATTGAGACTGCCATGGAAGACT 

TGAAAGGTCACGTAGCTGANACTTCTGGAGAGACCATTCAAGGCTTCTGGCTCtTGACAA 

AGATAGACCACTGGAACAATGAGAAGGAGAGAATTCTACTGGTCACAGACAAGACTCTCT 

TGATCTGCAAATACGACTTCATCATGCTGAGTTGTGTGCAGCTGCAGCGGATTCCTCTGA 

GCGCTGTCTATCGCATCTGCCTGGGCAAGTTCACCTTCCCTGG 

Sequence 689 
CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGGATACTCATTAGAGTTGCTCGGTGG
AGATGGAATGATGGTGGGGTGCAGTTAAACATGGCTGAGTGCTTTCTGCTTAAGGACCTG 
ATGTATTAATGCTCTCCAGGTCATTCATATTTGGGGGAAGGAACAAAGAGGGTACCT 
Sequence 690 

CCTATAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGCTATGAATCTCACA 

GATGTAATAATGAGTTAAAGAAGCTAGGCACAAAAGAATATTACTGTATGATTCCAATCA 

TATAAAGTTCAAACCAGATCAAACTAATCAATGAACGAGGAGTCAGGATTCTGGTTATAT 

TCAGGGATAGTGATGGAAGAGGGCTATAAGGAGGGTGTCTGGGTGCAGGTCATGTTCTAG 

ATCTTGATCTGAGTGGGGGTTACATAGGTGTATTCACTTCATGAGAATTCAGAGGGCTGC 

ACACTAATGATCTGTATAATGCTCCTCTATAGTATGTCACACTTCAAAAAAGTTTACAGA

AACAGTTCCTTCCTAATTTTCACAGGGCCTAAGAGCTAAAAACGCAGCCCCAG 

Sequence 691 

NCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGGGGCCATTGAGACTGCCATGG 

AAGACTTGAAAGGTCGCGTAGCTGAGACTTCAGGAGAGACCATTCAAGGCTTCTGGCTCT 

TGACAAAGATAGACCACTGGAACAATGAGAAGGAGAGAATTCTACTGGTCACAGACAAGA 

CTCTCTTGATCTGCAAATACGACTTCATCATGCTGAGTTGTGTGCAGCTGCAGCGGATTC 

CTCTGAGCGCTGTCTATCGCATCTGCCTGGGCAAGTTCACCTTCCCTGGGATGTCCCTGG 

ACAAGAGACAAGGAGAAGGCCTTAGGATCTACTGGGGGAGTCCGGAGGAGCAGTCTCTTC 

TGTCCCGCTGGAACCCATGGTCCACTGAAAGTTCCTTATGCTACTTTCACTGAGCATCCT 

ATGAAATACACCAAGTGAGAAATTCCTTGAAATTTGCAAGGT 

Sequence 692 

GGGCGA ATTGGAGCTCCCCGCGGTGGCGGCCGGGCAGGCAC I I I I I I 1 1 I 1 1 I II I I I I I 

1 1 l l ll INCCATANAGATGGGGCTTTGCCATGTTGCCCAGGCTGGTCTCAAACTTNTGAG 

CTCGAGCAATCTGCCCACCTCGGCCTCCCAAGGNGCTGGGATTACAAGCATGACCTGCCG 

NGCTGGCTAAAGTTTCTTATTTATACTTACTCATTCCTCTAATATCTGGATTTCCTTAGT 

CATCTGTCACTTCTCCCTGCATATTTCCTGTGATGTCTTTAGGTCCCTCCCACTNTTGTA 

GTAGCACTCCCTGGGGACCMTTTCGMGGATGCTGAGTCATATGGTTTTTGGTTTTGAG 

AGGGTTGAAAATGGAGACTCMCTCAATTTAGGAGCTATCCCATCATAACTAGTAGCAAA 

ACACGTCACTACTTGAGTCTCAACAAAAGACAAAAAAGGTTTNAAAGTTGGGGAACAAAT 

AGCTGCCAAGGGTTNTTNTTrn-NTGACAMAACATTGNGTTGGGGATTTAAAATCNATGT 

GMTCCTTAATCCCTAACTCATCCATGTTGGGGTTTTTT 

Sequence 693 

CCNCGAATTGGAGCTCCCCGCGGTGGCCGCCCGGGCANGGTACCTGCTTNCCAGAAGTGT 

TATCATGATTAAATGACAGACCAGTGGCAGTAGCATCTCCTGAGGGAGGGTTAGAAATGC 

ATATTCTCAGGCACCACTGCAGTCTTGCTGAATCTGMGCTTTGGGGATGGGACCCGGTA

GTCTTTTTGGATAACTCTGCCAAGNGGTTCCAATGTGCTCAAGTTTGAGAGTTGCTGAAT 

TAAAGCGCTGGGTCTTGCCAGGCATACCTGTAATTCCAGCTCTTTGGGAGGCTGAGGTGG 

AAGGATTGCTTGAGCCCAGGAGTTCGAGACCAGCCTGGGTAACATAGCAAGATCCTATCT 

CTACCAAAAAAAAAAAAAAAAGTACCT 

Sequence 694 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGGTCTCTGTCTTTAC 

AGCTGAAGCATCAGAGGATGGAGTGACCAGGCTGGTTCCAATGACAGTTATACGGCCATG 

GGGAGTANACATGGAGTCTAATTCAGTGCTTGAGGCTAAGAATGAAGTTGTATGCATTGT 

GGAAATTGTTCCAGGAGATCTTGCAACTTTCAAGTTTGAAGTCATGTCTGTGACAGTCCA 

GGAATNTGATGCAGCTGTGGAAGACCAGGTGGAAGGGTGTTCTGTAGAAGTTGTGCGCCT 

CTCTGTGGCCGGGGTGCTGTCCATGGTACCT 

Sequence 695 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACACATGTATCANG 
GAAMGAAAACGTTATTTGTCCCACAGATGCTGCTAGGAGCAGCTACCCCAAGACAGGCC 
TTGCACCTTGGGTCATGACAATGCGTGGCTACTGAGAGCTGTTGACAGAGTGGACAGGGC 
CCAGACCAGGACAGTCTCTCTAGAGGTCTTCACCTCCTCAACCGTAACTTAATCAGCCCC 
ATGCCGGGCTAGCCCCATGCCACAAAGGCTCAGAAATGCCCTGCAACATGTGGGACACCT

GGTAGTATCTACATAGGGGCCAGCATCCATCCCAGCTGCTGGGGGTGGCTCAAGAGCTGT 

GAGGGACACCCTTTCCTGCCTGATACCGTGGACCAGTTTGCAAAGAGCTGACTGTCCTGC 
TAGGCCCA 

Sequence 696 

TAGGG CGAATTGGAGCTCCCCGCGGTGGCGGCCGCCGGGCAGGTA CI 1 1 1 1 1 II I 1 1 1 1 I 

1 1 1 1 IGTATATTTAGTAGAGATGGGGTTTTACCATGTTGGTCAGTCTGGTCTCGAACGGN 

TGACCTCAAGTGATCCGCCCACCTTGGCCTCCCAAAGNGCTAGGATTACAGGCATCAGCC 

ACTGTGCCCAGCCAGCCCTATGCTTTTAAGAGTTCGATGGTTGAAAGAGACTGAGCGGGG 

AAGGTAGAGCGGGGCAGGGGAGGGACTACTTGGAGTCAAGTCAAAGTTTTAGGGAAAGAC 

CTGAATCTGAAAAAGATTATTTAACCTTTATGTGTCTGAAATACTATATTGTGCGMTTG 
TACCT 

Sequence 697 

CCGCGGTGGCGGCCGCCCGGGCAGGTACACAAACACGACAGAAGCCCACGGAGCAAGCCC 

TGTGCTGGCCCCTTCACATGACTTTAGGCCCTCTAGCAAGGTGAtGTTTATTTACAGGGT 

TGCATAAACAAGGCCTCACCATTCAAAAAACCTTGTATTCTATTACATGTTTCACATTAA 

CMAGACTGGAAATTCTNTAGGAAAGGGATCTTTTTTATCTACATGAAAAGCACAGGCTA 

GTAAAGACTTGTTGAAAAAGTTGAAAGAACATAAATGTATATGGTATATGCCACATAGCA 

TAATGGAGGAAGATAGCAAATAGGAAACATATTGGTGAGGAAGACTGGAGTTTGATGATC 

TAGTCAGGAAAACATCAAGTTAAATCCTTTACTTTACACCTAAACCATAAACTGGTGAAT 

AAAACAAGTATGTGAAAGCACAANANGAGAGAGGACAGGCCGGGCGCAGTGGCTCACGCC 

TGTAATCCTACACTTTGGGAG 

Sequence 698 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCATCCTATGCAGNCATNC 

TGTNAGNACCCATTCCATTNTNCTATCCCTGGNTNGCTGGTGTCAATACTNTNAAGCGAN 

TTACTGCNGGNGCTCTNNTTTTTCCCTCANAGATACCNNGTTGATTTCTTTGATTCTCTC 

CATCTCTACAGGCATAATAACTCCTAATATTTAAAAACNCTGTAGAGGGATGNANNGAAG 

CTGNGGNGAGAGCCCNTGGGCTTTTCNCNTGGGTNAAGATGCACATTCCTGAAAATTNTG 

GGCCTTGGCTTAAGCTGNACTAGNGCCGGCCACTCAGCTGATCTCACTAGCGTCACCTGT 

CGCAATGGTGCTGAAGCGCACTNCCNAGAGGCCATAAGGCAAAGCGAGAGTNCNTGGCTA 

TNGACTGGANCCCATTTAAGCAAAAAAACATGCCTCNCGNANGACAAATTCNATCAACAA 

AGGGNGGGCAATACAGGATCTGTACCTGCCCGGGCGGNNCGGGCANGAACC I I 1 1 1 1 1 I I 



Sequence 699 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCAGCCCGGCACCCGGCTT 

GTGTGTCATCCTGGGCCCAGGCAGGTGATGATGCCAAACACAAGGCCCAGCTATGTAACC 

AAGTAAAAACTTTCATCAGAATGCCCATCTTTGTGACCCACAGCCCATTGTCAAGAGCCT 

TCCCTGTGCCAGGAGTTCAGCAGGTTCACCTCCGCCTCCACTAGTCACTAAGACACGGAT

ATTTTAAGAATTAAAAGCCTCCACAAGCCAGGCACAATGGCTTACACCTATAATCCCACA 

ACTTTGGGAGGCCAAGGTGGGAGGATCACTTGAGCCAACGAGTTCGAGACCAGCCTGGGC 

AACATAGCGAGACCTTGTCTCTACAAAAAAAT7TAAAAGTTAGCCAAGCATGGTGGGGCA 

TGTCTATAGTCCTAACTACTTGGG 

Sequence 700 

GCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTCAGTAGAGGCCAGGTTTCACC 

ACGCTGGCCAGGCTGGTCCTGATCCCCCGGCCCCAGGTGATCCGTCCACCTCAGCCTCCC 

AAAGT GCTGG GATTACAGGCGTGAGCCACCGCACCCGGCT C I I I I I I I I I I 1 1 I I IAAAA 

TCATGATTTTAACAGAAGCCTCCATTCAAGGCGAGACATGCCTTTTATTTCTCTAATTGC

GAGACACT7TTCTGAATCCTCTTGTCAGTTGCACCTTTTAATACAATTGAGGTGACACTG 

TTCTTCATGGTGACACTGGTCTTTCCCAAGAGGTTTCAGCTAATTCAGTCTATCAGATTT

TACATCAGATTTTAAAATTTGCTTCAAACTTGGGTGCTTGTATTCAAATTCATGCTTCAT 

AGAAAAATGCATATCAAGTTCAACAGTTGACTAACTGCAGCCACGTTCACAGTACCTGCC 
CGGCGGCCGCTCTAGAACTA
Sequence 701 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGAAAAGGCTGATACACAC 

TGACAGATTTTGTAACAAGGGACATTTAAAACTGAGCTGGTAATAGACTTGATTTCTGGT 

GTTGCCACTCAATAGGCATGACTAAATAGTGTACCTCACTGTTCTACTTTTTATAATTAA

AATTTTAGAGGAAGCTGAGTTCTTGTATTrAACTACAAGTTAGAGACTCAGCCCACAAGC

i i i i i i i i I 1 1 I I I I 1 1 AATATGGTTTC 1 1 1 1 I I I I I I I 1 1 GAG ACGGAGCCTTGCTNTG 

TCACCCAGGCTGGAGTGTAGTGGCGCGTCTNTGCTCACTGCAATCTCTGCC7TCCCGGTC 

CAAGTGATTCTCCTGCCTCAGCCTCCTGAGTAGCTGGGATTACCGGCGTGCACCACCACG

CCCAACTMTTTrAGTATTTTTAGTAGAGACGGGGTTTCCCCATGTTGGTCAGGCTGGTC 

TTGAACTCCTGACCTTGTGAACTGCCCACCTTGGCCTCCCAAAAACGCTGGGGTTACAGG 

CGTGAGCAACCATGCCCAGC C I I 1 1 I I II 1 1 I 1 1 I IATTT 

Sequence 702 

AGGTACGCGGGATATATNTAAATTTAAGAAANCATCCCCGGTAATATGGCTCTTCATAAT 

TCTAAGACTAAGGCTGGNGTAGAAACCTAACCACCTACCTTACAAGNGAAGGGGGCTATA 

CCATGGGGTAAGCCAAGTTTGAAATTTATGGGGAATCNTACCCAACTTGGNTTAAGGGGG 

CCCTNGGATTTGGCCTNGGGGGCCMGNNTTTTCTGTATTTTTTATAAAAGGTGATCTTN 

CATNGGTATTCCCTTGGTTTACCTTGGATAAGGGGGGATTACCAATGCCTTCTTAAGGAA 

AAAAATTCACCTTATTTGGGCCTTGGGGGGAAGGTAGGGTNGGGCTTCAATAGCCCTTGG 

TAAATTCTCCCAAGCCACTTTNGGGGAAGGAAGGGCCTGGANNGTTTTGCCGCCCCACTT 

ACCACTTCCCAAGCCCTTGGGGGGTGGAACCAAGAAGTGGGAAGGAACCTCTTGGCCCCT 

CAATATNNAAAAAAAAATNAGAAAGGNAANATTNCACCTATTCTTACCANAACCCCTAAG

NACCTAA I I I I I AAAAAAATACCAAAAAGAATTGGCCCTNGTTTNTCAAAAACCACCTAA 

TTTGGGAMTMMNMNGGGGTGGMGAATTATTTTCTTTAACCCNNATNGGAATAAAA 

ATNNATNNNATTNGGGGNTCCCTTGGCCCCNGGGCCGGGCCCGCTTCTAAAAACNTAAGN 
GGGGATCC 

Sequence 703 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGNCGGGGCAGGTACTACTGACTTACCTGCC 

ATGGGCTTTCTCAAGACAAGTCCTGCAGGAGAGGCACACTCACTTCTAGCGTCAACTATT 

GAACCAGCCACTGCCTTCACTCCTCATCTCTCAGCAGCAGTGGTCACTGGATCCAGTGCT 

ACATCAGAAGCCAGTCTTCTCACTACGAGTGAAAGCAAAGCCATTCATTCTTNACCACAG 

ACCCCAACTACACCCACCTCTGGAGCAAACTGGGAAACTTCAGCTACTCCTGAGAGCCTT 

TTGGTAGTCACTGAGACTTCAGACACAACACTTACCTNAAAGATTTTGGTCACAGATACC 

ATCTTGTTTTCAACTGTGTCCACNCCACCTTCTAAATTTCCAAGTACCT 

Sequence 704 

CCGCGGTGGCGGCCGAGGTACTGTGAAAGAACTAGCACTTTGAGCAGAGAACAAATGCCT 

TACTTGAGTTTCCCCTGGACTCTATCCCTATTCAAAGATGCTTGGTTATACCTCAAGAGG 

GAAGCAATCCAGACCAACTCCTATGACATGACAGGCACTCAAGCCTGATGAGGCAGAAAC 

CTGGCAGCTGTAGATGTTGGAAAGGATAATTTATGTGTTCAAGTGACTAGGATTCTAAGG 

GCTAGATGCTAGCTTCAAGCACGGCTGGATCTAGGAAGCCCTTTTGCTCTCCCTTTTCCT

TGGTCTACTTTTCTCTGTAGGCAAGTTCATTCATTCTTAGGCAAGTTCTCTGCATGTGGC 

AGCAATGATGGACACTGGAATCTCTGGGTATTCAGAGTTCTTTCAGTAGCAG 
Sequence 705 

CGGGCGGGTACCTTACCACCCCATCCCCAGAGCATTGCATGGGGTGTTTGGCACACAGTA 

GGTGCTCAATGTAAACGTGTGCACTGTGGCATGTTAGAGCCAGACAGGATCTCATCCAGC 

CCGTTCTCTGCACCCCTCCCTCCCCTCTCCAAGTAGCCCTGCTGTGGGTTCAAGTAAAGA 

GGGGCTGGGGCGCTGGTCTGATTGTGTGGGTGATTTGGGGAGATCTCTTCCTCTTCCGGA 

ACCCCAAANGGTTGGGACAAACACAGCAACAAGCCCAGCTCCCTGAATTTCAGTGATTCA 

TTTGTGGGATAAAGGAGTGAATGATAAAGTGAAGGACGACTGTCCCCGCGTACCT 
Sequence 706 

NGGTTAANTGCCGCCNCTTGGCCGTAATCATTGGGNCATTAAGCCTGGTTTTCCCTNGTG 
TGAAAAMTTTGTTATCCCGCTCCACAATTTCNCACACCAAACATTACCGAAACCCGGGA

AGNCAATAAAAAANTGGTAAAAAGCCCTTGGGGGGGTGGCCCTTAAATGMGGTGGAAGC 

CTAMCCTCMCMTTTAMTNTTGGCGGTTGCGGCTCMCTTGGCCCCCGCCTTTTTNC 

CAAGANGCGGGGGAAAAA 

Sequence 707 

GGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGAACATCCATGGCAGACGCTATT 

CTTTCCCTCTTAGAGATGCAGACACTGAGGCTCAGAGAAGTTGTCCCGCACCCAGTATGT 

GATGGAGAGGTAGAGGGTAAAMCATCAACTGAAGGATTTAGCATTTGGGGAAGAAGGAA 

GAAGCCCAAAATGGAGTAGATCAAAGGCTCCCCCGTGAACAAATTTAAAATTAAGGAGAA 

AGAAGCAGAATTCAGTCTTCTCCACACCCATAACCAAACAGCTCCTATGAAGGCACCAAG 

CCTGACGCTCATCCCAATAAAAAGGAACGATCTGGAGAGAGGGGCAGCCGCTGGTGACAA 

GAGAACCCCCCAGGCAGCCTCGTCATCTGGCCAG 

Sequence 708 

GGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTTGGGGGACAGATATCATAT 

AGGGGATTCATGTCAGTGACCAAACGAAGTGACCATTACAGCCCTTTNGAAACCTGAGGT

GTAAI I I I I AAAAATGAACTCATGACTTTAATAGTCATAGACTCAAACCTGAGTTGATTA 

TTATGMTTAGTTTATGGGAGTCTCMTATGTGMTATGATGGAGACMGTTTTGGAATA 

CAGATAMTCAAGTCACTGTATTCACTCTCTCTCTCTCTCTTTGAATAGCCTTATCTTTG 

CCTATACACACAMCAGTGCAGCCATCAMATTTrCAATTTACAAAATGTTCACAGTCAT 

GCTTCTTCCTTGACTAAACACTGGGGTTGCTGCCAGTGGTAATTGGCTTGAAACCAGCTA

ATTTTTATATATCTATTTAGTCTGGATATTCTAGATGAGTGGGCACTATAGT 

Sequence 709 

ATTGGAGCTCCCCGCGGTGGCGGCCGCCGGGCAGGTACCCACGTTTTGCTCCCACTCCTT 

GACCGCAGGGGCTCGGACACAAACCCCTGTCACCAGGAGAGTCAGTCAGCACTACTTGGG 

AGGGCTAMGGGAMTTTGGAAATAAAATTCCAAAGTTTGGAGTAAAAAAATTCAAGTGT

TGATTTTATATTCTTTCCCTTTCTGACACAGCCTAMGCGTAGGGGGMCATGTGTTTAT 

CTGTGGGAGATAMCMGATGGAGTCCCAAAGACTTTAACAAMTATTTTTTTAAAAATC 

CACTAGAATAGAAMTACATTATTTAGATATACTTTATGCTGAGAGTGAGTATATATGCT 

TGTCCTATTTAMCTTGTGAGAAAAAGTGGTATCCCTTGATACATTTAGAAATATGGGGG 

CTATCTTGGTTCATTGNGGGGGGTGGGGGCAGAAGGAGAATAAATGCAGGATGCCCTTGT 

TGAAAGGAATCTTAGCATGGCCCACAGGGGACGTTTCCAGTCGATTACCAAGGAATGCCA 

GCCT 

Sequence 710

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCGGGCAGGTACCCACGTTTTGCTCC 

ACACTCCTTGACCGCAGGGGCTCGGACACAAACCCCTGTCACCAGGAGAGTCAGTCAGCA 

CTACTTGGGAGGGCTAMGGGAMTTTGGAAATAAAATTCCAAAGTTTGGAGTAAAAAAA 

TTCAAGTGTTGATTTTATATTCTTTCCCTTTCTGACACAGCCTAAAGCGTAGGGGGAACA 

TGTGTTTATCTGTGGGAGATMACMGATGGAGTCCCAMGACTTTMCAAAATATTTTT 

TTMAAATCCACTAGMTAGAMATACATTATTTAGATATACTTTATGCTGAGAGTGAGT 

ATATATGCTTGTCCTATTTAMCTTGTGAGAAAMGTGGTATCCCTTGATACATTTAGAA 

ATATGGGGGCTATCTTGTTTCATTGTGGGGGTGGGGCAGAAGG 

Sequence 711

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGACGGTGAGCCGGAG 

GAGTCATGTCAGAGGGGCGAGCAGGAGCGATTCCGTCGCCAAACAGGTTATGAGTGCCAG 

TGAGCCGCCTTAGATAGAAGCATCGTCAGCACTTTATTAATGATGGATAGNGAGAATAAA 

CCCGAAAATGACGAGGATGAAAAGATAAACAAAGAAGCACAAGACTTGACAAAGCTTTCA 

TCCCATAATGAAGACGGTGGGCCTGTATCTGATGTGATAGCAAGTTTCCCTGAGAATTCT 

ATGGGCAAMGAGGTTTTTCAGAATCATCGAACTCTGATAGTGTTGTTATAGGAGAAGA 

Sequence 712 

NCCCGCNGTGGCGGCCGAGGTACTCTTATGAGAGGAACATTAAAATTTGCAATTATAATG 
CAAAGAAACAGGAGACGATCGTGAGAATAAGCAATGTCACACACATTTCTCTCCAAACTA 
TATGTATCTTGTCCTTTAAATTCTTTGACGTGTGTGTGGGGCCTGGGTGGGGGGTGTGGG
TGTGTGGTGTTCTMGGGCCTTTTCTACTAATTCCAATAACTGGGTTATCTGTGAGGCTG 
CTTATTCTCACTGTATTTTACTGCTTCTTTGCCTTTCTTG H II 1 1 I II II A ACCATACT 
CAGGTATGGTTAAAATGTAAATGAAAGAA 
Sequence 713 

CCGCGGTGGCGGCCGCCCGAGCAGGTACTGGGGCTGCACAGGCTGTGGCAACACTGGCTA 

GTCAAAGCCTGGAAAGACCTGTGAGCTTGAACTCTTCTGGGTTGCCATTGTTCTGTTGGT 

TTCAGCAGTGAGACTGAGAGAGCCTGTTCTGTTTAGAAMGCCACAGTGGTTTTCTAGGT 

MAGTCTGCAGGAGATGTCACTTGGTGCCTTrrCMTACGAGTTTTCCACCTGCATTTTT 

GGMCCATTATGGGCCTTTTAAAMTTTATTAATMGTCTCTTAMTATTTTATAATCTA 

GCTTCTGAGACAAGATGATTTTAAMCAGTTATATGCTCTAMTTAAAAATTTA 

Sequence 714 

AGGTACCGTGTGAGCAGGTGGCGTTCACCAGGGGTGAGACTTTATTGACAGTAAGTTGCC 

TCTGCCAAAAAACGCCCTCATATGTCTGCTGATGTTTGAATTNCNNCCNNATGGCAGGAG

GTTTTTTGGTCCTCCCCAGNTTTAAAMAAMTTGGTTAAAAATMCCTGGGTTTGGTTN

TTTCTTTTGNTATGGGGAAGGCCCTTCCAAGNAAGNGGAAAATAAANAAAAAAGTAATTA 

NANCCTTAACCTTCCTTAAGGGGGAATTAATTGGTAATTTCAAAA I 1 1 I I I GGAATGGCC 

TTANCTTTTNTTAA I I I I I I I I AM1TTTTTAAATTTTGGGANGGAAACCAAGGGGGGGG 

TTTCCTTNTGGGCCTTTCTTGGTTTGGCCCCCAGGGGGCNTGGGGNANGGTTGGCCAAGG 

NCCAATNTTTNNGCCAAMTTTCCAMCCMGGGCCCTTTCNAACCTTTGGGCCAAGNTC 

CCCCTTTCAAAAACCCCCTTTCNNCCTTGNTTTGGGCCCCCCAAAA 

Sequence 715 

CCGCGGTGGCGGCCGAGGTACCGTTTTATGATGATAACATAACTTTAATGCTCCAACCTG 

AGAAAGATAAAATAGACTAAGATGACCATTGAATGCAAACAGAAAGTTCTAAATGAACAA 

TCMGNCAGGACCTGGAAATTTCAGGTCCCTGGTGGTTGGAAAANTrAAATTAAATTAAAA 

ACCAANTTTCTTGGTTTTTCCAAGGAMMNTGGI^AAAAAAAATTMGGTNTTTAAAAT 

MNCCCCAGGGAMMTTTCCAAATTCAAMTTATTAAAAAGGCCTTAAATTAAATTAAT 

ArTTTTTGGCATTTCNMGGCCCCAMC 

GGGTAATTAAACCAGGGCCAAATTTNATTAAAAAAATTCC 

Sequence 716 

AGGTACACGATTATTTCACCATCCAGGTATTAAGCCTAGCACCCAAGAG I I I I I I I I I IG 

CTTCTCTCCTTCCTCCCTCCACCCTCAAGTAAATCCCAGTGTCTGTTGCCNCCCNTNCTT 

CGGTANNAAC CAAG GT GG I MI NN I AAATTTTTCCACCAAAAAAAAATCTTCNCATNC 

MCCTTCCTTCTTTTCMTTTTTTGGCCTTNNCCTTAAAACCGNMCCTGGAAAAACCCT 

TNCTGGGTTNTGGCTTCCCCCAGTAAAAGNACCAMGGAMNTTTTAAANTTTCMTTTT 

TTTMTTCCCCGGGTTTTTTAAAMAGGGTTNAMAGGGAMGGNTTTCCTTAACCCCCA 

AGTGCCCMGGGGAAAGNTTGGGGTTGGGGGGCCTTCCAACCNAATTNCCTTAANTTAAA

MTTTTTGCCCCNMGGGCCAAACCCTTTNNTTTGGGAAGNGAAAGGGGGGCCCCCCGGA 

MGGGGGCCCCGNMGGGTTTGGGGGAMNTTCCAMCCNMmGNTAMGGGGGTTTT 

TGGGGGGAAAAAG 

Sequence 717 

CCGCGGTGGCGGCCGAGGTACTACAATAAGGACAAATATTCAAAACATTCTGTTAAGTAA 

AATAAGACAGTCAAAAAGGAAAGCTGTATAATTACACTCATGTAAAAATATTTAGTCCAA 

CNCTCACAGGANAACCAAAGGTGGTCAATAGGTTCCTCAAGCCAGGTGGCCACCCCAAAG

GAATGGTTAMCCAAGGTTCTTCCTTCNGTTAAGGTTCCTGGAAGGAATTAAAACCAATT 

CCCCAAGGAGGTTTNCTTTTTGGTmCCTMCCCTTCTTAAAGGGGAGGMTm 

GGGAGGTGGTTAAAMNCAACCAAAAMGGGTTTGNAMGGGNTTTTGGGGGAAGGNTTT 

GGGAAAAAAANGNTTTTTAAAGGNA 

Sequence 718

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGACTGTCTCAAATTCTT 
GGATAGCTGTCCCTCATGTCACCTAGCTGCTGAGAGCTTTGTGATCCTAACAGGTGATGA 
CTCAGACCGACACTGCATTGGTAGGAATTCCACAAATAGGTGCCTCAATGTGCCTAGATT

GAAATATCAGCCTTTCCCAGACTGACCTGATGGGTTGACTTCAGGTGTGGTGTAAACACC 

TACATTTTAATGTAMCATTTCAGTGTMTCMTGAGMCTATCATTCTGCTTTAATCAC 

CATGAGTTCTGAAATAACAMGGATTTGTCTGACATTCATTCTMGAMTTCATTCTTAC 

CTGACTMGAMCTTTTTTAACCCGGCACAATAATAAAGAAATGACCTGTNAGTACCTGC 

CCGGGCGGCCGCTCTAGAACTAG 

Sequence 719 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGACTGTCTCAAATTCTTGG 

ATAGCTGTCCCTCATGTCACCTAGCTGCTGAGAGCTTTGTGATCCTAACAGGNGATGACT 

CAGACCGACNCTGCATTGGTAGGAATTCCACAAATAGGTGCCTCAATGTGCCTAGATTGA 

AA TATC AGCCTTTCCCANACTGACCTGATGGGNTGACTTCAGGTGTGGNGTAAACACCTA 

CATTTTMTGTAAACATTTCAGNGNMTCMTGAGMCTATCATTCTGCTTTAATCACCA 

TGAGTTCTGA MTMCA AANGATTTGTCTGACATTCATTCTMGAMTTCATTCTTACCT 

GACTMGAAACTTTTTTAACCCGGNACAATAATAANAAATGACCTGTAAGTACCTGCCCG 

Sequence 720 

CCGGGCAGGTACCGCTGTGTCCGGGTGGGTGGTCAGAATGCTGTGCTCCAGGTGTTCACA 

GCTGCTTCGTGGAAGACCATGTGCTCCGATGACTGGAAGGGTCACTGCGCAAATGTTGCC 

TGTGCCCAACTGGGTTTCCCAAGCTATGTGAGTTCAGATAACCTCAGAGTGAGCTCGCTG 

GAGGGGCAGTTCCGGGAGGAGTTTGTGTCCATCGATCACCTCTTGCCAGATGACAAGGTG 

ACTGCATTACACCACTCAGTATATGTGAGGGAGGGGATGTGCCTCTGGCCACGTGGTTAC 

CTTGCAGTGCACAGCCTGTGGTCATAGA 

Sequence 721 

CCGGGCAGGTACCGCTGTGTCCGGGTGGGTGGTCAGAATGCTGTGCTCCAGGTGTTCACA 

GNTGCTTNGTGGAAGACCATGTGCTCCGATGACTGGAAGGGTCACTGCGCAAATGTTGCC 

TGTGCCCAACTGGGTTTCCCAAGCTATGTGAGTTCAGATAACCTCAAAGTGAGCTCGCTG 

GAGGGGCAGTTCCGGGAGGAGTTTGTGTCCATCGATCACCTCTTGCCAGATGACAAGGTG 

ACTGCATTACACCACTCAGTATATGTGAGGGAGGGGATGTGCCTCTGGCCACGTGGTTAC 

CTTGCAGTGCACAGCCTGTGGTCATAGAAGGGGCTACAGCTCACGCATCGTGGGTGGAAA 

CATGTCCTTGCTCTCCMTGGCCCTGGCAGGCCMCCTTTAGTTTCAGGGCTACCACCTG 

TGCGGGGGCTTNTGTCATTACCCCCTGTGGATATTAATGCTGCACACTGNGGTTATGACT 

TGTACCTTCGGCCGTT 

Sequence 722 

GGAGAGGAAATGTGTAGGGGTGAGGGATGATACAAGAAAGCCAAATCCTCATCTTCTATA 

GTAGAGAGTCAGCGGATAAAACCTAAAAACAATACATCAAGAAATACTTACACTTATGGA 

AGGAAATACCAGAAGTTAAAAGGGGTTACTTCTGGGACATCAGACACCAGACTGCAGGGA 

AGGGC TGCC TCTTGTATTAACAAGCTTCCAGTATMTTTGCTTTTTAAAAATAGGTCCAT 

GCATTATTTTAATAAAAATTANGCTGGGCGTGGTGGCTCAGGCCTGTAATCCCANCACTT 

TGGGAG 

Sequence 723 

ATTGGAGqTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTAAGCAAGCTGTGTGACCTA 

GAGCACAGTGCTTTGAGTTTTTGAGTCTTAGC7TCTGTGTCTATAAMTGGGGTTCACAC 

AACTCACCTTACAGGGCTGTMGATTAGATTACACAGMMTATATTTTTTGGCTGTGGG 

GGCTGGMGTGTTGCTGATTAGpATTTGAAATCCCATCCTGTGGGTGAGAAAACCCCACC 

TTATGACTTGGTGGGAAACAAAGCCAACCTCCCACTGATGAAGCTGAAAGTAGCAGAACC 

TTGCTTCTACTGCCTCCCTTGCAGCTAGAGGCAGGCACAGGACTAGCCTGTCAATTGGAT 

GCAAATGCTCCAGGCCTTGAATCACAACCTGGTGACTTGCACCCAAGTCTATTA 

Sequence 724 

GGGGCCATTGAGACTGCCATGGAAGACTTGAAAGGTCACGTAGCTGAGACTTCTGGAGAG 

ACCATTCAAGGCTTCTGGCTCTTGACAAAGATAGACCACTGGAACAATGAGAAGGAGAGA 

ATTCTACTGGTCACAGACAAGACTCTCTTGATCTGCAAATACCGACTTNATCATGCTGAG 
TNTCACCTTCCCTGGGATGTNCCCTTGGACAAGANACAAGGGAGGAANGGCCTTAANGAT
CCTANCTGGGGGGGGA 
Sequence 725 

TAGGGNGAATTGGAGCTCCCCGCGGTGGCGGCGCCCGGGCCGGTACCCATAAAAATTAAA 

MCTATTTTAAAAMTAAATTCCATTTGAGCCACTCCTTCAAACCACCCAGAGTGGGTAG 

ACGTCTTTCGTGCCTCTMGAAGCCCCATCTCTATTCTGCGTCTCACCTTGCAGGGCTGC 

TCATCTGAATCCTGAAGATGGTGGACACCCATCTGCTAGGACTGAMTGAATAGGACAGA 

GGGAGGTGCAGAGTGAATGGACCATACTACCTGTCATCTTGGCAACGTGTGATTGAATAA 

MCAACTTCTTTAGMGTTTGATAGAGTGATTTGATMTGTMTTTACMGTGATCATTT 

CI I 1 1 1 A 

Sequence 726 

GGAGCTCCCCGCGGTGGCTTTTTGAGTCTGGACAGGNCTCTGTTTTTGCTTTAMGTTAA 

GAGAGCTAAATAAATGATGGTAAAAAGATAATAAAATAGAACATGAAGGGCTGTCAGTCA 

GTGTAGGTATTTCCATCCCCTCACTTTTCAAGTGAGGTCACGGAGGCTCAGAGCGATAAG 

GAGACTTGTCCAAGGCCACACACCGGCTGGTGCTCAAGCCGGGACTTGAACCCACGCAGT 

CTGACTCTAGAGCCCMGCTCCTMCTATGACATCCTATTTGATACACTGTTTTACTGGA 

GAAACAGATCATTTGACAGACATTCTTTCTGTTAGCMTTTGACAACTCTTTCCCCAGTT 

GTCTGTACCTGCCCG 

Sequence 727 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGCTA 

TTCTCTTACAGCTAGGACCCAGCGTTCTGGGCAGAATAGCAGGCAGCGCAGTAGCTACTG 

AGATTATAAGTGGTAGGTTTATGGAGCTGTGACCACAACTCTCCACAAGCCAGTGCTGTC 

TCATGCAAGCACTCTCAGTGTCCAAGTGCTGAGTGTGTGAGTGGTCTGGGCTTTGCAGGG 

TCGGCCAAGCTCTTGGAAGAGCAGGCTTTGTTAGCTGGGGAGTCATCGCTCCATGCAGGC 

CCTGAGAATGGAGCATCCTGAGTGGACTGGTAGAGATGGGGCATGGGTCACTCTGAGGGT 

TTGAGCTACTTCTGCTATTTTTGAAATTCTGGTTTGAACTGCAGGATCGTGCTGAGTTTG 

GCACAGACTMTTTCTCTGTTGGCAGCACATGATGTATCAACTCATGTGTCAGTTGGTTT 

G 

Sequence 728 

CCGGGCAGGTAC TACCT TCTCTGCTACAAGTCGAGCGAGGAGCCCCGCATGAGCCCTGAC 

ACCTGTGCCACCATTTTGGAAAAAGCTGGTCTCGATAACTGGGCTCTTGGAAAAACAAAA 

GTGTTC CTTAA GTATTATCACGTGGAGCAGTTAAATCTAATGCGAAAGGAAGCTATTGAC 

MGCTTATTTTGATTCMGCTTGTGTCAANAGCATTCTTGTGTTCAAGGAAGGATACCAA 

AAAATACAGGGAGGAAAAGGGAAAGGAAAGCCGCTTATAATAATACCAGTCAGCTTGCAA 

GGAGGGACCACCCTTGTCAGGGAAAACMNMGAAAAGGAAAATTTGGTTTGGACCATTG 

MAAAACCCCAGCANTTMCCAACCCAATTTCAAAMCTTTCTTGATTCAGGGAMTTTC 

GACTTACCAAGAAAAAACCTTTGGNAAAAATACCCANGGGGGGNTCCTGGGNAAAGGGNA 

GGANGGGAGCCCAAAAAAANAATTTGGANAACCCCCCNANGACCGACCCCCCNGGAAACC 

CCCA 

Sequence 729 

CCGCCqGGGCAGGTACTTTCI I I II I I I 1 I I I I I I 1 1 1 1 I I I ACGG AGTCTTGCTCTGTC 

ACCCAGGCTGGAGTGGAATGGTGTGATCTCGGCTCACTGTAACCTTCGCCTCCCAGGTTC 

ACGTGATTCTCCTGCCTCAGCCTCCGGAGTAGCTGGGATTACAGGTGCACACCACCATGC 

CTGGCTMTTTTTTGTATTTTTAGTAGAGACGGGGTTTCACGATGTTGGCCAGGCTGGTC 

TTGAACTCCTGACCTCAAGTGATCTACCCACCTTGGCCTCCCAAAGTGCTGGGATTATAG 

GCATGAGCCACCACGCCAGGCCCACTCTNTAMTTTTGACCACCCTGCCTTGAGTGGTCT 

TCTAGCACCCTMCCTCTGTCTMCCTTCGAGAGCTTTGCACTAGCNATTCCTGGGGACC 

AG CTATG GTTGGTATCTTCTCAACTTTCTAA I I 1 1 1 I I AAAAATATTATTATTATTATTA 

TTATTTTAAAA 

Sequence 730 
TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGCCCGGGCAGGTACCTGTGTGGAA

AAGAATGCTTGCAMGCTTGTCACCCTCACGAGMTTCCTCTGACAGACATTTGCCTTTG 

ACAGTGAAAACAGATATTAAAGTGAAAGGAGAAGAAACCGAAGAGCATCAGAGGGGACGA 

CTGGGTTACTTAACTGTTGGGGAGCAATCTGAGGAGTTGGTTACCAGAGAAACTGGCGAT 

GGCGATCCCGTGAGCMCATCTCTCAGACCCATTTTAAATGCCGGGGGATACTTAATCAT 

GCTGAAAAACAGCAGAGCCCTGAGGTTTTGGACTACATGTTGCAGAAAGAA 

Sequence 731 

CT ATAGGGC GAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACTACACCTGGGACAAA 

TACTTTTTTGTGMGNCAGGTAAAGCCTTTGCGTGCAATATAGCATCTCTATGCAATGCA 

NCAACTCCTCGTCTATCGCTACAGTAAGAAAACAGCCACGGGTCAGGTGTTGNGGCTCAC 

ACCTGTMTCCCAGCACTTTGGGAGGTCNAGGTGGGTGGATCACTTGAGGCTAGGATTTT 

GAGACCAGCCTGACCAACATGGAGAAACCCCATCTNTACTGAAAATACAAAATTCCCGGG 

TGTGGTGGCNGCATGCCTGTAATCTCAGCTACTCGGGAGGCTGAGGCAAAAGAATTGCTT 

GAATCTGGNAGGCGGNCGTTTGNNGGTGAGCCAAAAATCGTGCCATTGCACTCCAGCCTG 

GGCAACAAGAGCGAACTTTCGTTTCAAAAAA 

Sequence 732

CTATAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCACGGTACTAGTTATTT 

TAAATTCCACTCATAACTTATCGGCCAAAAGTAGTCACATGGCTCCACCTAATCACAAGT 

GGAGCGGGMGTGCAATCCTACCTTGCCTGGGGAAGGTATAGAGATAGACCAGCACTAAT 

GACTACCACACTTCGCTAAGGTCACATAATAAATAAGCATCAGACATCAGGTGTGGTGGC 

TCATGTCTATMTCCCAGCACTTTGGGAGGCTGAGGCGGGCAGATCACTTGACTACAGGA 

GTTGGAGATCAGCCCGGACAACATAGTGAAACACGTCTCTACTAAAAACACACGCAAAAA 

AATACGAGGCATGGTGGTGCATGCCTGTAATCCCAGTTACCTGAGAGGCTGAGGCACGAG 

AATCACCCTTGAACCCAGGAGGCAGAGGTTTGCAGTGACCCGATATTATGTCACTGCAAG 

TNCAGCCTGGGTGACAGAGCGAGACCTTGTCTNAAAAAAAAAAAAAAAAAAGAAAA 

Sequence 733 

CGACCACTATAGGGCGAATNGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCACGGTACTGC 
TGGT TCAA GCCATTAAATTACATCACAAAGGTTTGGTTTCTCTGTATATATTTCTCTGGG 
GCACTTTTGCTANGTTGGCTCTATCCTGAGGCAGNCTCTCTCCTCGTGGNAACCAGGTGG 
CTCTAGCAGCCTCAGCTTTATATCTCTCAAGAGTAAGTCCACCGTCACAGAGC 
Sequence 734 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACACACCACCACA 

CCTGGCTAATTAATTTAAAAAAAA I MUM I AGAGATGGGATATCTNTATGTTGCCCAG 

ACTGGTCTCAAACTCCTGGCCTCCAGCAATCCTCCCACCTCACCTCCCAAAGCCCTGGGA 

CTGCMGCATGAGCCACCATGCCCAGCTATATTTTCTGTAAATTGCTAATGANAATGAAA 

CATGTATGCTGTGGACAGMGCCTTGTTGGACCTAGAGCCCATGCTGGGTCCTTTGCCTT 

MTAAMCATMCTCTGGCATTACATATATMTTMCAGCCTCAMGANCATGTTTCTTT 

ATTAMCTCTGACTGTTTCAGCATTATTTT 

Sequence 735 

GCGAATTGGAGCTCCCCGCGGGGGCGGCCGCCCGGGCAGGTACTACTGTGTCCTTTAGAT 

CACTCTGCCTTGATCACTCTGTCCCGTCACTCTGCTATTTCACCTGNCAGNGAAATACCT 

GGTATCGTCCTGCCMCGTGAAGCATTGAATGCTTNATACGTCTCCATCCTGATTGTTTA 

GGCTTTGAATGCTGAGAAGTATCTGCACTTTGTTGGTCA 

Sequence 736 

CCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAAGTACCCTTAAGCCCTGCTCCT 

TTGTAMGTCTTTTTGGATTGTCATCATCAAGAGTCAGTNGATCTCCANCTTCTCAGAAC 

TCACAGGGCACTCTGTCTAGGCATTGCTGACCGTCTGCAGTGTCAGATGGTGACTTCTGT 

ATGTGTTGTGTTTCCCGTTAGACTCTAAGGTTTTTAAAGGCGAGACTCACTCCTGCAGAA 

GCACATAACACMTGCCAMCTCTTATTTACGGAGGTCCTGGCGCATTGTCAGCTTTTGG 

TMATGCTTTTCTTTTGTTGMTACTTATCTTCTGTGTGCCMGATTTGTGTTAAGTGCT 

AGAAAAATGTGGGAGGTCACCGCAGACCCTGTTCTCATGGAACTGATGGTGTGTAGTGGG 
GTGNGGATTAACATAAATAAAATGATGCGCAAATGAACACAAAATTCAAATTGATGATGT

GTACCTGCCC 

Sequence 737 

TNTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTCCCTGAGCAGTCGAAG 

TGGATGCCCAGACCAATGGCCAGTGCTAATATCAATGCAANGATCCCAATGACGATGATT 

GGAAAAAACTTCA ATGG CAGCAGTGACAGGATCTGTGCAGCAACAGCATCTGCATCTGGT 

GCMCAGGACTTATTTTCAAATCATCAAGGCCAAAAAGCGATCGGAATGAGAAGGGGGCT 

TCMCAGCAGGCGGATCATTTTCCCCCATGGTGACTATTTCAGGACCTCTGACATCCGGC 

TCCGCCTCCACCTCTACCTCATAATTCCCGAGTCCCAAAAATGTAGATGGCACCACGGAA 

GAGATAGTAGGCCACAGTGTTACTGGCTTCCCATAAACACAGCCCTTTCCTGGCTCACAC 

GGGGCATGACCTCCCGCGTACCT 

Sequence 738 

AGCTCCCCGCGGTGGCGGCCGAGGTACATGTAGTTGGATGTCGAGGTTNGATTAGATTCT 
GGGGTTGGTTTGCTTGTTTTGGTGGATNGTTTNTC^ 

TCCACCAGAAGGCACATGTGCTTGCCTGTGTCTTTTTTGTTATTGTTTTGAGGCAGAGCC 
TCNCTCTGTCTTCCAGGCTGGAATGTAGTGGCACAATCTTGGCTCACTGCAACCTCCACC 
TCCCAGGTTCAAGTGATTCTCCTGCCTCAGCCTNCCAAGTAGCTGGGATTACAGGTGTGT 
GCCACCATGCCCAGCTMTTTTTGTATTTTCAGTANANATNGGGTTTTTTGCC 
Sequence 739 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCGCGGGCAGGTACACTTCCACGA 

GMGAATTAATATTGTAGTGTTAGGAAAACTAGCMTTTAACTAAACAGCATCAAGTTAC 

AAACCAGGAAAGTGATTTAAAACTAMTGCTGGCTTATCTTTCTGAAACAAAGCATCTAA 

ATTTGAC AGTCCAAAATGGC ACTTATTGAGTGTCCGTGACAATACATGCTGACAAGCAGC 

ACACCTCTTTTTTGTTTTTTTAAGACGGCATCTTGTGCTGTCACCCANGCTG 

Sequence 740 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCTATATAAAAATTGATTTAGCTTCTACACTCA 

AGTMTTATAAACAGGTTTNTCTTTTGGGACATTTGACAGTTATGTGAAAGGTGAGTCTT 

CGTTGTGTAGTATTGTCTGTTACACTGCAGGTGTCTAGAATTGCTGATAGTGTTCTCCCT 

CTAAAGTAATGTCACCCAACCACTTGTAAATTGACGATAATAAGACAGGAAATCAAGAAC 

CAATATAAATAAGCAAACATTTGAAAATMGAGCTAAAMTCAAAMTMTCTCTCTTTT 

TGCTGATAATACTTTATACCTAAATAACCTAAG A 1 I I I I I I 1 11 1 I I I I I IGAGACAGAG 

TCTTGGCTCTGTCACCCAGGCTGGAGTGCAGTGGTGCAATCCCGGCTCACTGGAACCTCC 

GCCTCCTGGGTTCAAGCGATTCTCCTGCCTCAGCCTCCTGAGAAGCTGGGATTACAGGCA 

TGCCCACCGTGCCTGGCTATT 

Sequence 741 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTACAGTATAAATCATGCT 

CCTGCTGTCTAGAGCTTACCACCCAACGAGGGCTTCAGATAAGATCAGCAACTGCCCTAG 

AGTGTGGAACTCCTATGACAAGGTGAGCCTGGGGTGTGATGGAAACACGGCGTGCATGGT 

TACCAAGCCACGCTTCCAGGGAAAGGGGTCCGTGCGGGAAAAACTTCAGAGAGGAAATGA 

CATGTCAGTCAATAACCTGAAAGAACTGGNTGAGAGTTAAGCANCAGGGAACAAGGGCAC 

AGTNNTCCACACAGCTTTTTGGAAAGATCATGTTGNTTATAGTGCAAAAAAATACTGAAT 

ATGGGAMCAATTTGTTATTATTTTTTAGGAGTNTTGCTTTGTCCCCCAGGCTGGAGTG 

A 

Sequence 742 

ACTACTATTGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTGGGGGGAAAA 

AGAGTTACAATTTGCCCATAAAGMTGGAGAGMCAGAAATGTANCTTTTATGCTGMAA 

ACAAMTGCMGGGCMTCCAGTTTCTAATTCCTGTGCCAAAGCTGCTGTTCTTGATGAC 

CTCGGTCAAATCATTTAMTTCTCTCAATTTGTTCATATAAAAGTGCTATTAACCTGCAG 

TTCCTTCAAATACTATCCAATCAATGTTGGCTACTTGATTTTCA 

Sequence 743 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTATTCACAGGGTATGCATAAA 
CCAMTATACMGMTTTMTGACAGCATTATTTGCAAACAGTGAAAGATTCTGAGCAAT

CAAAAGGTTCATTACTACAGGCATAGGTCTATAAATTACTACTGTATGGAATACTATGCA 

GCCATTAAAATAATGAGGAAGAGGGAGAGTGCCCTGTATGCACTGACATGGAAAGGTTTC 

AGTATAT GGTA AMAGCAGCTCATCTTAGMCAGATTTTATAGTATGACCCCACTTGTGT 

GGAMTATTTTGTGATGTGCCACTCAGTGTATACGTATTCATGAGTGCATATACAAGTGT 

GTGAGAAGCAATGTAAGTAACTGTTTCACAAGGACCCCCCCTTTAAGAAGGCAGAGGGGA 

TCGGGGGATATAGAGTGMGGGATGATTTTTGCTTTTTCCTCTA 

Sequence 744 

CCGCGGTGGCGGCCGAGGTACGCGGGAGGAAAGGGCTGTGTTTATGGGAAGCCAGTAACA 

CTGTGGCCTACTATCTCTTCCGTGGTGCCATCTACATTTTTGGGACTCGGGAATTATGAG 

GTAGAGGTGGAGGCGGAGCCGGATGTNAGAGGTCCTGAAATAGTCACCATGGGGGAAAAT 

GATCCGCCTGCTGTTGMGCCCCCTTCTCATTCCCGATCGCTTTTTGGCCTTGATGATTT 

GAAA 

Sequence 745 

CCGGGCAGGTACACAGTAAGTGAAGGGCCAAGACTGACGGCTGATAGGACAGGGGTGACC 

AGNGGTGGGGAGGGTAGTGGGAGCAGTCCATCCTGGAATCTGGCATTCAAGGGGCGCATT 

GTCTGTGGGAGGATTTAAAMTMTAAMCCAACTAAAGGCAGTCTGCTTTTTATGGTCA 

CCAGGCCGCCAGCAATTCTAAATTTCAGTGATAAAATATTCCTCCTCACTGGACACGAGA 

AGCTGGCTTTCTCCTTATTCCCCAGTACCTTNGGCCCGCTTCTAGAAACTAGGTGGGATC 

CCCCCCGGGGCTGCAGGGAATTTCCGATATTCAAAGCCTTATCCGAATACCCGTCGACCC 

TTTNGANGGGGGGG 

Sequence 746 

CGGTAATACTNGTTATCCACAGCAATCAGGGGGGATTAATCGCAGCNAAAGAACATTGTT 

NAGCAAAAAGGGCCAGTCAACAAGGGCCAGGAACTCGTAAATAAAGGCCCGCGGTTGCTN 

GGNCGTTTTNTCCCATMGGGCTTCCGCCCCCCCTTGGACGAGGCNTCACCAAAAAATTC 

GACCG CTCA AAGTCANGAAGGTGGCGGAAACNCCGACAGGGACCTATNAAAGGATACCAA 

GGCCGTTTTCCCCCTTGGGMGGCTCCCCTNNTTGCCGCTCTCCTGTTCCNGACCCTTGC 

TCGCTTACCGGATACCTGTCCCGCCTTTTCTCCCTTTCGG 

Sequence 747 

CCGCGGNGGCGGCCGAGGTACATCTTTGGTGACTTTTCATTCACATTTCATGGATMTTT 

GGGGAGGTGGCCTGCCANCCCTGAAGCCCTACATCCCCATACACACTCTGTGCACATCCA 

GTGCCCTGCTCCACCATGGCAGTGCCCGCAAGGGGGTCCCAGATGAGAAGAAGCTGGCTA 

AAGGGCCCTTGTCCCCTCTCAGACTCCTTCAGCGGGCTGGAGTCCTCCCTCGCTCGATTT 

CGCCCGAGAGCGTTAGGGGTTTCTAMTGCAGGCGCCTTTGTGTTGTMCGAAACTTTTA 

GTTTAAGGGAAAATCTCTTTTAAGCCACTGATTGTTCTGACTTGCTGAGTTTACTCAGCA 

GCCTTATGCTGGCTCTGCCACTGCACAATAAAACCAAAGCANGACAGTTGCAGNTNAAGC 

AAGGGGGAACATGTTTTGCATTT 

Sequence 748 

GCCCGGGCATGGTACCTGTGTGGAAAAGAATGCTTGCAAAGCTTGTCACCCTCACGAGAA 

TTCCTGTGACAGACATTTGCCTTTGACAGTGAAAACAGATATTAAAGTGAAAGGAGAAGA 

AACCGAAGAGCATCAGAGGGGACGACTGGGTTACTTAACTGTTGGGGAGCAATCTGAGGA 

GTTGGTTACCAGAGAAACTGGCGATGGCGATCCCGTGAGCAACATCTCTCAGACCCATTT 

TAAAATGCCGGGGGATACTTAATCATGCTGAAAMCAGCAGAGCCCTTGAGGTTTTTGGA 

CTACATGTTGCAGAAAGAAAGAAGNAATTTNTACCTTNNCCNAAAAATAAAAATATNNNA 

NNNNGGTACCTCGGGCCCGGTTTTNAACTAGTGGGATTNCCCCCGGGCTTGAAGGAATTC 

GNTNTTCAAAGCCTTNTTCGATCCCCGTCCNANCCTCNANGGGGGGGGGC 

Sequence 749 

AGGTACI 1 1 I I I I I I II I I I 1 1 I M I I GGNCTAACTGNNNGG AGTATTTCTTTTACCCAA 
GATAAGTAAAAGCTACAACTCTTAGTATAAATATGNGTCCAAGTGCCTNATAACTGCTAA 
CCACAGGGATCCTGAGCTCTNATAGCTTAAACACACAGNGTNNATTTTACTGGTCTACTT 
CTCCTGNAGACCTAAAAGGGCCTATAGCCTCAGTAGTTGACAAAACAACATATTAAAATT 
CCTCACTGATCACTAACATAACCTAAAATCCCTGCTTTTGACATTAGCATGGNANACATC

CTTAGCAGGCCTAAATAGAATGGCCTTATAAGTGGATCCAAAGGGC 

Sequence 750 

AGGTACATCACCCTGCTGAGGGACATCCAGGACAAGGTCACCACACTCTACAAAGGCAGT 

CAACTACATGACACATTCCGCTTCTGCCTGGTCACCAACTTGACGATGGACTCCGTGTTG 

GTCACTGTCMGGCATTGTTCTCCTCCAATrTGGACCCCAGCCTGGTGGAGCAAGTCTTT 

CTAGATAAGACCCTGAATGCCTCATTCCATTGGCTGGGCTCCACCTACCAGTTGGTGGAC 

ATCCATGTGACAGAAATGGAGTCATCAGTTTATCAACCAACAAGCAGCTCCAGCACCCAG 

CACTTCTACCTGAATTTCACCATCACCAACCTACCATATTCCCAGGACAAAGCCCAGCCA 

GGCACCACCAATTA 

Sequence 751 

GTGTCCGGAATCCTACCCGGTGTGNNGACAGTGCCTGATAGTTTCTTCTGCCTTTCTATC 

CCAAAACGATTGGTCAGTTTACCCAAGTTTGCAATGCAGTTTANAATCTCCCAGGAACAT 

CTCTTCTAGTAGTTGCCTTAGCCATCTTGTAGTTGATTTGA CI 1 1 1 1 1 1 1 1 1 1 1 I G CTNN 

CAGAAAGCTCTATGCTTCATATGGACTTGCATACCAAI till I GTTCATCTGTTGGTCAT 

GATGGTTAGCAGAGCCTGACCTCCTGTTACAATAGAATGATCGGTTCCTGGGCTACAGAC 

TTGAGTCTGTTTTTTTTGTTT1TTAAACCTTCCCATGNGGCAATTTGCCATATGCAAAAC 

T 

Sequence 752 

CCGGGCAGGTACGCGGGTGAAAATGGAATAGTTTTCTAATTACAGAAAGAAAAGAAGTTG 

AAGTGGGTTTCGCCATGTTGAGCAGGCTGGTCTCGAACTCCTGACCTCAGGTGATCAGCT 

CGCCTCAGCCTCCCAAAGTGCTGGGATTACAGGCATGAGCCACCACGCCTGGCCAAAAAT 

CTTATAMTMTCCCXJTTCTAATTTCGGCCAGCTTAATCACACACCAAATfCCTTTCATG 

AGATT MTCTTCCACAACTTCTACACTTCCTTAAATCTTTGATTTTGTCCTATACTTCTT 

TTTTTATATTAGCAATCTACTTTAGGACAGAMTTTACTTTCCTTTCCTCTTGATTTTGA 

CCAAAGTCCTCTCTTAT 

Sequence 753 

TAGTTTTCTAATTACAGAAAGAAAAGAAGTTGAAGTGGGTTTCGCCATGTTGAGCAGGCT 

GGTCTCGAACTCCTGACCTCAGGTGATCAGCTCGCCTCAGCCTCCCAAAGTGCTGGGATT 

ACAGGCATGAGCCACCACGCCTGGCCAAAAATCTTATAAATAATCCCCTTCTAATTTCGG 

CCAGCTTMTCACACACCAAATTCCTTTCATGAGATTAATCTTCCACAACTTCTACACTT 

CCTTAAATCTTTGATTTTGTCCTATACTTCI 1 1 1 1 1 1 ATATTAGCAATCTACTTTAGGAC 

AGAAATTTACTTTCCTTTCCTCTTGATTTTGACCAAAG 

Sequence 754 

CCGGGCAGGTACCTATATGATGTTGGCCATGCTCACTCACTCCTCCAACCCTCAGTTTAC 

ACATCTGCAAAATGAGATACTTCTTTTCCAGTGTTGCTGTGGACATTAGCAGGCACACAC 

ATTTGGTGCTTGCACAAATGAGGTCCTAAGAGGTGGGTCCCTCTCATCTTACGTGAGGAA 

ACTGAAGCAGATTAGAAATGACCCAAGGAAACCACTCCGAGTTCAGTCTGGAGCCCACTC 

CCCTAGGTTTTAATCATCCCCCAACTCAGTCCCTATCTGCTGAGGTTCCTGGATCCAGAC 

GGTTCTTACCAAGGAACTGTCTGTCCTCACCACATGGATGGTTTTCCTGGCAGAGGTGTG 

CCCTGTGAGGGGGTCA 

Sequence 755 

GCCG AGGTA CANACAAGGGGGCNACTGNCATGGGGGNGGNNTCTGGTCTTGTAGTCNGTT 

TGGAATTTTCTAAGTCAGGGTGGGGTGGGGGGACTGTGCACGGGTCATGTGCAGACTGGA 

ACCCATCTCCCCCTCGGTCTGCAAGTTAAAACAATTGGGTTGTCCTTCTCAGCATCTGCC 

AATGTCTCTTANTCAATCTTGGATCAAAAGGGCGTTGGAGGAGGAGGCTGGGAGGGAAAT 

CCAGACAGTTCTCCGCCTCTGACATCAGGTCCAGCTGTTAGCATCGTGCTGTGGGTCCCT 

GAACAAGAAGCAAAGTCAGGACT 

Sequence 756 

AGGTACCGCTGTGTCCGGGTGGGTGGNGNGAATGCCGTGCTCCAGGTGTTCACAGCTGCT 
TCGTGGAAGACCATGTGCTCCGATGACTGGAAGGGTCACTACGCAAATGTTGCCTGTGCC 
CAACTGGGTTTCCCAAGCTATGTAAGTTCAGATAACCTCAGAGTGAGCTCGCTGGAGGGG
CAGTTCCGGGAGGAGTTTGTGTCCATCGATCACCTCTTGCCAGATGACAAGGTGACTGCA 
TTACACCACTCAGTATATGTGAGGGAGGGATGTGCCTCTGGCCACGTGGTTACCTTGCAG 
TGCACAGCCTGTGGTCATAGAAGGGGCTACAGCTCACGCATCGTGGGTGGAAACATGT 
Sequence 757 

AGGTACCTCTGGATATGTTACACTGAGANAGATACTTCATCACTTACATGATATTGCTCC 

CCCACAAATTTCATAACCTGAATCTAGTTATAAGGAAATACTATGCAACCCAAATTGAGG 

G ACATTCTGCAAAACAACTACCTGTAATC I I I I I I I 1 1 I I I I I 1 1 I GAGACGGAGTCTCA 

CTCTGTCGTCAGGCTGGAGTGCAGTGGCGCGATCTCAGCTCACTGCAACTTCTGCCCCCG 

GGGTGCGAGCGAT TCTTCT GCCTCAGCCTCCTGAGTAGCTGGGACTACAGGCACACGCCA 

CCACGCCCAGCTMTTTTTGTTGGGGTTTCACCATGTTGGCXJAGGATGGTCTCCATCTCT 

TGACC 

Sequence 758 

CCGGGCAGGTACTATGGTCCCCGGCAACCTCCCCTTCCTCCTGGGAATGCTCAAATGGGA 
AGGCA GCATG AAACGGTGAACAGGCAATCACTGGACAAAGTCACAAGAACTGGGCTTTAG 
AMTGGTTTTACCATTAGCAGTTGTGACACCTCAGAAGTGGCAACTCTGGATCTNAATAC 
CCTACCCTTNACCCTAAGNANAGGTACCTCCNCNATTTTNNCGGGGGGAAACNTTCTNGG 
GAAisnTNCCCTTTCCNAAA AAGGGGG GGGGGGGCT CI 1 1 I I 1 1 1 1 1 1 1 I I GGGGGGGGGG 
CCCCN NCNN CCCCCCCCNt II I I II I I I NTNAMGGGGGr4TTNMNANANATTTCNTTNC 
TCNTTNTTTTNTNANGNNAAAAANTCNGGTNGNGGNG I M I I I 1 1 1 1 1 AN AAAAAAAAAA 

Sequence 759 

GGCGGCCGAGGTACAMGMAGGACTTGATAGCTATTACCTTGCTGCTATGTTTGTTNCT 

TNGNCTCACCAATCATTNTTNTGTATACCTAGCACTGCACCAGGCGCTGAGGTTAGAGAA 

ATAACTAAAACTGCGCCCTTCACCCCTGATGGCAGGATAGGCAAGGTTGGCACCATCGTC 

ACAGCAGGACCCTCATCGATGCCTTGGTGTGTGCCTGGCATGGNGTTTGCAGCAGTTTAT 

CACATNNAATCCTTACAGC 

Sequence 760 

AGGTACTCAGGCCTTACTGGGATTTCCTTTAAGACCTCTGGGAGGAAGTGTCAGTAGCTG 

GGCAGGCCTTCTTGGCAAGCATTCCTCCCTGGGTTGTGGCGGGGGCTCCCGGCCTGCTGT 

GTGGCAGCTGCAGGCTCCTGGGGACCTGMGGAAMGCTTAACCGTTCTCCCTTCCCTTG 

CTTGGCACTTAGAGCACTAGTTCCATTCCAGACATACCGATTATCTTGCCTACGTGGCAT 

AGAGGCCTAGGAGCCTCCCTGGGAGGMGAGGCAGGGCMGGTCTTGCCTGGCTGCTTTT 

AGGGGGAMGA TGTAG GGAGGAAGCTGCCTTATGCTTGGATCTGCAACCTTTGCCTGGAC 

CTGC GGAG CCTATTTTGGCCAGGGGGAGGGAGACAGAAATTANACCCNANGTATTNAGGT 

MTCCTTTTNTTGCCTTTGAACATTGCNCGGGNGTACTTTGNAAAANAAAAAAAAAAA 
Sequence 761 

CTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGTGAAAAT 

GGMTAGTTTTCTMTTACAGMAGAAAAGMGTTGGAGNGGGTTTCGCCATGTTGAGCA 

GGCTGGTCTCGAACTCCTGACCTCAGGTGATCAGCTCGCCTCAGCCTCCCAAAGTGCTGG 

GATTACAGGCATGAGCCACCACGCCTGGCCAAAAATCTTATAAATAATCCCCTTCTAATT 

TCGGCCAGCTTMTCAC ACAC CAAATTCCTTTCATGAGATTAATCTTCCACAACTTCTAC 

ACTTCCTTAMTCTTTGATTTTGTCCTATACTTCTTTTTTTATATTAGCM 

GGACAGMATT TACTTTC CTTTCCTCTTGATTTrGACCAMGTCCTCTCTTATGCAAAAT 

GAAAAATTACTCTTTTTTCMCTTTCTTT 

ATCTCTCCTACTTACTGG 

Sequence 762 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGTGAAAAT 
GGMTAGTTTTCTAATTACAGAAAGAAAAGAAGGGGGNGTGGGNTTNTGCCATGTTGAGC 
ANGCTGGNCTCGAACTCCTGACCTCAGGNGATCAGCTCGCCTNAGCCTCCNAAAGTGCTG 
GGATTACAGGCATGAGCCACCACGCCTGGCCAAAAATNTTATNAATAATCCCCTTCTAAT 
TNCGGNCANCTTAATCACACACCAMTTCCTTTCATGAGATTAAACTNNCACAANTNCTA
CACTTNCTTAMTNTTTGATNNTGNCCTATACTTNTTTTTTTATATTNGCMTCTACT^ 
AGGACAGAMTTTACTTTCCTTTCCTCTTGNTTTTC 
TGNANAATNNCTNTTTTTTCAACTTTCTTT ACCAAAA 
Sequence 763 

TTAGGGCGATTGNAGCTCCCCGCGGTGGCGGCCGAGGTACATGTAATGCTCCTGAACTGT 

ATGCTNGACACGGCTGTCTACNTAGGTTTTGTTCTGTGTATTTTATGACTATTTTTT^ 

AAAGTAAACAAAAAAGAATTAGCTGGAAATACCAGCACAGGCAAACCCCTGGAGACAGAA 

AGCAGGTGAGTGGTTGCTGGGGCTTGAGCAGGAGGAAGGGCGAGGGACTGCAGAATGGCC 

ATGGGCTTTGCCTTCTAGCATGATGAGAATGTTCTGGAATTAGACAGTGGTAACGCTTGT 

T CAACA CTGCCAGTGT AGTTA ATGTCACTGAATTATACACTTTAAATGGCTAACATGACC 

AATTTTATGTTATATATATTTTACTACCACAAAAAMCTAGCTGGCACCTAAAMCATTC 
CATT 

Sequence 764 

CCGCGGTGGCGGCCGAGGTACTTGGATGGGTTTTGTGTGTATGTTTGTGTGTGCACTNGC 

GTCCACCCTGTTGGGCTT AGTGM CTTTTTGATTCAGTGATTTAMGTTTCTCATCAGAT 

TTGGAAAATTCTCAATTACTTTTTTCTTTAMTATTTCTCTTGCCCTTCCCCTCT 

CTTCCAGGATTCCAATTTCATCGATGTT 

Sequence 765 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGGTTCAAGCG 

ATT CTCAT GC CTCAG CCTCCCAAGTAGCTGGGACTAAAGGTATCCACCACCACGCCTGGC 

TAATTTTTGTATTTTTAGTAGAGATGGGGTTTCACCATGTTGATCAGGCTGGTCTCGAAC 

TCCGGCCTCAAGTGATCTGCTCGCCTTGGCCTCCCAAAAGTGCTGGGATTACAGGCATGA 

GCAGCTGTGCCCAGCTGGATAATTATTTAATAAATTGGGGAGCATAGGAAGCATAGTATT 

TGTGMGTGGGTAGGCAGGTGTGATGGGGGTAGGTGATGTTACATTTGGGGCATTTTGAA 

GTTGGTGGTTCTTCTGAGTTGAGCAGTCAGTCACTCTTCATTTGCTGCACCTTTATCTCA 

TTTTAGCCAACAGACATTGAATACCTACCAAGTCTTAGGTATTTGCA 

Sequence 766 

ACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAGAAA 

GAAAACAAATACCAAGTATTTACAGATCCAGAGAAAGTTCACAAGAATGGGAGGATGCCA 

GTTCCMTGCTTTGTAAAGTCAAAAATAGCCACATTGCAAAACAAACAAAAAAAAACGAG 

AACGTTCCCGAGTGTGCCTCCAAAACATAAAGGAGAAAATCATACAGAAAAACCTCATGT 

AAGGGTTGGAACTTGAGCAACCAGCTATCCAAATACAGAGGGGAATCCTCGCTTAGCTAG 

GGCATGGCCTGAG 

Sequence 767 

AGGTACACACAGTGATTTGGGGTCCTITmCCTAAAACAGCTTCmATCA^ 

MTTCTGGGTGAGATAGAMCACTGAMACAGGGCGGAAGTTTTTTCTTCTGGCTTCTTA 

GTCCATGGAGGGCTCAGCGTGGAGAGGATATGCCGTGGCATTCTCCCTGGGAGACCACAC 

ATGTTCCCGACAGCTCAGACCCCAGACCCGCACATGCTTCTTGACAGTTNAAACCCCAAA 

CCGNAGGNGCTCCCGACAGNTNAAACCCCANACCCCGCGTACCTGCCCG 

Sequence 768 

CCGCGGTGGCGGCCGAGGTACTTAATAATTCATAATTTAGCCATGATAGTATCTAAGCTC 

ACTTTCAGMTTATTGCATACATGCCTTAGGGMGMCCTATCCACTMTGCTTTTAATA 

ACTTACATAGATTGTGTTGCGGCMGTCMGTTTTMTATAGAGGAAAGGGTTTATCTTA 

TCATAGTAAAATAGTAGTGATGTGTTCATMTTTACTATTTGCATGGTATATTATCAAGG 

CTGTAAMGCTTGMTTTGCCTTTTCCACATC^ 

CCAGAGAAGTGGGTAGAACCCAAATGCCCATGNGGGT 

Sequence 769 

GGCANGGTACAAATTCAGGGGAGGATGGAGCAGCTGCAAGCCTGGCTGGCATCCCATGCC 

AACAAGGTGACCCAGTAGATAAGTGACAAGGTGACTGAGCTGCCTGGTGCTCTTGCATAG 

AATTTTCMGTGTTAGAAAAATGTCTCCATGCCTTGCMTrTGTCTCTTGTGGCCAAGCC 
TAACTCAGATGGAAATGCAGAAATCACCCGTCTTCTGCGTCGCTCACGCTGGGAGCTGTA
GACCGGAGCTTGTTCTMTTNGGCNATTTGGGTTChfrCCX)CCCCGGGNNCr™ 
Sequence 770 

TACTTAGGGCG AATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTA CI 1 1 1 1 1 1 1 I 

I I I I I 1 1 I I I I IGAGGNGGAGTCTCACTCTGTTGACCAGGCTGGAGTGCAGTGGCACAAT 

CTTGGCTCACTGCAACCTCTGCCTCCCGGGTTCAAACGGGTCTCCTGCCTCAGCCTCCCA 

AGTAGCTG GGAC TACAGGCGCATGGTGCCACTCCCGGCTAGTATTTTGTATTTTAGTAGA 

GACGGGGGTTTTACTGTGTTGTCCAGGCTGGACTCGAACTCCTGAGCTNAGGCAATCCAC 

CAG CCTCAGCC TCCCAAAGTGCTGGCATTACAGGCATGAGCCACCGTGCCTGGCCTCTTT 

CATAI I I I I I I I ACACTTTTCATTTCTTCTTATTTTAAGTGNGCTGGATAGGGGCTCCAG 

MCAGMTTCAATAGMAGTTGTGACAGTAGGMCCCTTATCTTGGTCCCTGATTTTAAA 

GGAGGGTTNAAAAAAAACCCCCCCCAA 

Sequence 771 

ACTTAGGGC GAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTA CI II II I II 1 1 

II I I I II I I IGGAGAGAGGGGGTTTCACTATGTTGCCTAGGCTGGTCTTGAACTCCTGGC 

CTCAAGAGATCCTCTTGCCTCATCCTTCCAAAGTGCTGGGATTACAAGCGGGAGCCACTG 

TGCCTGGCCTAGMGATCTGTTTTCTTTTCTCTGAATAATTCTTGTGACACTGTCTCTCC 

CTCCATCTCTTTCTGTTTCTTTGTCATTTTTCCCAGCTATC C 1 1 I I I I CCTTGNCTTGTC 

CTCTTCTCCCCTCCATCCTAAAACCTTTGATCACAAGCTAGTTTCCTTTCCACATCATCT 

GCTCCCCTCTACTAAACGCTATTTCGCCCCCACCTGCTTTCAAGCTGNGCTTGCCTCTGA 

GCCCCTCTTTCACCACGGCCCAAA 

Sequence 772 

NCTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTCATGCTAGACAA 

CCTTATGACTTGAAMCAAMTAATATTTGAAATGGAAATGGCCTCAGTTCCACCCCTGG 

TGCCCACATAGCATAGTGAACCTGCCCCTGCAGCATTGCCCATGAGTGCTAAGATCCTGT 

GCCCATTTGCATGTCTTCCTTAAACAAAAGACCGCCTTAGTAAGAAATTAGTAAACCAGG 

GAG ATM TCAACTTATCCCCCAAMGATTTAAGCCTCTCATTTTGTTTAACCTTCATTGG 

GGATTTTAAATAGAAAACTAGGGCCCGGCAGGGTGGCACATGCCTGTAATTCCAGTACCT 

GCCCGGCCGCTCTAGAACTAG 

Sequence 773 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATCACCCTGCTGAGGG 

ACATCCAGGACAAGGTCACCACACTCTACAAAGGCAGTCAACTACATGACACATTTCGCT 

TCTGCCTGGTCACCAACTTGACGATGGACTCCGTGTTGGTCACTGTCAAGGCATTGTTCT 

CCTCCMTTTGGACCCCAGCCTGGTGGAGCAAGTCTTTCTAGATAAGACCCTGAATGCCT 

CATTCCATTGGCTGGGCTCCACCTACCAGTTGGTGGACATCCATGTGACAGAAATGGAGT 

CATCAGTTTATCAACCAACAAGCAG 

Sequence 774 

AGGTACGCGGGGAGTGAACGCTCTCGGAGAACCCTTTCCCACGAACGTCCACTTCAAAAG 

AACGCGACGGAGCATTAACTCTGCCACTGACCCCTGGCCTGCCTTCGCCTCCTCCTCTTC 

CTCCTCTACCTCCTCCCAGGCGCATTACCGCCTCTCTGCCTTCGGCCAGCAGTTTCTATT 

TMTCTCACCGCCAATGCCGGATTTATCGCTCCACTGTTCACTGTCACCCTCCTCGGGAC 

GCCCGGGGTGMTCANACCAAGTTTTATTNCGAAGAGGAAGCGGAACTCAAGCACTGTTT 

CTACAAAGGCTATGTCAATACCAACTCCGAGCACACGGCCGCTCTAGAACTAGTG 
Sequence 775 

CCGCGGTGGCGGCCGCCCGGGCAGGTACACTACTGGCATAAGAGTAAATTGGTGAGAACT 

TTCTGGAGGGGTAGTTTGGCAATGTGTTTCCAAAMATCTAAAMTTATATTTGCCTCTA 

ATCCAGCAATTATACCTCTAGAAATTAATACTAAGGAAAATCTTAAGAATATACCGTAAA 

ACTTTAGTTGTMGAAATTTTTTTGTGGCCAGGCATGGTGGCTCACACCTGTAATCCCAG 

ACTTTGGGAGACCAAGGTGGGCGGATCTCCTGACCTCATGATCCACCCGCCTCGACCTCC 

CAAAGTGCTGGGATTACAGGCGTGAGCAAATTTTAAATAAGAAGAAACAGTCAACAGCAT 

CAGACATAGTAGGTATGTCCAACACCATAATGGCTGAAAAGTGCCCCCTAGTCTGGCAAT 
TAGTAGGTCATTGGTTTATTAATAACCGGCATGTTAAAGTTG
Sequence 776 

CCGCGGTGGCGGCCGGCAGGTACAMTCATACCTCCCMGGTATTGCTCCATTGTGTTTT 

TGTGCATTTGGTTTGGATTTTTATGGGGAATTGAAGACAAGTGGATCATAAAGTGCAAAA 

TAAAATGCTCTAGAAATGACAGATGGGGCACAATTTCCAAGAAAATTCATCTAGACAGTG 

GCAACACTGAGAAAAAAAAGAAACATTCAAGAAG 

Sequence 777 

GAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTAC I 1 1 I 1 1 U I 1 1 1 I I I 1 1 1 1 

TGCTAGAGATGGGGTTCACCATGTTGGCCAGGCTGGTCTCGAACTCCTGGCCACAAGTGA 

TCCACCTGTCTCAGCCCCCCCAAAGTGCTGGGATTACAGGTGTGAGCCACCACTCCTGGC 

CCATGTTTAGGATTTATACCMTATTATTMCTTAGAMTAAGTTTCTAATAAATTATTC 

C ACCCG MCTTAGGGTMCTGAATTTTMTGCTGATGTATTAAGCAGGTTCTTCCTGGGG 

TCTTTTGATTCTCAAGGGATCCTTCACTGNGGGTGGACTTCAAATTAATAGGAAGCAGGA 

AGGAGCCACCTGCACTGTTTTCTTGACTGGGGATGACACCNAAACCTT 

Sequence 778 

CCGCGGTGGCGGCCGAGGTACTATGAGAATTTCAAACAAAGAATGAAGCCATAAAACAAA 

MGACTGMTATTTGGCTCTGCCTGGCTCCCAGGCTTTCTACTATTCTTGTGACTTGGCC 

TCMCAAMTCTAMGTGACTTGTTATTTGTGGGTCAGCTTTGTCCCATCCTTACCAGTC 

ATGGCTTTAGACAAAAGACTCAGCACCACTCACCCTNTGGGACAGTNTGACTGNGGTCTG 

AGNCCCCTGCTTANATATTAGGCTTAAGCTCAGTT 

Sequence 779 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTATGAGAATTTCA 

AACAAAGMTGMGCCATAAAACAMAAGACTGAATATTTGGCTCTGCCTGGCTCCCAGG 

CTTTCTACTATTCTTGTGACTTGGCCTCMCAAMTCTAAAGTGACTtGTTATTTGTGGG 

TCAGCTTTGTCCCATCCTTACCAGTCATGGCTTTAGACAAAAGACTCAGCACCACTCACC 

CTCTGGGACAGTCTGACTGTGGTCTGAGGCCCCTTGCTTAGATATTAGGCTTCAGCTCAG 

TTCC 

Sequence 780 

TNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACACACAGTC 

AGCATGCGCTGTAGCAATGTGCTTTGCAiGCTGGAACTCCTATCAAGCATCCTAGGCAAGG 

CATGCACCCCAGCGCCAGAGAGAATCAGGAAGGGGAAGGTGCCCTGAACCTCAGACAAGA 

ACCCCTTCCAGAAACCACCACCAAAGCCATCACTGTGTTTCCACCCTCAGACCTGTGTCT 

CTTTAGCTTCTTGGTAGAAGGAAAGAAGAGGAGCTTGGGTGGGGCAG 

Sequence 781 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCACGGTACCATGGACAT 

GCACCCCGGCCACAGAGAGGCGCACMCTTNfTACAGAACACCCTTCCACX)TGGTCTTCCA 

CAGCTGCATCAGATTCCTGGACTGTCACAGACATGACTTCAAACTTGAAAGTTGCAAGAT 

CTCCTGGAACAATTTCCACMTGCATACAACTTCATTCTTAGCCTCAAGCACTGAATTAG 

ACTCCATGTCTACTCCCCATGGCCGTATAACTGTCATTGGAACCAGCCTGGNCACTCCAT 

CCTNTGGANNNTTAMCNTTNAAMANNNAMNCNNNCCCCTNGGCNTTTTTAAAAANNN 

GGNNNCCCCCGGGNNNGGGAAA I I 1 1 1 1 I I I A AANA I I 1 1 1 1 1 1 CCCCCCCCCCCCNGGG 

GGGGGGGGCCCCCCCCCCCCCC I M I N I 

Sequence 782 

NAATTGAGCTCCCCGCGGTGGCGGCCGCCCGGCAGGTACTTCCCTGAGCAGTCGAAGTGG 

ATGCCCAGACCAATGGCCAGTGCTAATATCAATGCAATGATCCCAATGACGATGATTGGA 

AAAAACTTCAATGGCAGCAGTGACAGGATCTGTGCAGCAACAGCATCTGCATCTGGTGCA 

ACAGGACTTATTTTCAAATCATCAAGGCCAAAAAGCGATCGGAATGAGAAGGGGGCTTCA 

ACAGCAGGCGGATCATTTTCCCCCATGGTGACTATTTCAGGAC 

Sequence 783 

CNATTGAGCTCCCCGCGGTGGCGGCCGAGGNCTGATGTCCTACAGTCCTCTACCTGATCT 
ACGTTCACTGGAAAGTGTNGAGTCTCAGCAGGAAGCACCTTGCTCTCGTGTCCGGCTAAT 
TCGAGTGCTTTACX3TAAGTAGAGGMTTGCTGACTTTTGGGACATTTCTGGTCTTGCCAA
AGTTCACCTTGTAGTAAAGCCCCCAAAGATACTTCCCAAATAGATGCTCTCTTGAAAATA 
ACTCAG 
Sequence 784 

CTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATGTCGGCCTATTTTA 
TATTCTGTTTAGTGCTTTCMCCCCGAMTCCACCTTCTAACATTAATATTGATATCCAT 
CCTCTCTCTCCTCCAACCTCTCTCTCTCTTGCATTTACTTTTAGATTTTCATTACTTTCT 
TTTTATTCTGATTCTTGTAMTAGTATAAACTAGA 

AATTTATGATTGACACATMTAATTGTATATATTTATGGGGCACMCGTGATGTTTCGGT 

GCATGTATACATAGTATAATGATCAAATTAGGGTAGTTACCATATCCATTACTTTAAACA 

TTTATCATTTCTTTGTGGTAACAACATTAAAAATCTCATCTAGAATGGCGTGAACCCTGG 

AGGCAGAGCTTGCAGTGAGACGAGATGGGCGCCACTGCCTCCAGACTGGGCGGACGAGCG 

AGACCTCCCNTCTCAAAAAAAAAAAAAAGG 

Sequence 785 

GCTCCCCGCGGTGGCGGCCGGCAGGTACGAAATGAGAGAAATGGTTTAGTAAACGTATAA 

GACATCAACATAGNAAAGTATTCTATAGGNNTATGTGTTGGAATTACAAAGATGAAGAAA 

AGATACAGGCAAGTATTTGATATACTNAATTAAAAATAGCAAGATGTAGAGTAGNCATGT 

ATACAGTGATAGCAAG AACATG GATCCTTAAGGACAAAACTGAAACATAATGCAAAAAAA 

GMAMTATGCAMTTATTTTTCGTATGATGTAAGTTGTAAATAT 

Sequence 786 

TACTTAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGNGTACNCGGGCCTATTTCTGAA 

TAACTCAGNGGCTTAAAATTATATCCCAAAGTAGNGGTATCACAGGGTTTCCTGATGAGG 

ATAAATGGGCCTGAAGTGCTTATGGGCACCCACTATGTATCATGGNAAAACTTGCACGTG 

TGT GTGTGT GTTGAGAGAGAGAGAAAAAANAATAGANAAAGTTGGTGAGAAAAGGGNAGG 

CTGTTTTTTGNNCCGAGGGNfTGTNTGGTTGGGCTTT 

Sequence 787 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGNACACANTAAGTGAAGGGCC 

AAGACTGACGGCTGATAGGACAGGGGTGACCAGTGGTGGGGAGGGTAGTGGGAGCAGTCC 

ATCCTGGAATCTGGCATTCAAGGGGCGCATTGTCTGTGGAGGATTTAAAAATAATAAAAC 

CMCTAAAGGCAGTCTGCTTTTTATGGTCACCAGGCGCCAGCAATTCTAAATTTCAGTGA 

TAAAATATTCCTCCTCACTGGACACGAGAAGCTGGCTTNTCCTATTCCCCAGTACCTGCC 

CG 

Sequence 788 

GGNGGCGGCCGCCCGGTTTTGGACGCGGGT^NTGCCCTNACTTTTTTAGCGGAGCAGAG 

GAAACATTCATAAGGAAATATGCGAGTAGAGCTCAGGAGAAAAGCAGGACTAGAGGCCCA 

AGAATCACAGGCCAGAAGAAGAAGCTGTAGCCTCGGGAATGGAAGAGCTCTCTGAAGGGG 

AAAGGGGAGAACAGGAATGTNCCAGGAGCCAAGGCTCATCTATAAGGGACTTNCACATTT 

AGGATGTAGAAGAAGGAAGCAGAAGCAGGGGATGACCAGAAATGGCCCCAGAGATGAGAT 

GAMGTTAGGAGAGCGGNGAGCAAGCCTTTAGGTTTCACAAGGGAAGGAGGGAAAGTAGG 

TGTTAGGTGCTGCCMGATCAGGGAMATMGCAGAAGACCAGGCCATTfTNANTTGCNG 
TGG 

Sequence 789 

CCGC GGTGG CGGCCGAGGTACCACAATCAACTCAATCACAACATATTACAACAAAACCCT 

TCATCTTTTTCCTTAACCCACTGTMCACAMGCAGAGAATACAGATTAGCTTTTTTAT^ 

TGTGTGTTTGACTTCATCTCTTACATACCTCTATTTCAGTATCTATGATATTTTCCTCTT 

CTTATCTGTTCMTGACAGTCTTCCCTTTTMTATTCTAAACTTGTCAAGCACAGCANTT 

MAAAAGTATTCTCATGTATATATTTTATCTTTAGAGCATCGCATAAAGNCTGATACATA 

GGMGTTTTAGATGCATATTTACATTGGGTAGATGAATCCAGGGGAAAAG 

Sequence 790 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTGCCCAAGAGAGACGTCTCTTACTGCCTCATT 
AAGCATTTGGAGCTGTTAAACACAAATCAAGGCAACCAGAAAGGGCATCTTGGCTTCAGG 
CTGGGCATAACCATCCCATTTGCCACATAAAAGTCTAGTGGCTACTCTGCACCCTTTCTG
GGTAGAAGCAGAGTTAGTTTGGTCATGGGGGCCCCTGTGGGACAGTGTTGCCCAGACAGG 
TACCTCGGCCGCT CTAG AACTAG 
Sequence 791 

AACCCACT ATAGGG NTNATNGGAGCTCCCCGCGGTGGCGGCCGCCCTGGCAGGTACTGNC 

TGTCTCAMTTTTTGGATACGCTGTCCCTCATGTCACCTAGCTGCTGAGAGCTTTGTGAT 

CCTAACAGGTGATGACTCAGACCGACACTGCATTGGTAGGAATTCCACAAATAGGTGCCT 

CMTGTGCCTAGATTGAM TATC AGCCTTTCCCAGACTGACCTGATGGGTTGACTTCAGG 

TGTGGTGTAMCACCTACATTTTAATGTAAACATTTCAGTGTAATCAATGAGAACTATCA 

TTCTGCTTTAATCACCATGAGTTCTGAMTMCAAAGGATTTGTCTGACATTCATTCTAA 

GMATTCATTCTTACCTGACTAAGAMCTTTTTAACCTGGCACAATAATAAGAAATGACC 

TGGNAAGTACCTCGGCCGCTCTAAAACTAAGGTGGGATCCCCC 

Sequence 792 

NGGCGAATTGGAGCTNCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGCAAAATGTAA 

GTTGGTAAAAGACATTGGACTCCAGCTATGTTCTTTAGAAAGAAGGTATTGGACTCTGGC 

CATGTTCTTCAGAMGAACATGCCTGGCTTTTTCACGA7TTGATCAGTCTTCTTAGACCC 

TGAACCCCAC CATGA AATGGCTTCCCCAGACACAACCCGAGAGAGTTATGCTTTGTTTCT 

CAGCTAAAATATTTTGCAG ATCTTAATTTCCTGGGTCATTGCATC A I I I H 1 1 1 I I I 1 1 I 

T 

Sequence 793 

AGGGCGMTTGGAGCTCCCC^CGGTGGCGGCCGCCCGGGCAGGTACCMGTATACCTTTT 

CATTTAAATCATTGMCAGTTCACMTGGCTGTTGTAAAGTTTTrGCCTTGTTACTGAAA 

CC ATCAT CTCTrGGTCTATTTCTATGGAGTTATTTTTTCCTAGTTATTGGCAGTTTGCTT 

ATCTTTTATGTGTCTMTAACTTTTAATTGMTGTAGMCATTATGGATCTTATATTTTT 

GAGMTCTGMTGTTTAAMTTCCTTTGMGAGTTTGTGT1TTCTTCTGGAGGCAGTTTA 

CTTACTGGCTGTGAGCTCAGTCCTGTTGAGGCTGTTTNGCGACNNNCTGTGGCTTTGTCA 

GGGTGGGGTGGTGGATCAGACATCTGGTCATCAAC 

Sequence 794 

ACNCCCGGCCAAAAGGGAGNNCACAGGGGGGGGCCAATATATAAGGGGGGCAAGGAGGGG 

GGGNNGGGNAAAGANGNCAAACCCCNNCCCNNGGGGNNGGGGGGGGAGGGGNAANAAAGA 

AGGNGGGGGNGNCCANGCAACCNAAAGGGACNAACCCAACANNNAAGCGGGGGAAGGGGG 

CACGANAGAANANANANNANAGGNGANNCAAGAACCAANCAAGGGNGGGGGGGGAAGGGC 

NNCAAANGGGNNNAAAAANNGGCNCAGGNNACCNAANGGGGGGNNGGGGCNNAGAANGGG 

GGNGGGGGNNANGGAAANNAAANNCNGGANNNAACCAAGGGGAAGGGAGGGGGNGGGAAA 

NCAAGGCCCCCGGGGNCAAAACAAAAAGNGGGGGGAAAANACAANCGGGGNNNGGNGNCA 

GGAGGGGAGGNNANNAGCCGGCGCGNAAAAAAAAAAAACAANCCAGGGGCAANGGGGNNG 

GGGGNGGCNAANGCCNCNNGGGGNAAANCCCNANGGGNGNACCCNNGGAGGGNGGGGNNN 

GGGGGGGCNCNGAAGGGAAAAANAACCCCCAAGGGAAAANCCCANGGAANGGGNAANAAG 

GGGGGNGGCAAGNGGGNANCCCGNAAAAANAAAGGGGAAACNGGGGGANNGNCCCAAAAC 

CCCNGG 

Sequence 795 

CCGCGGTGGCGGCCGAGGTACTATCTCTTAGGAGAAGGCTGACTTGAAGGCTGTGAAAAA 

CTMGAAAACACCAACCATTCMCAGTATACTAGMTTCCTTTCAATGCATAATAGAAAC 

MGMGGGATTAGAAMGCATGTCATAATTTCCAGATAGCATMTTATTTACATTAAAGA 

TCCMGAGACTCAGACCTAGTAAMGATTTTGGCCACATTGTGACATTTGAGATCACATT 

AAAAAAAAAAAGGAAAAATCAAGNGATACTAACATCACCAATTAACATCATNAATTAGAA 

AATTTATCCATCAGCNGTATTNNNNNGGGCTCMTGCNTTGTAATGCCGCATTTCGGGAG 
GC 

Sequence 796 

CCCCCCCCGGAGTTTCAGCGAAAANCCCGGCCGAGGNACCNCNCCNTTACNCCCAGGNTT 
ANAAACNCCCNGGTNTNGNCCCAGGAGAGGCGGGGGANCACCGCGCCNGGCCAGCANGGN 
CCCCTNCTAAATNNNGNNGTNNAAGGAANCNGGACCCCAAANNCNCGCCCCCCACGAAAG

GAGCCTGGGAACTACCAGGGCGTGAGCTCACCGNGCCCGGCCGCTCTAGAACGAGAAGGA 

ACCCCCGGGCTGCAGGAAAAACNATAACAAGCNAANCNAAACCCGACCACCCCAAGGGGG 

GGGCCCGGGACCCAGCTGTTTGGGCCCTCAANACNAGGGAAAAATTGCGCGCTAAGNGCN 

CAANCANGGGCCAAAANGCTGGNTNCCCCGGAGGGGAAAAAAGGACACTCCCGCGNCACA 

AATTACCACNCAAAACATAACNGAAGCCGGNNAANCAATAAAANNGGGGAAAAANACCCC 

NGGGGGTGGCCCTAAAAGGAGGGGGAGCCCCAACCACCAACAAAATAAATTGGGGG 

Sequence 797 

AGGTACTATCTCTTAGGAGAAGGCTGACTTGAAGGCTGTGAAAAAGTAAGAAAACACCAA 

CCATTCAACAGTATACTAGAATTCCTTTCAATGCATAATAGAAACAAGAAGGGATTAGAA 

AAGCATGTCATAATTTCCAGATAGCATMTTATTTACATTAAAGATCCAAGAGACTCAGA 

CCTAGTAAAAGATTTTGGCCACATTGTGACATTTGAGATCACATTAAAAAAAAAAAGGAA 

AAATCAAGTGATACTAACATCACCAATTAACATCATCAATTAGAAAATTTATCCATCAGC 

CGGGTGTGGG 

Sequence 798 

GCGGCCGAGGTACAATTCAACAATTNNTGGTCCAGGATCATGAATGGGCCATTNNTAGTT

CTGTGTGTGCTTAMCACATTT7TTTGTGGGGTGCTGTGGATGTGTGGATGTAGCCAAAA 

AAAACCCTATTGTGGGNTNGGTCCTGGGGCAGAAAGTCTGGTGCCAGAGAGTGGGGTTCT 

GGGGGTCTGTCTTCATAGTTTGGGGTAGCACTAAMTCCTGTGAGCCTTTCTGGGCCTTG 

GTAACCTCCCCTGTMGTTAGCTGTTAGATAATTCAGCTGGGTAGCATTTTATACCTGGA 

TGATGTTCTAAAGTCCAGCCACANAAGGCCNNNGTCTGGCAGAGTGAGAATTNCCTTTGA 

AGAACCTTNAAACTGNTNCCCNAGAGTGACACAGGGGNNCCTNGGGGAAAANCNAAAAAG 

NNNTTGGGAATTCTNTNCAAAAGNAAGNCCC A I I I 1 M I I 1 GCCNNNATTTNGGCCCNCG 

NTAATNCCCCNCCCCAAGNAAAAANNAAAAAAANTN I 1 1 I I I I I 1 I I I I 

Sequence 799 

CGGTGGCGGCCGCCCGGGCTTGGTACCCTCTGTCACGGCTTCCCTTTGCTGGAAAAGGGA 

ATTTCCCAACCCCGGGTGAGGCAATGCCCCGCCCTGCTCCGTGGGCTGCACCTGCTGTCT 

GTCAAGCCCCAATGAGATGAACCCTGTACGCGGGGGCCTGGGATCTCAAAATGGCGGCCC 

CGTGCGGAAACAGCGTNTGGGAGCAGANATTGTTGCCTCCTGAA 

Sequence 800 

GGGCGNTTTGGAGCTCCCCNCGGTGGCGGCCGGGCAGGTACTATCTGGAACNTGTAGCTT 

CCTTTNGCACTGCAGCATGGGMGCCAGAGTTGATGATTCATACACCAGCATTTACATTT 

TTCAGCATGAAAGTGGTATGTTCTTCAACTCACAACCCATTGGCCAGAACCAGTAACATG 

ACTTCACCTAACTGCAAACTAGCTGGAGAATTGTGGGAGAGCTCATGG 

Sequence 801 

CCGCGGTGGCGGCCGAGGTACCATTTAGCACACAATTTCCATGTCCCAAAAGCAACCCCC

ATAMCAGTGAC TATTTTTTATGCT GTTTTTCTTTGCCCCMCACTTT^ 

TGTTATATCTTGC I II I I I I I I I I CTTTTTTAATGGAGTCTCACTCTGTCACCCAGGCTG 

CAGTGCAGTGGCGCGATCTTGGCTCACTGNAACCTCTGCCTCCTGGGTTCAAGCAATTCT 

CCTGGGGGGTGGGGAGGTTTGCAAGTGATCCAAGATTGCGGCTCTTCACTCCAGACTGGG 

NGAAAGAACGAAACTCCATCTNAAAAAAAAAAAAAAAAANAAAGTACCTNCCCGGGCCGG 

CCGCTCTANAACTAGGTG 

Sequence 802 

CCGCGGTGGCGGCCGCCCGGGCAGGACGCGGGATGGTGTCAACTTATGTCAGGACCCATG 

GGCCCTCCCCATGCACACAGCACTCTTGGAATCTCATCCTTTTCCATGGCTCTGGCTCAC 

ACTTCCACAGCATTTACTCCTAAATATGCCCCCTGGGTTCAAGGGTGATTCTCGTGCCTC 

AGCCTCTCAGGTAACTGGGATTACAGGCATGCACCACCATGCCTCGT Al 1 1 I I I I I GTGT 

GTGTGTTTAGTAGAGACGTGTTTCACTATGTTGTCCGGGCTGATCTCCAACTCCTGTAGT 

CAAGTGATCTGCCCGCCTCAGCCTCCCAAAGTGCTGGGATTATAGACATGAGCCACCACA 

CCTGATGTCTGATGCTTATTTATTATGTGACCTTAGCGAAGTGTGGGTAGTCATTAGTGC 

TGGTCTATCTCTATACCTTCCCCAGGCAAGGTAGGATTGCACTTCCCGTCCACT 
Sequence 803

AACGACCGCCCGGGCAGGTACTAAGCCAGGACCTCAGTTAGCACTAAGCACTCTTACTAT 
TGCCCCCACCTGGCACAAAGCAAACTGAGATCTTAGTTGGGCCCATCATGTGTCATCTGA
TTGTCTTAGAAGTTCTTTTTTTCTMGACAGAGT^ 

CCAATGGCACAATCTCGGCTTACTGCAACCTCCGCCTCCCAGGTTAAAAGCGATCCTCCC 
GCCTCAGCCCTNCGAGTAGCCGGGACCACAGGCACCCGCCACCACGCCCGGNTAACCTT 
Sequence 804 

TACTATAGGGCGAATTGGANCTCCCCGCGGTGGCGGCCGCCCGGGCAAGGTACTCTCTAT 

GACTATCMGCTCAGGCCTCTCCCTTTTTTTAAACCAAAGTCTGGCAACCAAGAGCAGCA 

GCTCCATGGCCTCCTTGCCCCAGATCAGCCTGGGTCAGGGGACATAGTGTCATTGTTTGG 

AAACTGCAGACCACAAGGTGCGGGTCTATCCCACTTCCTAGTGCTCCCCACATTCCCCAT 

CAGGGCTTCCTCACGTGGACAGGTGTGCTAGTCCAGGCAGTTCACTTGCAGTTTCCTTGT 

CCTCATGCTTCGGGGATGGGAGCCACGCCTGAACTAGAGTTCAGGCTGGATACATGTGCT 

CACCTGCTGCTCTTGTCTTCCTAAGAGACAGAGAGTGGGGCAGATGGAGGAGAAGAAAGT 

GAGGMTGAAGTAGCATAGCATTCTGCCAMAGGGGCCCCAGNTTCTTAATTTAAGCAAA 

CTAAGAAG 

Sequence 805 

CCGGGCAGGTACAA TGGA CTTTGACAGTTCTTCCCAAACAGATCCTMTTTTAAACATTA 

GGTTTGCTTTGATTCTTTTCCTTGGGGCTMGAGCTCACAMGACTTAGGTTCTGGTCAT 

GGCTCCAGAGGCCACACATTCCAGGACAAAGTCTCTCTACAGTCAACGCCTTAGTCCCAC 

ATCTGTAAAATCGGAATAATCATCCCTGATCCAGCTATCACATTGCAGTAGAGTGAGACT 

CAAATGAGATAATGGAAGACAGTGGGAATGATCATTTCCAACTTGGCCTGGCTGACCCAT 

TCCT TGTTCTAMGTCAGCTCAGGTTTCACCTCTTCCAGNGMGTTGACCTGGCACTTTC 

TTTTAGGATGGCTACTGCTCCTCTGGGTGCCCCGGGGCTCANTGTCTCCCCATCACCGCC 

CATGGCACACTTGGAGTGACTGGTCCTTTACTTTGNTT 

Sequence 806 

TNCGGGCAAGGTACATTGGCCCCAAAGAGNAGGAATTCCTTGTAGAGGAGCTTGTAGATG 

CTTNCCCTCCAGCGGAGAAGCAGGCCAGAGAAACCTCCGAAGCGGGCCTCCGCCACTTTG 

AGAGTGTATGAAACCGTCATGGTGCTGGGAGCCTGGGGCAGGAGGTCACAAGAGTTGCCC 

CCAGGGCTGTCGTTTAGTTCTCCAGACAACCTCCCTTCCACTCTGGTCTCACACACCCCA 

GCCTTCACCCTGCGTCAGTGGACAAGGGGGTAGGAGCCTGCAGAGCAGAAAAGTACCT 

Sequence 807 

AGCNCCACCGCGGTGGCGGCCGANGTACGCGGGATATGTAGAACTTCACNNGTTTGAAGT 

TGGCTGATTAMTATACTAAGTATTACTGAATCACTGCCCTGCCTTTTCTGCTTTCTTTA 

CAG ACCTGTTTAGTATACACTGTATGTA ! 1 1 I I I I I I I I I I 1 1 I GAGACTCCGTCTCAAA 

AAAGAGAAAATTATGGGCCGGGCACAGTGGTTCATGCCTGTAATTCCAGCATTTTGGGAG 

GCCGAGGCAGGTGGATCACCTGAGGTTGGGAGTTCGAGACTAGCTTGGCCAACATGACGA 

AACCCTGTNTGTACCTGCCCG 

Sequence 808 

CCGCGGTGGCGGCCGAGGTACGAGACTTGTCACCATGTGACATGGCAGCTTCAGAAACTT 

AGCCACTGCCAAAAAAAGAGCAGGCAGGGATAATGTTGTCCCATTGTCCAGTCAGAGAGA 

CCTGTTGAGTCTCTAGTTTGCCAGTCCCCMGAGACCTTTGGAGTTTTGCTGGAGCCAGA 

CATCCTGCTTAGAGATGAGGAAGATCCTGCTG7TCCGTGGGGAGCTCTTGAGACACCCGT 

GCCACCACCCACCTTCTCCTGATTGCCACTTGCTGCCCTTTTCCCATTACCCTCTCCTGA 

CTCCATAAACATCTTCMGTCTTCCCTTTCTCCACCCCAAAAAATGCCCACCTTGGAAAG 

GG 

Sequence 809 

AAATTMTTGGGGTTGNGCTMCTGCCCGGTTTTCAATCNNGNAAACCTTGTGGGGCCCA 
NNT GMTT AANANAATNGGNCCACCCCCCCGGGGAAMGGGNGGTTTTNNMNTTTTGGG 
GCCTTTTTCCCCI I I I I IAAAAAA 
Sequence 810 
CCGCGGTGGCGGCCGCCCGGNCACGGTACGCGGGGATGTCTTCTGAGAGAGTCAGGGCAG

CTGAAGACTGGGTGAGGGTGAGGGAAGCCGCTGGTGTCCTCCTCAGTCACCCGTGAGAGG 

ACTCCTNTGTGGAGCTAATCAACTGCAAGGAAGATTGTTCCCAGTGTCCAGACCTGAAGG 

AGTCTGGACCCATAGTGCANTGAGATTTGGGGAAGGAAGGATTCCGGATAGGGGTGAGCT 

TTNTGNTGATAAGCAAATGTGAAC 

Sequence 811

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGTGTTTATGGGAAGCCAGTAACACTG 

TGGCCTACTATCTCTTCCGTGGTGCCATCTACATTTTTGGGACTCGGGAATTATGAGGTA 

GAGGTGGAGGCGGAGCCGGATGTCAGAGGTCCTGAAATAGTCACCATGGGGGAAAATGAT 

CCGCCTGCTGTTGMGCCCCCTTNTNATTCCGATCGCTTTTTGGCCTTGATGATTTGAAA 

ATA 

Sequence 812 

CCNTCAGGTACCAGANCTTAGCAGGGATTTTGGACAACAAAAGCTCTAAATCCTCTTGCA 

TCGACA CGTTC MTTTGCACTGACCMTCTGTTGGCACAGTAACTGTTTATAAGCTAAAT 

TTCTACATTTTGGCTACAAGTATCCCAMTCCACCTTTTAAAAAATCCTAGGTAGATGCC 

ATCT GGTGTTMTGATTTGCACACCCCTTAMTTGAAMTATTTTAAATAAATCTCACGG 

TTTTATATAGTATCATTMTGTGTCTATTTTAAAAAGACAATCTGAGAATAACACTTCCC 

CTAATTGTTGTCTTAATAATGACCAAGAGCTGAGGAAAAATGATTCACACTGTTAGTTGT

TTTGTTGTTTTGCTCACGGGGGAAGGGGGTGAAGTACTGGCTGTGCCTGGGTTTG 

Sequence 813 

CCGGGCAGGTACATCACCCTGCTGAGGGACATCCAGGACAAGGTCACCACACTCTACAAA 

GGCAGTCAACTACATGACACATTCCGCTTCTGCCTGGTCACCAACTTGACGATGGACTCC 

GTGTCGGTCACTGTCAAGGCATTGTTCTCCTCCAATTTGGACCCCAGCCTGGTGGAGCAA 

GTCTTTCTAGATAAGACCCTGAATGCCTCATTCCATTGGCTGGGCTCCACCTACCAGTTG 

GTGGACATCCATGTGGCAGAAATGGAGTCATCAGTTTATCAACCAACAAGCAGCTCCAGC 

ACCCAGCACTTCTACCTGAATTTCACCATCACCAACCTACCATATTCCCAGGACAAAGCC 

CAGCCAGGCACCACCAATTACCAGAGGAACAAA 

Sequence 814 

CCGCGGTGGCGGCCGAGGTACATTATTCATATCCAGCACTCCCTGCGGCTGCTGCTGGAG 

GAGCAGTTATCCMCAAGGACTGTTTCAACCTCATCGCGTTTGGAAGCACAATTGAAAGC 

TGGAGGCCTGAGATGGTTCCCGTGAGTCACAACAATTTACAAAGTGCCTGGCGGTAGGTT 

ATGGGCAGAGACTTCGTGGGGCTGTGTCTGAGGGMGGTTTGCAGGCATTGTTTTCTCTG 

TCCCCCTCTCCACCAAGAAGTAGCTCTCTAGAGTCCCTGACCCCAAACAGCCATGGGCAG 

AAATCAGAAAACAGCTTCCTTCTGTCTGCTGCTCTCCCCACCTGGCCATCTTCACTTTAT 

GAGAGTAATGACATCGACTCCATCACGTCTGAGATGGGAAAAAGGCTCTTCAGCTACTCC 

CAAAAGGGTATGCCCTGGGCATGGG 

Sequence 815 

CCGCGGTGGCGGCCGAGGTACTC I 1 1 1 1 1 1 1 1 1 1 If I I I 1 1 1 I G AG ACAGGGTTTCTGTT 

GCCCAGGCTGGAGTGCAGTGGCACAATCTCAGCCCACTGCAGCCTCCGCCTCCCCAGTTC 

CAATAATTCTCATGCCTCAGCCTCCCAAAGTGGTGGGATTACAGGCGTGAGCCACTGCGC 

CCGGCCACCTTTCTATTTTTCTGGTTMCTTTCTAMTGTTTGAMTGGCTTCCAGTGM 

TTTCATTTTATTATTGGGGGAMCTTCCATACTTATTTTCTTTCTTCCCAM 

AGTATACTCTCCTCCCAAMTTTAGATAGTTGTATTTTTCTGATTATTCCAAATAAGAGT 

GCTGAGAGGCTAATCACAAAGAGCAACAGCCAGAGATTTACAAAGTGGTTCTCTTACTAT 

TGAACATTTTCACTTAT 

Sequence 816 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCTGCCCAGGTAAGATCAC 

TCGTGGGTMGMCATGAGGTTCTCACCCGTMGGCAGGATTTTTATAGAAGGAAGGTAG 

GTCTTTCAACCTATGTCCTCCTTCTGTTCCACAAAGTGGAAAGCCACAAGCCCTACAAAA 

GCCTTGCAAGTCCCAGAGGCTGCAGCCGTATTTATTCTTCAGGCCAAGACTCTCAGGACA 

GAGAGCACCCATGCACCCCGCAGGCTGCAGGCCATCTCCCTGCATTTGGGACTGTCCTGA 
GGATGGCGGCTTCATTTTTGTCCCTCCTACCTCTGA
Sequence 817 

GMCCTAGGGC GATTT GGAGCTACCCNCGGTGGCGGCCGAGGTACATTTTGGCAAACCGT 

GMGGGCTTTCNTTTTNGCAGGTTGGACTTCCCCCCCCTAGTNGGCAGG AI I I I I i I I AG 

GGGACCACCTGAGAMGGTCTGTTACCGTGCATAMCCTCCTTTMCACCTTTTAAAAAC 

TCTTCTGGGGGCCGGACTCAGTGGCTCATGCCTGTAATCCCACCACTTTGGGAGGCTGAG 

GCAGATGGATCACCTGAAGTCAGGAGTTCAAGACCAGCCTGGCCAACATGGTGAAACCCC 

GTCTCTACTAAAAATAGAAAAATTAGCCAGGAGTGGTGGCAGGTCCCTGTAATCCCAACT 

ACTTGGGAGGCTGAGGCAGGAGAATTGCTTGAACCCAGGAGGC 

Sequence 818 

GCCAGGAAACCCGTAAAAAGGGCCCNGTTTGTTGGCGGTTTTTTTCCATMGGGTT^ 
CCCCCCTTGACCGAGGCANTTAACAAAAAATNGACNGCTTCAANGTCAGAAGGTGGGC 
Sequence 819 

CCCCCCGGTGGCGGCCCGCCGGGCAGGTACTGGGAAATGAGGCAAAAGTNTNTCTCTTCA 
CTGCTAGCTCCTTGGGGACCAGCAAGCGGCTCTCAAGTTGCGTGGTGGCCCAACTGGAAA 
AMGGCAGTTCGGTGCAT CCTG GGAATATCCAGGTGAAAGTGTGAGATTTACCTAGAATA 
GCTTCTGGGCCTCTGGGGTTTTCACGCTGTCTCTGGTGMGGTGTCCATTTTAGAACTGA 
AGCAAAAAGGTn"CMTCCGTTCCGTTTTCTTTGTTTTAGCACTTACCCCAGNNNCCTCC ■ 
ATAACAAAGGTGGNGCCCCTTCAGGGAAATTAAA I I I I I 1 1 1 1 CNTAAAGGCCTTGGCAT 
taanccn i nrm IGNGGGNNGGGNAAAN I I I 1 1 M 1 1 I I 
Sequence 820 

TAGGGCGNTTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATCACCCTGCTGAGGGACAT 

CCAGGNCAAGGTCACCACACTCTACAAAGGCAGTCAACTACATGACGCATTCCGCTTCTG 

TCTGGTCACCAACTTGACGATGGACTCCGTGTTGGTCACTGTCAAGGCATTGTTCTCCTC 

CMTTTGGACCCCAGCCTGGTGGAGCAAGTCTTTCTAGATAAGACCCTGAATGCCTCATT 

CCATTGGCTGGGCTCCACCTACCAGTTGGTGGACATCCATGTGACAGAAATGGAGTCATC 

AGTtTATCAACCMCMGCAGTTCCAGCACCCAGCACTTTTTACCTGAATTTtAC 

Sequence 821 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGCATGCAAAACTCCAGATTCCTATCTTC 

TTTGGGGGAAAAGCAMTTGGMGCTCTGACAATGCTGGGCTTTACTTTCCCACATAGCA 

ACCATCAGTTGGAGCTGAGACACCTCTGCTCTCTTTAGAMGAATTATTAATGCTTCAGT 

CTCCATTATTGCT TCCCT MCAGTGAGGATMGTTATTGGCATCAANCCTGGCCGGTTTA 

NCTTGGGGGTTTATTTT^NNNTTT^^ 

CNGNGGGGGGGGGGAANTNTNNNANNANGGNNGGGGGGGGGTTTNTCTCNNCCCCCCCCA 
Sequence 822 

CCGGCAGGTACGCGGGGAGGTCATGCCCGTGTGAGCCAGGAAAGGGCTGTGTTTATGGGA 

AGCCAGTMCACTGTGGCCTACTATCTCTTCCGTGGTGCCATCTACATTTTTGGGACTCG 

GGAATTATGAGGTAGAGGTGGAGGCGGAGCCGGATGTCAGAGGTCCTGAAATAGTCACCA 

TGGGGGAAMTGATCCGCCTGCTGTTGMGCCCCCTTCTCATTCCGATCGCTTTTTGGCC 

TTGATGATTTGAAAATAAGTCCTGTTGCACCANATGCAGATGCTGNTGCTGCACAGANCC 

TGTCACTGCTGCCATTGAAGTTTTTTTCCMTCATCGTCATTGGGATCATTGCATTGATA 

TTAGCACTGGCCATTGGTCTGGGCATNCACTTTCGACTGCTCAGGGAAGTACCTCGGCCG 

CT 

Sequence 823 

ACACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGTGTT 

AGCTATTATCATCACCCTCCTTGCTAGGCAGAGCAGGACAGTGGGGAATTGATGTTTCCT 

CCCCTCCATCTCACAGGTGGGGCAGGGGTGTGCTGAGAAGAGAACTTGGGACTCTTGGCC 

CCTGTTCAATTCTCTGCTTAACCTGCTAGGCAATTTGGGCCTCTGAAAATTCAGTAATCC 

TCATAGCAACTTAGACGTCACCTGGGCCTGTGGTCCCCTTCCTAGCCTAGGAGTCAGAGC 

ATGMGCTCCATCTGTCACATTGGTTTGTTCAGAGAACTACACATGCGTTTTATTTTAGC 




AGCATACAGGTTCCCACTTAGGCATTGAGAGGACATAGGAAGCTGTTTAAC7TCCTA 
Sequence 824 

ATCACTACTTAGGGCGMTTGGAGCTCCCCXaCGGTGGCGGCCGAGGTACTAGTTATTTTA 

AATTCCACTCATAACTTATCGGCCAAAAGTAGTCACATGGCTCCACCTAATCACAAGTGG 

AGCGGGAAGTGCAATQCTACCTTGCCTGGGGAAGGTATAGAGATAGACCAGCACTAATGA 

CTACCACACTTCGCTAAGGTCACATAATAAATAAGCATCAGACATCAGGTGTGGTGGCTC 

ATGTCTATMTCCCAGCACTTTGGGAGGCTGAGGCGGGCAGATCACTTGACTACAGGAGT 

TGGAGATCAGCCCGGACAACATAGTGAAACACGTCTCTACTAAAAACACACGCAAAAAAA 

TACGAGGCATGGTGGTGCATGCCTGTAATCCCAGTTACCTGAGAGGCTGAGGCACGAGAA 

TCACCCTTGAACCCAGGAGGCAGAGGTTGCAGTGACCGATATCATGTCACTGCAGTCCAG 

CCTGGGTGACAGAGCGAGACCTTGTCTNAAAAAAAAAAAAAAGAAA 

Sequence 825 

CCGCGGTGGCGGCCGAGGTACAGATGTATGGATCTCATAGCATTGAGGGGTCTTTCAGAT

TATGTTTTCAAACCCCTCACTTTCTCTTTTCAGATAAGACCACAGCGACCTGGGAAAGTG 

CAACGTCTTAGCCAMGACACAGAACTATTTAGCGACACTGTCTAGACTCTAGTTTCCAT 

GTCTCCTGACTTCAGTCTAGTGTTCX^ACCCCTGCCGCCCACCCCTGCCCCATCCTCATTC 

CTCCTGTAGGAGAGGCCAGACCTTTGCCTGCTGCAGCTTGTGGCTCTTCTCCTGCCTTCA 

GTTNTTCCATTGCCTG 

Sequence 826 

GGGNNAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGCAGGTACCTGTCTGGGCAACACT 

GTCCCGNNGGGGCCCCCATGACCAAACTAACTCTGCTTCTACCCAGAAAGGGTGCAGAGT 

GGCCACTAGACTTTTATGTGGCAMTGGGATGGTTATGCCCAGCXrrGAAGCCAAGATGCC 

CTTTCTGGTTGCCTTGATTTGTGTTTAACAGCTCCAAATGCTTAATGAGGCAGTAAGAGA 

CGTCTCTCTTGGGCAGTACCT 

Sequence 827 

CNAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGGTGGTGTGGCTACTACCGTTACA 

ACTGCCTGTGCTTGGACATGGACCCTCTGCAATATGCGGCAGTTTCATTCATTGCCCCCT 

ACATTCTACACCAAGTAGAAATGGAAGGCAATTGGATACTTCACAGACAAGATCTAAGTG 

GAGAAGGAATGCGTCCTGTGGCTGCAGAGATCCTTGGAGCTTGGAGGGGAGAGCTTGAGC 

CCCACTGATGATGACCTCCCACAGCTCGCCAACTCAGCCCTCCCTAAGTCCCCATCGGGG 

GCCAATTCTCACTCTGGGGTTGGGGGGACTCCACCATAGCTCATCCATCATAGGGGATGT 

TGGTATCTACTGTGGGTTTGGGTAGGGCCCGATGTGCTGAGGATGGCTCCCCCACAAGCA 

AGAGATGTGGGATTTGG 

Sequence 828 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATCACCCTGCTGAGGGACA 

TCCAGGACAAGGTCACCACACTCTACAAAGGCAGTCAACTACATGACACATTCCGCTTCT 

GCCTGGTCACCAACTTGACGATGGACTCCGTGTTGGTCACTGTCAAGGCATTGTTCTCCT 

CCMTTTGGACCCCAGCCTGGTGGAGCANGTCTTTCNAGATANGACCCTGAATGCCTNAT 

TCCATTGGGCTGGGGCTTCCACCTACCAAGTTGGGTGGGACATCCATGTGACAGAAATGG 

AGTCATCAGTTTATCAACCAACAAGCAGCTCCAGCACCCAGCACTTNTACCTGAATTTTA 

CCATCACCAACCTACCATATTCCCAGGACAAAGCCCAGCCCGGCCCCCCCC 

Sequence 829 

CGAATTGGAGCTCCCCGCGGNGGCGGCCGAGGTACCTGATCTACTCCTCTCTACAACAAC 

CTTGTGGGTGACGTTATTATCTCCA7TTCACAAATGAGGCCACAGAGGTTCTAAAGGGTA 

AATGACGATGATGATGAGAGGTAAGTGATAAAACAATGTCTCCTGACCACAAATCCTGGA 

ATTTAAACATAAGNGTAGTAAACATGAACTCTAGGAAGCCTCCTGGGGCTTCTNCCTGTG 

TCTGGAGCCCCTGCACATGCCCAAAGGAAGTCCTTTTGGTTCTNCGNTCAGNAGAGAAAG 

GGNGCATTTCACTAAAAGGGAGGTGGGGAAACAAGACTGGTGGTAGGG 

Sequence 830 

CCGCGGTGGCGGCCGAGGTACATTATTCATATCCAGCACTCCCTGCGGCTGCTGCTGGAG 
GAGCAGTTATCCAACMGGACTGTTTCAACCTCATCGCGTTTGGAAGCACAATTGAAAGC 
TGGAGGCCTGAGATGGTTCCCGTGAGTCACAACAATTTACAAAGTGCCTGGCGGTAGGTT

ATGGGCAGAGACTTCGTGGGGCTGTGTCTGAGGGMGGTTTGCAGGCATTGTTTTCTCTG 

TCCCCCTCTCCACCAAGAAGTAGCTCTCTAGAGTCCCTGACCCCAAACAGCCATGGGCAG 

AMTCAGAAAACAGCTTCCTTCTGTCTGCTGCTCTCCCCACCTGGCCATCTTCACTTTAT 

GAGAGTAATGACATCGACTCCATTCACGTCTGAGATGGAAAAGGCTCTCAGCTACTCCCA 

AAAGGTATGCCCTGGGCATGG 

Sequence 831 

CC GCGG TGGCGGCCGAGGTACGCGGGTMCAGGAGTCTTTGCTGAGTGATCATCTGTTTA 

TTCTTTTACTCCACAAATATCGAATGTTTACAGCGTGCCTGGCACTGAGCAGGGCTGGGG 

TTTCCTGACCATATGGACCTTCCTGGGTATATCTGTGGGGCTTGAATGGTGTGTGACCTT 

GTGTCCTGCCCG 

Sequence 832 

CGGGCAGGTNCGCGGGGGTGTTTATGGGAAGCCAGTMCACTGTGG(XTACTATCTCTTC 

CGTGGTGCCATCTACATTTTTGGGACTCGGGAATTATGAGGTAGAGGTGGAGGCGGAGCC 

GGATGTCAGAGGTCCTGAAATAGTCACCATGGGGGAAAATGATCCGCCTGCTGTTGAAGC

CCCCTTCTCATTCCGATCGCTTTTTGGCCTTGATGATTTGAAMTMGTCCTGTTGCACX 

AGATGCAGATGCTGTTGCTTGCACAGATCCTGTCACTGCTGCCATTGAMGTTTTTTNCA 

ATCATCGNCATTGGGATCATTGCATTGGATATTAACCCCTGGNCAATNGGCTTGGGCATT 

CMTTTGACTTGlvn"AAGGGAAGTNCCTCGGCCGNTNTANAACTAGNGGGATCCCCCGGCT 

GGANGMTTTCAATTTNAACTTATTGATACCGTCCANCCTTGNGGGGGG 

Sequence 833 

ACCGCNGTGGCGGCCGCCCGGGCAGGTACATCACCCTGCTGAGGGACTTTTNNGGACAAG 

GTCACCACACTCTACAAAGGCAGTCAACTACATGACACATTCCGCTTCTGCCTGGTCACC 

AACTTGACGATGGACTCCGTGTTGGTCACTGTCAAGGCATTGTTCTCCTCCAATTTGGAC 

CCCAGCCTGGTGGAGCAAGTCTTTCTAGATAAGACCCTGAATGCCTCATTCCATTGGCTG 

GGCTCCACCTACCAGTTGGTGGACATCCATGTGACAGAAATGGAGTCATCAGTTTTATCA 

AC 

Sequence 834 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCGCGGGCAGGTACCTTACCACCC 

CATCCCCAGAGCATTGCATGGGGTGTTTGGCACACAGTAGGTGCTCAATGTAAACGTGTG 

CACTGTGGCATGTTAGAGCCAGACAGGATCTCATCCAGCCCGTTCTCTGCACCCCTCCCT 

CCCCTCTCCAAGTAGCCCTGCTGTGGGTTCAAGTAAAGAGGGGCTGGGGCGCTGGTCTGA 

TTGTGTGGGTGATTTGGGGAGATCTCTTCCTCTTCCGGAACCCCAAAAGGTTGGGACAAA 

CACAGCMCAAGCCCAGCTCCCTGAATTTCAGTGATTCATTTGTGGGGATAAAGGAGTGA 
ATG 

Sequence 835 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTAGTTATTTTAAATTCCACTCATAACTTATCG 

GCCAAAAGTAGTCACATGGCTCCACCTAATCACAAGTGGAGCGGGAAGTGCAATCCTACC 

TTGCCTGGGGAAGGTATAGAGATAGACCAGCACTAATGACTACCACACTTCGCTAAGGTC 

ACATAATAAATAAGCATCAGACATCAGGTGTGGTGGCTCATGTCTATAATCCCAGCACTT 

TGGGAGGCTGAGGCGGGCAGATCACTTGACTACAGGAGTTGGAGATCAGCCCGGACAACA 

TAGTGAAACACGTCTCTACTAAAAACACACGCAAAAAAATACGAGGCATGGTGGTGCATG 

CCTGTAATCCCAGTTACCTGAGAGGCTGAGGCACGAGAATCACCCTTGAACCCAGGAGGC 

AGAGGTTGCAGTGACCGATATCATGTCACTGCAGTCCAGCCTGGGGTGACAGAGCGAGAC 

CTTTGTTTCAAAAAAAAAAGAAG 

Sequence 836 

GNGGNGGCGGCCGAGGTACTTTAACANGCCATACTCCAGTCCCAACAATGTTAAATGCCA 

AAGCAGTGTTGGTAAAAGCCTCAAATGGTGAAAAGGACAGAAACTCAAACCCGCCCTTGT 

GCCAGTAAGTAACTGTTACTTATCTCACAAAGCGCTTGGCTCTGGAAACAATCTAACTCT 

GAGCTGCACGTGGAGTCTACATGGGMTGTGCAMGCATGTATTTCCTTTTAGGTGCAGC 

AGAGGTAACCGAAATTCAGATAAGAGAAAAAAAATCCAGATTTCAATGCAAGAGGTGGAA 
GATCCACGAAGATACTCGTTACTATTTGGTTTCTAGGAGCAGGATTGCCACTAGATATGA

TGGAGAACAAAAATGAAGAGGTGTTGTGTAACAAAACAAAACAAAACAAAACAAAAAAGT 

AGAAAGAAAGAGCAACAGGCCGGGCCGCAGTANCTTCATGCCTGTAATCCCAGCACTTTT 
GGGGAGGCCCAG 

Sequence 837 

NTTGCGTTGCGCTNACTGNCCCGCTTTCCAGTNCGGNGTAAACCTGTCGTGCCCAGCCTG 
CATTAATGAAATCGGCNCAACGCGCGGGTGAGAGGCCGGTTTGCGTATTTGGGCCGCTCT 
TCCCGCTTTCTTCGCTCACCTGACTCCGCTGCGCCTCGGGTCGT 
Sequence 838 

CGCGGTGGCGGCCGAGGTACTAGGCACTGAAGATACAATAAGCAATCCAGNAATNCCTCT 

TTGAAGAATTTATTCTGATGAAATAGAGACAAGCCTATTAAAGTATTCAAGGCAACATTA

CTTAMTATTTTATTTTATTTTATTTATTTTTAGATGGAGTCTCGCTGTTGTTGACCAGG 

CTGAAGTG 

Sequence 839 

AGGTACCTGTAGTCCCANTTACTTTGGAGGCTAAGGTGGGAGGATGGCTTGTGCCCAGGA 

GGCGGAGGTCACAGTGAGCCGAGATCACACCACTGCACTCCAGCCTGGGCAATAGAGCCA 

GGCCTTGTCTCATAAATGAAATATAAAATATAAAATAAAATAAAATAAAACGTTGTTGGC 

AMGATGT ATTCAA AATAGTATGTAGMCTTACCTGGAAGGTAATTTTGCAATATTTACC 

AAMTATACTTTTTCATCTAGTAAGGTCTCCTTCTCAGAATCAGTCTTAAAGCCATTCAA 

ATACAGATTTATGATTAAAGATTTAAGTCCAAAAATGCTCATTATAGCAATATTTATAAT 

AGGAAAAATTGGGGAAAAACAATTATACATCCAACAGTAGTAAGAGTGTGACTACATTAT 

AGTATTAGTATGTAATGGGATTGTCACAGAGCMCACACATGTTTTTGAAGAATATTTAA 

GGGCGTGATAAATATTAAATGTAAATGTAAATTGAAAAAATGATATCTGTAGATTTTCAT

TATGCATTTCTTTATGAAATTTTTNGATATACACAAAANAAAATAGTCATGCATTTGCTT 

CATGACGGGGACATATTTCTGAGAAATGTGCTGTTAGTCGGTTTC 

Sequence 840 

GGCGGCCGCCGGGCAGGNACANCTTCCGTGGGGGGGCNNAAAACCCCCCACCNAAANCAA 

AANAGCANCAAAGGAAGAATTNNTTAAGGGCAGGGGGGGGAAGCCCCNAAAAACCNGANG 

GCAAANNCCAGAGCNGNGGNNNAGCNCNAGNNNCNGGGAAGAGCAANCANNACGCTTTTG

TGAGGTTTTNGGNGNNGGAGGGGGGGAACCCCCCCGANAAAAAACCCAGNAGCCCCCCCA 

CNNAACAGGGGANGAANAAGAGGAAGGNAAAGGANNNAANNAAGAGCCACAGNCCNGGAA

ACANGAAANCANNANNNCAGACANGCCNNNGAANANGNNGCCNACNNNAAAAGAAGGNNN 

GGNGCGCCANNGGGGAGNAAANNGGAANNANNAAAAAAAAAAAGGAAAANGNGAANGAAN 

GAGGANAANANNNGGGGGAGGGNAAANGGGGCGGGGGNNNAANGCCNAANAANANNNAGN 

NNNNGGGGGAGGGCCNG 

Sequence 841 

AGGTACCTTACCACCCCATCCCCAGAGCATTGCATGGGGTGTTTGGCACACAGTAGGTGC 

TCAATGTAAACGTGTGCACTGTGGCATGTTAGAGCCAGACAGGATCTCATCCAGCCCGTT 

CTCTGCACCCCTCCCTCCCCTCTCCAAGTAGCCCTGCTGTGGGTTCAAGTAAAGAGGGGC 

TGGGGCGCTGGTCTGATTGTGTGGGTGATTTGGGGAGATCTCTTCCTCTTCCGGAACCCC 

AAAAGGTTGGGACAAACACAGCAACAAGCCCAGCTCCCTGAATTTCAGTGATTCATTTGT 

GGGATAAAGGAGTGAATGATAAAGTGAAGGACGACTGTCCCCGCG 

Sequence 842 

TAGGGCGAATTGGANCTCCCCGCGGTGGCGGCCGAGGTACTCCCAAAAGGCTATGAAATN 

GGGGAAAACCCCAGGTGATTCATGCCTGCTTAGCTGCAGNATNTCAGTNGCANTAGGTGG 

AACCCCAAACCCAGNGCANAGTGCCAGNGTCTGCTTNGGTGAGATATGAGTGTCAAGTCT 

CGAACCAAGCAACCTATCNAAAGCCTGNGACACTCCTGGCCACAGGCGGNTGGTANAGGC 

ATAGNANACTATTGCCCAGGTGACGTGACTTCACAGATGCTGGGAAGCCTGCTGCCCCAT 

CCAATACAATACTGCCCACTGTGCATAGAAACCAGATTCCAAAGTTAGAGCTTCGTTTTG 

GCCATGAGTGCMTTTCACTGCAATGTTTTATCTTACTCAACTGCCAGGGTCAA7TTAGG 

TGGTAGGGCTAMTCTCCTTCTTTATATTGGTCCAMTGATTTTTCCTGATGCTGCATTC 
CCGGA
Sequence 843 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTGTGCTTAGACCAGGAACACAGGGAGGTAGAG 

GGCAGCAGAGCAGGGACTGGCTTCAGAGCCAGACAGGTGGCTATGTGACTTAATGTGTCT 

GAACCCTGGTATCCTAGTCTATTAAATGGTATAACAGCAGCTTCTAGTATGTAAGTTCCT 

TGTCGGGAGAAAAACTGTTTTGCTCATGGCTGGAGCCTTAGCATGTTGCATCATATTGAA 

CATGTAATAGATGCTCAATAAATATATTTTTAAGAATAAATAAATGTAAATGAAAATTAC 

TTCACAGTGTTTCTGTAGAGATTTTATAAGATATGGTATACACAATGCATAACATAGGAA 

CTGACGCTCAAAMTGCCAGTTACTTCCATCATTGNGTCATAGGCTTTTATGTTCATTAT 

CCTGCTGCATCATCCCAAAGAA 

Sequence 844 

CCGCGGTGGCGGCCGAGGTACGCGGGGAGGTCATGCCCGTGTGAGCCAGGAMGGGCTGT 

GTTTATGGGAAGCCAGTAACACTGTGGCCTACTATCTCTTCCGNGGTGCCATCTACATTT 

TTGGGACTCGGGAATTATGAGGTAGAGGTGGAGGCGGAGCCGGATGTCAGAGGTCCTGAA 

NTAGTCACCATGGGGGAAAATGATCCGCCTGCTGTTGAAGCCCCCTTCTCATTCCGATCG

CTTTTTGGCCTTGATGATTTGAAAATAAGTCCTGTTGCACCAGATGCANATGCTGTTGCT 

GCACAGATCCTGTCACTGCTGCCATTGAA 

Sequence 845 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTTCTAACCCTAAGGGATTCCTACAGCTTTTCT 

GCATGTTAMTAGTCTGTTTTAGCTTATTCTCTTATTACTTGTCTTGGTTTTTACTTTGA 

MGTTTGCTTMTMTCATGGGMTATTTTAGATTTTAAAATACAAAATATACAAGCTAA 

ACTTGAGAGCAGTTTTTAGTTGTAGAAACTGTTTCTTGAAGTAATTGACTTAGCGTTTGC

TCTGCCTCTTTCTTTTCTTACCTAGGTAGGTAGTGGGGACTCCTTCAATTATCTGAGCAA 

TTCAAATCTCAGAATGTAGTGTTGGGTAAATTGAGGGTT 

Sequence 846 

CCGCGGTGGCGGCCGAGGTACATCACCCTGCTGAGGGACATCCAGGACAAGGTCACCACA 

CTCTACAAAGGCAGTCAACTACATGACACATTCCGCTTCTGCCTGGTCACCAACTTGACG 

ATGGACTCCGTGTTGGTCACTGTCAAGGCATTGTTCTCCTCCAATTTGGACCCCAGCCTG 

GTGGAGCAAGTCTTTCTAGATAAGACCCTGAATGCCTCATTCCATTGGCTGGGCTCCACC 

TACCAGTTGGNGGACATCCATGTGACAGAAATGGAGTCATCAGTTTATCAACCAACAAGC 

AGCTCCAGCACCCAGCACTTCTACCTGAATTTCACCA 

Sequence 847 

TGGAGCTCNCCGCGGTGGCGGCCGNGGTACTCCAAGCAGTCCCAAAGTGGGAGTNCTTAA 

AACACCATGGGCAGGTGAATGGCTGACCAGGTGGAGGTGCACAGTGCACCATGACAAGAG 

CAGTGGAAAATGGGTGAATCTGAGATGCCTGGAGGCGAGGGGGAAAGAGCACATCACAGA 

GGACAACGTCCANNGGGACACCCTTTTATA 

Sequence 848 

CCGCGGTGGCGGCTGTGGACTGAAGGGTGACTGGTTCCACTGTGGTCTCCATGGGAACM 

GTTG7TTCTGGAGTCTTCCAAGGAGAATTTCTCACAGTGGACCTGATCTCTGGGCTGATG 

CTGGGTTCCTTGGAGCTCATGATTTTGAGAGTGGTAGACATTTCTGGGCTTCCTGGGGAT 

GTGCCTGCTGGACTGCTCCCCGTCTCCTCTGCTGGGGCAGGCCACGTGGAATTTCCTTGT 

GCTGCCTGGCTTGACATCTTTA 

Sequence 849 

CCGCGGTGGCGGCCGAGGTACCTGAAGAATCTCTCTTCAGCTCTCTTCTCCTGGAAACTT 

GAGTGGGGCAGGAGGAAAAGCGGAGCTAGGTGTCATTTTAATGAGGAACATACTTGTCTC 

CTCCATTTATCTGGCCCTCCCTGATGGCACTCCAGAATTCCAATCCCACACGATTAACAA 

CATAGTTTCCCTTTTCTGCTTGAAGGTCCATTCTCCTCTCAATTTCAAATCACCTGAGAT 

ACAAAGCTGCATTTCCCCACAAGAACCAGTTCCCTCTCCTTTCCTTCAGTGCTACTGTCC 

TTCTCTCAGACCACCAAGCTTAAAAACTCCAGAGGCTCAAACAGCAAAGATGGCAGCCCG 

CTCCTCCCTCTGGGGAGTTCTGGCCCAGGGAGTT7TCAAATTTCTGTAGGCGGAAGAATA 

CTAGCGGGGAGTGGCTGGAGACCCCAGTTGGTAGGGNTCCACAT7TGGGGGAAGTGAGCC 
CAAGCTTTTTN
Sequence 850 

CCGGGCAGGTACATGAAAGTAAGATCACAACCACAGGAACCACACAAAATTCAAGGCACC 

AGAGGAGCCCAGACTTGGCTGGCAATGCCTGTTTTGGAGCTATTCCACATTTCTGGAAGT 

CAATGGGAATACCGGAATATGAAAACACTATGAGGCCGGGCACAGTGGCTCACGCCTGTA

ATCCCAGCACTTTGGGAGGCCGAGGCGGGCGGATCATGAGGTCAGGAGTTCGAGACTAGC 

CTGGCCAACATAGTGAAACCCCATCTTTAATAAAAATACAAAAAATTAGCCGGGCGTGGT 

GGGGGGTGCCTGTAATCCCAGCTACTCCGGCGGCTGAGGCAGGAGAATTGCTTGTACCTC 
GGC 

Sequence 851 

CCGGGCAGGTACTTTTTTCTTTTTTC I 1 1 1 1 1 1 1 1 1 1 1 GAGTGGGGGCGGGGTTTCGCCA 

TGTTGGCCAGGCTGGTCTTGAATCTCGGGTGATCTGCCCGCCTCGGCCTCCCAGGGTGCT 

GGGATTGCAGGCGTGAGCCACCACGCCCGGCCTCGATATATTCTTACAGTGGAATACTGC 

TCAGAAATACTGATGAATCTTAAAAAACATGATGTTTAGCAAAAGAACCTTGGTATAAGG 

TTCTTGGTATAAGGGATACATAGTCTATGATTCCATTATATGAAATTCTAGAACAGGAAA 

AACTATAGTGAAAAACAATCAGATTAGTGGTATCTGGGGTAGAAAGTAGGAGGAGATTGA 
TT 

Sequence 852 

CNANAGGGC 1 1 I rTGGGGGCAAAACCGCGGNGGCGGCCGCNCNAGAACNAGNGGANCCCT 
NTTGGGGGGGGAAAAAAACCCCAAGCCCACCGANACCGNCGACCNCGAGGGGGGTNCCGG 
NACCCAGNGNNNGNCCCCTAAANAGAGGGNNAANNGCGCGCNNGGCGNAANCANGGNCAN 

AGCNNGNNNCCNGNGNGAAANNGNANCCGCNCACMTTTNTCTTTTNTAGNCGAGCCGGG 

AGCAGAAAGCCCNAAGAAAAAAGN 

Sequence 853 

AGGTACCCACAGCCCTTTCTTTTGGAATTCCCTAGAAGGGGTCTGTGCCACATACAGGAA 

GTAGGGAGGGTGTCTTTGCAGCATATTTCTTCTCTTGGAGTTAACTGCGAACGTTGCACG 

GCGACCTCTTGATCCATTCTGTGAAAGCCCCAAG(XTGTCATGCAATAAAGACGTCCAGT 

TTCACCGCAGCAGGGAGGCCGCATGAMTATTCCACTTGMCAAAACCACTTAGCAGTTT 

ACATCAATGCTTACCCTGTCGCATTGAAAGTGATGTGAACCCACACCCAAGAGCCCCCAA 

ACCAGCACGTTGATACCAAGTTTCCCCAGCTGCATCCAAATCAATTCCTTCTT 

Sequence 854 

CNTTTGGGGGCTAANCCGCGGGGGCAGCANCNNCACCGGCAGCGCGCNATTCTTTGGGGG 

GGTGNAAAACCCCCAGACCCCCGAGCNCNNCCACTCGNCNCACCANTNTAGAGACTGGGG 

CCGNTCCNCNnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnl 1 1 1 1 1 1 IGGGGTTTTNAGNGAGGGGAANCG 

CNCGCAAAAAAANAATCNCGGCCANAGGGGGGCCNGGGNGAAANTGTTANCCGNGCACAA 

NNNCACACAANANACGAGCCGGGAGCAAAAAGAGGAAAAGCCNGG 

Sequence 855 

CCGGGCAGGTACGCGGGCTACACACACAGTTCGGATGCCAAGGGTGACACCCCATTCCCT 

TCACAAGAGGCGGTTCTGTCACAAATCAGCACTCCACCCCCACCACACCTCTCAGTGAAT 

GAAGTGCTGGTGGTCTCACTCCCCTGGTGACCCTTAGCCGTGGGATGGGGTGGTTACACT 

AAGGCTTCAAGCTGAGAATGGCCATCATGGCGGGAGGCTGTTTGCAAAGGCACCTTCTGT 

CATCCTGGGGTTGGCTAAGTCAACTCCACCCCTTCCCAAAAAAAAAAAAAAAAAGTACCT 

Sequence 856 

GCCAGGATTCAAACCAGGGANTTTGCTCCAGCACTCCGGCTCTTAACCTCAACCGTCTGC 

CTCTCCACAAACACCAGGATCAACCACCAAGACCAAAAAAACAGTCTCACAAACCATCAA 

ACATTGCACTTGGTGGCTCAGGACCTTAGCTTCGTCTTAAAGGTCCCTGTTATGCTTTTT

CTTTTTGCCCCAGTGTGGAGTGGTCTTCGTGTTTGTGAGTGCAGGGGTCAGGGGTTGTGT 

CTTTTCTTCTTGTNCCCTTCCAAGAGGTGACATGTATCCTTGATACTGGAAGGGCCCTT 

Sequence 857 




AGG TACGC GGGCACTCCAGCCTAGGCAACAGAGCCAGATT CI 1 1 1 1 I 1 1 1 1 1 1 1 1 I A AAA 

AGTCTTTTAAAAMTTCTTTTATTGTGCTGATTTTATTGTGTCATGAAGTGTAATATCGC 

ATGTAGGTATGTGTCAAGTATACAGAGTGTCAGGCATACGGTGTTCAGTCATAAGCAGTT 

CTGGCCTTTGGCCCTGCACTGTTTGTTGGCTTTTTAGGTAGGAACCTTCTTAGAGTAAGA 

CTGTCATGCTAAAATTGTAGCAATCAAATGTGCCCCCCATACAACTCATTTGAGGTTGAG 

ATTATGTTGCTAGAGTGGAGGAGATTGGAGTGTTCTAAATGCTAACAGTTTGTCTTGCCT 

Sequence 858 

GGCGGCCGCCCGAGAACGAGGGNAGCCTTTGGGNGGGGGGAAAAAACCCAGnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnAGGNACCCANCGCAGAGNCCCCAAAAGCGAGG 

GGCAAGNGCGCNCAGGGCGGAAANAGGGCNACAGCCGGGGCCGGAGCNAAACAGGCACCC 

GCCCACANTTNNTTACAACANACNANCCGGGAGCATAAAGCCGCAAANCCAGNGGAGCCA 

AAGGAGGGAGCGAACCCACAG 

Sequence 859 

GGCAGGTACAACCTCCATCCCCTGGGCTCAAGCGATCCTCCCATTTCAGCTCCCCCTGTA 

GCT GGGA CTACAGACACACACCACCGTGGCTGGCTMTTTTTGTATTTTTTGTAGAGGCA 

GGGTTTTGCCATGTTGCCCAGGCTGGTCTTGAACTCCTGAGCTCAAGTGATCTGCCTGCC 

TCGGCCTCCCAAAGTGCTGGCATTACAGGCATGAGCCACCATGCCTGGCTGGGACATTAT 

TCAMTTGMGTGAGGACATGATGTTAAAAAGTTCTGGGCAAGTATTTTACAAGTTAAAA 

TACAGATGTAAGACTTGACTTGATCAAATGCCCAGCTCTGTAATTTCACCTAAATTG 
Sequence 860 

AGGTACAACATTGTGAAMTTTTCCTATCATTTCCTCTAAAACTCTGCAGGCATGGAGGC 

TGTCTGCTGAGATATAGCAGGGMCMTTTAACCCTTTGTTTTGCACCCACAAGATGAGC 

ATTACCAAATTCCCAACAGAGACATGCTGGGGCTGTGCTGTGCCTCCACCCTGCCCCCTC 

CACAGCCAGCTCCGCGTGTCTCAGTCTGTAACTGATAGCATTCCACTTCCTGCCTGTACC 
TCCCCGGC 

Sequence 861 

CNGCGGCGGCCGAGGNACACNNANAANAACCTTTTAAGGGGNGGAAAAACCCAANCCCCC 

CGNCCNACCNCAGNGACNNGANGNATNTACNGGAAACAGGGCGCAGCCCNAGGAAGGACA 

GNNGAAGNCCNNACNGNGCAAGNCNAAANNNNAAGGAAAANNAGNCCCGCGANGAGNNNC 

CNCANGCTTTNNNTNTGCGGGGACCAGNCAGGGGNCGNCCCCGACANAANCAGGGCNCGC 

CGNCGGNGGNAGAGNCACNNNGCAGGGGNNGGNGAAGCNGCNCCCANCCANGGACCNNGG 

CCGCNCNAGAACNAGNGGANCCCCCGGGCNGCAGGAANNCGANANCAAGCNGANCGANAC 

CGNCGACCNNGAGGGGGGGC 

Sequence 862 

AGGTACI I I I I I 1 1 1 1 1 1 I 1 1 I 1 1 1 1 1 1 1 GGNAGAGATGGGGCTTTACCATGTTGGCCAG 

GCTGATCTCCAGCTCCTGACCTCAAGTGATCCACTCGCCTTGGCCTCCCAAATTGCTGGG 

ATTACAGGCGTGAGCCACCGCACCCGGCCCAGTCTTCATATTTATAAATAAAGTCTTGTG 

GGMCACAGTCACATTTATTCATTTACATTTTGTAAATAGCTGCCTTCAAGAGGTAGAGT 

TGAAAGTCTTTAAGACAAAGATCAGGTGGCTTGCAAAACCTAAAACGTCTGGTCTTTTAC 

CTAAAAAGTTTGCCAACCCCAAAACAAGAATATATTCGATTGTAATCTTTACAAAGGTGT 

TGGCTATATTTCAGTGTGT 

Sequence 863 

NCTATAGGGCNAATTGNAGCTCCCCGCGGTGGCGGCCGAGGACAAATTCAGTCCCAATAC 

TCAATACGTATTATAGATGACTATGAGTGCAAACCTTAGGATGNGATTNTCTGAATAATN 

G^CITrGTAGGATTTGGTTACATTATTTAAAATGAAAAAGATCTAGTTTTAGTGTGAGC 

TCAGTMTGNTAATNGGTTAAGTTCATTGCGMTCTTGAGTTTTAGATAAGTAGTTATTT 

TTTTCMTATCACTTCTGTTTTTAGTGATATTATATCAAGAAACAACGTATTCAAGAACC 

ATGGCTGACAGTGCCAGATATACTTAGGGATAAACATCAAAATGCAATTATAGTTGCTAT 

AACGTTAGATACTCGGAATCAAAATTTATTTGCANGCTGACTTGATAAACTAAATGAA 
Sequence 864 
TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAGAGTCAGCAG

AAATGTGTGCTTTAAGCAGAGTCACAGGGGCCTGGGGCTGAACTGAGTCATTTCTCAAAG 

ATATCCCTGCCTGGGATGATGATGGCTCTAATTGAAGCTCTGGCATCATCTGGGGCTTTA 

TGAGCCAAGGGAGATAAGAAGAGCCACAGCAAAACCCTTGGGTCTACAGTGCAGGCTGCA 

ACCAAGGCAGCATTTGCTAGAATATTTGTGATTATGTGTTCAACCTACAACCT 

Sequence 865 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCATATTAT

ACCTTTTTATTGTTGTTATMTTATTATGGGGTATTTCTAATTAATATGATGTTGAAACC 

TGTTTGGCACCTTCTGGAAGCTACCAAAAAAATGACACTCCATTGAAGTGCTTAAAAGCT 

GTTCTCATMGAATTCTACTGGCCTATTGTAAAAAAAGAAAAAAAAAAAAGAAAAAGAAG 

AAAGACACAAAGAAAATAATCTAAACACCAAAAACTAAACACAATTCCAATCCTTTTTCT 
GTACCT 

Sequence 866 

ACTTAGGGCNATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACGGAAATCTGGACAGTG 

CTGCACAGATTGATACATTAGCCTTTGCTTTTTCTCTTTCCGGATAACCTTGTAACATAT

TGAAACCTTTTAAGGATGCCAAGAATGCATTATTCCACAAAAAAACAGCAGACCAACATA 

TAGAGTGTTTAAAATAGCATTTCTGGGCAAATTCAAACTCTTGTGGTTCTAGGACTCACA

TCTGTTTCAGTTTTTCCTCAGTTGTATATTGACCAGTGTTCTTTATTGCAAAAACATATA

CCCGATTTAGCAGTGTCAGCGTATTTTTTCTTCTCATCCTGGAGCGTATTCAAGATCTTC 

CCAATACAAGAAAATTAATAAAAAATTTATATATAGGCAGCAGCAAAAGAGCCATGTTCA 

AAATAAGTCATTATGGGCTCAAATAGAAAGAAGACTTTTAAGTT 

Sequence 867 

CCGCGGTGGCGGCCGAGGTACATAACATGATATCAAGGAAATGCTTGAAACAAACTTTCA 

CAATAAAGTCAGAAAAAAACTGTAAAAATTGTCTGCAATCCAAGAAAAAGCACGTGCCCT 

GTGTGTAGGGGGAAAGAGGGAAAGCACTTGCAGTGTGACTTTATGTGGTCTTTCCCCAAG

TATTGCTACGTTTTGACCTTTGGCCCAACTGAACAGGTGAAATGCCCTTCACATAAGTTT 

CAATCCCCAAGAAACTAGCTGGAATGCAGGGGACTGTAGACACACTCCTGGACCAAATGG 

CATCGACTCTCAGAATCCAAAATGGGCCCTGCCCTCATTCTGAGCTTACGGCCCCAAGCA 

TATTCTAAACAAAGCTTTTTTAA 

Sequence 868 

CTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACAGGATATCAC 

CTGMTTATTAATGAATGCCCAGGAAGTAATTTTCTTCTCATTCTTCTAAAACTACTGCC 

TTTCAAAGNGCACACACACCGCGTNCACATACACTGCATTCGTTGCTCCAGTATAAATTA

CATGCATGAGCACCTTTCTGGCTTTTAAGCCAATATAATGGGCTGCAAAATGAAGACACC 

ANAGTGTATGCATACAMTCTCACTGTATTAAAGATGCAGGTTTTCTAATTGTACCT 
Sequence 869 

GCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTGGCGACGCGCGGGACAAAG 

GGMGCGAAGCCGGAGCTGCGGGCGCTTTTTCTGCCCGCGGTGTCTCAGATTCATTCTTA 

AGGAACTGAGAACTTAATCTTCCAAAATGTCAAAAAGACCATCTTATGCCCCACCTNCCA 

CCCCAGCTCCTGCANCACAAATGCCCAGCACACCANGGTTTGTGGGATACAATCCATACA 

GTCATNTNGCCTACAACAACTACAGGCTGGGAGGGAACCCGGGCACCAACAGCCGGGTCA 

CGGCATCCTCTGGTATCACGATTCCAAAACCCCCAAAGCCACCAGATAAGCCGCTGATGC 
CCTACATGAGGTACCT 

Sequence 870 

GACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCGGGCAGGTACGCGGGGG 

ACCCGGAAGTTTAAATTCCGCTCCCTCCACGAAAGAGTTGTAGTGAGTGAAAATAAATAT 

TAAAACACACGGAAATGTATTTTCCTGGCTGCAGCACCNGCCATCTTGCCTCGGNAGGAC

TCATTTTNAAAAACAGCAGCTTCTTGAAGCCCCANAACGCATTCCTGTGCTACGG 
Sequence 871 

CCGGGCACGGTACAGAGCCCAAGACAAAAGATAGGCCTGTGAGGATAACATCTGGTATAT 
CTGACCCTTCCCAGCATGGCCAGGAGGCACAGCCAGGCCAGGGAGGGCATACTGGGTTTG 
GCTTTGCCCTGCAGCTGTTGGCCTAGGTGCTGCGGTCATACATATGCCCTNAGGCCTTTC
CATGGCTACCTACCTAGAACCCAGATT C 1 1 I 1 1 1 1 i I 1 1 1 1 GAGACGGAGTCTCGCTCTG 

TCCCCCANGCTGGAGTGCAGNGGCGCCATNTNAGCTNACTTGCAAGCTNCGNCTTCCGGG 
T 

Sequence 872 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCATTCCGGGTCATCCGCA 

GAAATTCCTCATAGATGGTAACTCTGTCTACTCTCCGAGCCAGTGGCGAGAAGTTACACA 

GGGAGTCCACCCCGGTGTGGTGCCTGCTTGNGGACAGACCTGAATGTTGAAACTTGACAG 

TCAGAAAAATAACTCTTGATGCTGCTGTTTCGGAAGAGTTGGTTGAGCGCATCCTCAATA

TTCCTTTTGTTCCTCTGGNAATTGGTGGTGCCTGGCTGGGCTTTGTCCTGGGAATATGGT 

AGGTTGGTGATGGTAGAAATTCAGGTACGAAGTGCTGGGTGCTGGAGCTGCTTGTTGGTT 

GATGAACTGATGACTCC 

Sequence 873 

ACTACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCCGCCCGGGCAGGTACAATGCTCACT 

GGGAACCAAAGTCAGGCATGGGGCTGGGCTTTAAGGAGCACAAACAAAAAGGAGGGACTA 

GAAAACTTCAGAAAGGTATTGGTGGGGGATGTTGCGGGGGGACAGGGGACAGCGAGGATG 

TGGGATCCCGAGATCGTCCAAATCCCTATGTGTAGACATATGTGTATAAAGGCCTTTAAG 

AGACTCAGGCTGATGGGGTATCTGTAATAAATCAAACATAATATAACAGCACGTCAAGTG 

ATAAGGGGACTCTGGAAAAACAAGCAGCAAAAGGAGCAGTATCAAACTCCACAGAAATTC 

ACAAACATCAAGACACCAAGAAAGCTGCATTNATTTAAATCAAGGTGACAGGCTGGGCTC 

TGTAGCTCCAGCCTGTAATCCTAGCACTTTGGGAGGC 

Sequence 874 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTTTTTT
TTTTTTTTTTTTTTTTTTTGGGATGGAGTCTCGCTCTGTCACCCAGGCTGGAGTGCAATG
GCACAATCTTGGCTCACTGTAACCTACACCTCCCGGGTTAAAGAGATTCTTCTGCCTCAN 
CCTNCTGAGAAGCTGGGACTACCAGGGGATCCCGCCCCACCCCGGGTAGGTTTTTTGTAT
TTTTTTAGNNAGAAGACAGGGTTTTCCNCCCATATTGGGCCCAGGGCTTAGGTCTCGGAA
CCTCCTGGACCCTTGNGGATCCTGCCCACCCTTGGGCCTNCCAAAAATGCTGGGGATTAT 
AAGGNGGGGAGCCACTGTGCCCGGGCCAACAATAAATTTTTTTAAGGGTTAGTCAAACCT 
AACAACAAAANTTTAAAAGGTCAATCAGTAGTTCTAACTTTTTTTTTTT
Sequence 875 

ACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAAGACTTTGTAA 

ATGTGATTCAGGGCCCCCAGCACCCCTGTGTCTGCAGAGTGCCTTCAAAACTCAGCTGTT 

CCGGCCGGTGCCAACCTGTGAACTTCCCACCATATCCCAGAATCTGCTATTCCCCAAACC 

ACTTCCCAGTTTCCTTTCAGTAATCTTTCTGAAGGAGCCAGGACAATAGGGCCTGTTGTT

TAGTGAATTTCTTTATTATTTTCAGCCTTTAAAATGTAATTTCCATCTCTTGCAATGAAT

TTGTTTCCCTTTTTTTTGCTTCATTTTGTTTAAATTTTCAGGTATTTAGCTCCCCTTTCA

TATTATTTTTAAATTTTTTAATTACCTGTTGTAGGGGTGTTCCTCCAGAAGCAAAGAGCA

AAATTTTACTGTTGTGATGTACCT 

Sequence 876 

CTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGACGCGGGATTGA 
TCAAAAGCTTTGTAACCACAGGAAAAAATAAACTCTTCCATCCCTTAAAGAATAGAATAG 
TTTGTCCCTCTCATGGGAATTGGGCTGTATGTATATTGTTCTTCCTCCTTAGAATTTAGA 
GATACAAGAGTTCTACTTAGAACTTTTCATGGACACAATTTCCACAACCTTTCAGATGCT

GATGTAGAGCTATTGGGAAAGAACTTCCAAACTCAGGAAGTTTGCAGAGAGCAGACAGCT 

AGAGATAACTCGGGA 

Sequence 877 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAGTCTTA 
AACTGCTCTGCTCTTTAAACCAAATACATACACACATACACAGATATAGTTAGATACAGA 
TGTGTGTGCATATAAAAATATGACACTCCTTAGTAAAATATTCCTCTAGACCTGGGGTTC
ACACATCCCTCCTCCTGATCCGTGCTGGTGCCTACTCAGGCACTACTTTGCAGATTTCTC
TTCTATGAGCTAAGGTTTTTCTGAGCTAAGGTCAAGCGGTGACTTAGCAAGTTGAACGTG

TAATGAACCAAACCTGTTTTTCCATGGAACCAATAATAATTAATCTAGAGTGAGCCATTT

GGCCTCCAGAAACAAAGAGATTTCCATCACAGAGTGTTGGTGAGGGGTCATGAGTAAGGC 

GGGGGGGCAGTGAGAGCAAGCTGTTTTATTGTGAGAGTAGCAGGCAGGCTGAATGAGAAG 
GGGTAGCTGTT 

Sequence 878 

CTACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGGTACCATTCC 

GGGTCATCCGCAGAAATTCCTCATAGATGGCAACTCTGTCTACTCTCCGAGCCAGTGGCG 

AGAAGTTAGACAGGGAGTCCACCCCGGTGTGGGTGCCTGTTGGGGACAGACCTGAATGTT 

GAAACTTGACAGTCAGAAAAATAACTCTTGATGCTGCTGTTTCGGAAGAGTTGGTTGAGC

GCATCCTCAATATTCCTTTTGTTCCTCTGGTAATTGGTGGTGCCTGGCTGGGCTTTGTCC

TGGGAATATGGTAGGTTGGTGATGGTGAAATTCAGGTAGGAAGTGCCTGGGTGCCCCGCG 

TACCTCGGCCGCTCTAGAACTAGTG 

Sequence 879 

CTNCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTTTTTTTTTTTTT
TTTTTGGGGAAATAACAGGGGAGAGCAAATTTCTAAAAACTTGGGGTTTTATAGTAATTT
CTGATTTTCATGTTTAGAAAAAGAAATCACATTAAAAATATGCTTTTTTAAAATTTTGAG
ATAGGATACACTATAATATTATTGTAGTCCAGAAAATCTGTATACTATAATTCCTAGGGA 
AAAAGAGAAAATTATTAGTGTCAAAATACCTATAATTCCCACAGTTACCATATACATTTT 
TAAAAATTGTTTAAATACACAAACAATGATGATGCTGTCCTACTAGAAATGACAGGAGCN
AGAGCTTTTACCTTTCTTTCAAAATGCCTTTAACCCCTTTCATTATTNCCAAGGTTCAAA 
ATTTAAANATTCTTTTTTTT

Sequence 880 

CNCTACTATAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAATGGCGCAATCT 

CAGCTCACTGCAACCTCCACCTCCTGGGTTCAAGCAATTCTCCTGCCTCAGCCTCCTGAG 

TAGCTGGGATTGCAGGCATGTGCCACCATGCTCGGCTAATTTTTTGTATTTTTAGTAGAA 

ACGGGGTTTCGCCATGTTGGCCAGCTGGTCTCCAACTCCTAACCTCAGGTGATCCACCCG

CCTCGGCCTCCCAAAATGCATCTCTGGTCTTTAAATGCCCTTTGCTGTATATTCTATAAC 

ATCAAGTCTCAGATCTGGTTTGACCTCAGTTGGCCTCTTAATAGTTTTCCCCTATGAACA 

TTCTGGTGTCCCAGTAAGCCTGTAAGCAGCTGAGACTGGGAAACCATCTCTTATATCCCA 
CATCGTCCCATG 

Sequence 881 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCACACTGGTAAAGAGTGGCAAGGTAGC 

CTTTGTAACCAGATATATCTGATCTCAAAATCAATTTTCTTAATTTAACCCACGTCAGTC

AGTCAAATGCTAAGGCTCTTCAAGCTACACTTGGTTCCTCCACCCTCTAAAAGGTGAGAA 

CTCAAGAGAGCTGGGTTCTTTGGGACCTTATCATATTTTTCCCCTCCCTAGGCCTTGATT

TCCCATTTGGAAAATAAATCAGTGAGGGCTTTCTAGTTAAAAATGCCAGTTGAAGCCAGG 

CTTGGTGGCATATACATGTAGTTCCAGTTACTCAGGAGGCTGAAGTGGGGAGGATCGCTT 

GAGCCCAGGAGTCCAGTCCAGGCAACATTGCAAGATCTCATCTCTAAAAACTAAAAAATG 
GACCAG 

Sequence 882 

CTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTAATGAATTGAC 

AAATGGAAGAAAATTTAAGGAGAGTAGCCTAGAGAACTCATTCTAGAAACTAAATAACTT 

AAGTCAAAAATTATTTCCTATATTGCCTCAAGCCCTGCAGATAGCTTTGCTATGTTTGTG

TATTTGCACATTGCACTCCAGCCTGGGCGACAGAGACTCTGTCTCAAAAAAATAAATGGA 

ACAATCACAACAGAAACATTCCCTTATTCACTGAACATTTCAAACCCTGAAAATGTGTAA

TGAGAAATGACAAATTTTTAAAAGTTTAATTACTAAAGAAGACAAAAAATGTCTATTATG

AATAGACCAATTCTCAATTGGTAGAGGAACTTTGAACTGGAAAGGAACCCTAAAGAAATC 

TCCTGTCTACCCCTGTTATTACAGATTAGAACCCGAAAGTCCAG 

Sequence 883 

CCGCGGTGGCGGCCGCCCGGGCAGGTACfATAATTATAATGATTTCAGATAGAACATGCA 




ATTAGCCTTTTGAAATCCAACTTCTGTGCAAAATTTTAGTATCAGAAAATACGAGATTTG

CAGGGGGAAACATCAGTAAACTACCATTAATGTCAATGCCCAGTTTTGACTTTTGTTAGC

CTGACACTCCCAAACAGTTGTAGAATCCGATAGATGACTGATGGCAAAAGATTGTGAACA 

TGTGGAAGAAAATCAGTGGGATTCGGTGCTGATGAATAGGTTGCCTTCAGAGTATTATTG 

ACAGACAGCTTGTGGAACTAATTCTTTATTTTTGATGTTGTGGGAATTAACACATCAATG

GTGGTTATGGGAACTACCAATGGGTTCCTACAAT 

Sequence 884 

CTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTAATTCTAAATTATAAG 

AAAATATACATTTGCACTTATTAATATAGAAATTCATTTTGTGTATATTTAACATAGCTT

TTAAACTATTTTACATTAGCTACTTTCATTATGGTTTCTTGAACTTCTGAAAAAAATTAG 

AAATGTATTAAACTTATCAGTAACATAAAAACTTATTTTGTTTCACCTAACGAATACTGC

GTTTGTAAAAATAAATTTAATATAGAATATATTTTTAAATTAAATATTTGAATATAAAAT

AGCTCTAAGAAAGAAGCAAATTATCACTGAACATATTTCTTATTATTTCTGGCTTTGAAT

TAATACGTAACTTAAATTGGCTTAAATGATCCAGAATATTGGAGGAATATGATACTTTCA 

CATAATATACTATGAACCTGTTCATATAACTCTGGATTGGCTACCTAACCTTCTGNTTTA

ATG 

Sequence 885 

CTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAAGTCACACCCAG 

CCAGTCAATAACTGAGAAATCAAAATAAAATAATAATTTCAAAGAATTACATAAATACAG

GGCCTTTTGAGATTTTTGGCAATTGTAAACAAAAACGAATGGTTTTTACAATTCAGTGTA

ATTCTACGAATATTTATTTGGCACCCATGTTAGGCACTGAGGCTACACAGCAGTGAAATA 

GGCCTAGTTGTTCTCAACTAGAGAACATAGTTGGTTAATGTAGCTGCACTGAATTGTAAG 

CTGTTTAGAAGATAATATACCCTGAGGCTTTTTAAAGTATACTATTACTATAAGGAAGTA

AAATTATTTTATACTTATAAATTTTGTTTTGGATTATTCAACTGAATTTGGAGTGTTCAG 

AATTTTATGGGCGGTTGGGGACAAGGAAGAGGTATAATGCTATTTTTTTTCCTTTTCTTT

TTT 

Sequence 886 

CTACTATAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTTTTTT

TTTTTTTTTTTTTTAAATTTCCACCGGTGCCGAGGCCTCAGTGGAGCCTGGCTGGCGGCT

TGTTAGAGCCTGCAGCCTACCTGTCCTGCATAGGAATGAAGCCGGGAGGAGTTACATGAT 

ATGCCCTCGTTGCAGGCCGGGGACACAGCTACCGCATTCAGAGACCAGGAAACAGAGCAA 

AAGCTGTTCTCANAGTGCGGCTGAGCGAGGAGCTACAGGGGAATGGNGGGGGCCAAGCTG 

CATGGAAGATTGTCCCATTAACCTGGCTTTTTACCAGGGTGGTCCTNTCCCTAACCCCTA 

AGAATCACACCCTGCATCCAAACGGCAGCAACCCCAAAA 

Sequence 887 

CNCTACTATAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTTTTTTTTTTT



TTTTTTTTTTTAAAGCTGGGATATCTTACAGAGGAAGGAAAAATTAACCTTTTTTACTTT

CTTTCTCACTTTTTAAATCAGCCAAAGTCAAAGCCCGTTTGCCAACCTGCATGTCCATGC 

CTGTAAGCCCTTCTNTTGGCCAAGGAAGAAAGGAAGAAAGAAAAAAGAAACCCAGGGGCC 

TGTATCCCCTGATTAAACACAGCACAGCACTCCAGGCAGACATGCCCGGNGGCGGCTCCT 

TTGCACCATTGACCTCAGGCCAGACACCTCAGCGCCAACAATGGGACCTCGGCCTTCCGG 

CTAGGTTTGCCCCAGGCTGGGCAGGAAACCAGCTCGGCCGCTNTAGAACTAGGTG 

Sequence 888 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAGAGACATTGTGGCTAGCCAACCACA 

TGGTCAGCCTCAAAGTTGAGAGGCTCAGTAACCCTCCTATCCCTAGAGAATTCCAAAGTG 

TGGATGTAATTTAACCTAGGAAAGCCATTGGTGACTATCTGTGATCCTCTGGAAAGTATG 

CTATGTTGGGGTATATCTTTGCATCCAAAGCCAGAGGGGAACCACAATGGCCTAGTAAAA 

CCGGTGGGTCCTCAAATGCCCACTTAAGCCTCTGGCCTNTTGGAANTTTGACCCATAGTG 

GGCCGTTCAGCTTGATTAGAGCCGGGGAAAGAAAGAAAATATTGNCATTTTTTTTNTTGA

AAAAAAATTT 

Sequence 889 
CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTGGATCTATGCTGC

TATGGGTGGAGAATCGACATCCTTTGAAACTGGCCACAGGCAGAGCTAAGAGGATGACTA 

AAAGGTCCCTTGGGTGGGTGCTAATGAGCAGGGGCCCAGGAAAACCTCTGTCTTCCCGGA 

GAGCCCTCTTGCATGAGTTTCGGCTTTGCCAAGATTCCAGGGACTTGAGGACAGCTATTG 

AGTTATGGTTACGTGACTGCCACATTGGGGCTTGGAGGCATCTGGCAGATGGTTGGGAAT 

GGGCTGGCACCACACTAATTAGGCCACGATGATCCAGTTTGACTCAGGGAAACCCAGAAG 

TCATAGTNCTCTTTGCAGAATGACACAAGGATGTCAACATGCTTTGNTTGTGTACCTCGG 

CCCGCTCTAGAACTAAGTGGGATCC 

Sequence 890 

CCGCGGTGGCGGCCGAGGTGCATATATATATATACACACACATATATATATTTATGTATC 

TTTTAAACATATAATTACTCTCTTTAATTCATTTGGACTTTTCATCTAAACTTGCTCTGT

TTGCACAGGTCTGTTAGGGTAAGATATGTTCCTACCTTGAGAAATGTTGTGAATATCTAG 

CGAAACACCAAACATCCTCAGCTGACTAATGTGGTATCAGACTTTCTGGTTGCAAGGTAG 

GGGTGAATAAGGCAGGATGGGGTGCGGGGGTGGTGCTGGAAGAAGACATGGCATCAGGTT 

GGGTTTTGCAGGATACTGAAATTGTCTAGGGGCCTTGGCTGTGCAAAGAGCCTTTCCGTC 

Sequence 891 

ACTTAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGCGCGGGCAGGTACGCGGGATTTCT 

CAGATAGTTATGCGCAGCTCCAGGCACCAGATTCTGTGCTGGGTGCAGGCAGGACCTGGA 

GGGCGTCCTCAAGTGTTGATCTGCAGGGACTGTCTTGATCTTTCCAGCAGTGTCATTGTG 

GGCACGTGACCTGAGCTTTCTGAGCCTATTTCCGCATCTGTAAAGTGCTATCCACTTCCA 

CCTCCTGGGCTGTCGTGCAGATGTAGGAAGGAATTGCACTCACACACTCAGCATGAGACA 

GGCGCTCAGTAAAAGCCCGTCCAGGGGATATGAGATCAGTGAGGGATAGGAAAGCAAGGG 

TGGGTAGAAACAGCAAAACCCTTTCCCA 

Sequence 892 

CTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGNCAGGTACCAAGCATTG 

GACACACAAAAATACAGGCAGCTTCTTCCCTCAAGGAGGTCACAGGTGGGTGTGTCCATA 

GCAAAGCTGGGAGGAAGTTGTATGAAGGAGCCTGAAGACAATGGGGAGCTAGGGGAAAGT 

TCTGAGTAGAAAGGAACATGTGGACAAAGGTTTGAAATGATGAAGACTGATTAGGAAGTT 

CATATTATGAAGCATAATTCAAGCTTTCTCTACGATGTTCAAATCCCATCTCTCCTACTT 

ACTAGANAGGTGACATTGGGCCAAGTTACTTATCTCCTCTGCTCCTGTTTATTTGTGTTC

AAAAACAGGGACCTCTCTCACAGTGTGATTNTGAAGACTGGACAAGAAAATGGGAGGTTT 
TG 

Sequence 893 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACAGTTTGGAAGTTT 

AGGCAAAAGTCATTTCTTCCCTATATTTTGTCATGCTTATCTCCTGTCTCTTTCTGTTTT 

ACAGATTAGCAATAAACTCCTTAAAACCCAAAAGGTTTGGGCTTCTGTTCCTTTCACTTG

CAGTCAGACATGGAGTTAGTGGTAGAAGAAACAGAAGGGGTAACCTGCATGGTGACAGCT 

ACTGAGGGGATGGATAGGAAAGCAGGCTGAGTCCCTGGGGCCAGTGGTTACCAAAGCCAA 

GGAGAGGGCAAGGGGAGCCCAGTGGGCCTGG 

Sequence 894 

CTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCGGGCAGGTACATCACCCTGCTGA 

GGGACATCCAGGACAAGGTCACCACACTCTACAAAGGCAGTCAACTACACGACACATTCC 

GCTTCTGCCTGGTCACCAACTTGACGATGGACTCCGTGTTGGTCACTGTCAAGGCATTGT 

TCTCCTCCAATTTGGACCCCAGCCTGGTGGAGCAAGTCTTTCTAGATAAGACCCTGAATG 

CCTCATTCCATTGGCTGGGCTCCACCTACCAGTTGGTGGACATCCATGTGACAGAAATGG 

AGTCATCAGTTTATCAACCAACAAGCAGCTCCAGCACCCAGCACTTNTACCTGAATTTCA 

CCATCACCAACCTACCATATTCCCAGGACAAAGCCCAGCCAGGGCACCACCAATTACCAG 

AGGAACAAAAGGAATATTGAGGGATGCGCTCAACCAACTNTT 

Sequence 895 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAGGAGTAGTCTAAAACAA
GTGACTTTACTACTTATTCTTCTGCATGTCCTTACCAGCTTCTTACCTTCTTCAGGTTGA
GCATGAGATCAGCTTCACAGGGGATGGGGTCCTTAAGGGTTTTTTTCCATACTAGTTTCA
GCCTTAACAATGAGTTTTCAACCCTTAAACATGAAAAATAAATAGTGCGAAAGAGGGGAG
GATGGTAGAAATGCTTTAAAATTACCTTTTGTAAATTTTACT^

TGCCTTGCTTATCAGGGAAGTCCTACAAACAAAGAACTCCACGGCTTCTTCAAGTCTTCC 
AAGGGAACAGGGTCCCCCTGGTTCCTAAAAATCAATGGGAAGTAGGTTTTTGGTAACCAT
CTACTGGTCAANGGGNAACCATTTCTACCTGGCGGTTTATTACACCTTTGCTAGGCTTCT
TTTTTCI I I I ICAI I I I \ AAAAAATAATTTTT 
Sequence 896 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGTTCATATCCCAGTTCTA

GAATCAGTTCATTTTCTAAGGAGTCCTGGTTCCTTTTATTGGAAACCAAAATCTGGGCAC

CAGGTGTGCTCCCATTCTAGTCGTTTTCTGACCACATAACTGCTAACAAAGATGCTTCAC

TCTGGCTACACTGATGTGAACTTTGAACTTTAGCAGAAGAGCTCAGCTCTAGAGAACAAT

GAGCTCCTACATTACCTTTTTTCCTCAAAGAATAAGTAAGTCTAAGCAGAAAAAAAATAT

GCAAAGAATTTTCAGTATGAATGAAATAAGACAAACCATCAGGCTTGCTGTATTGTAAAC 

CAACACAATATAGTTATAACAGATCTGTAGAAGGGATCCTTAGAATAAGAGAGGCATTTG 

TCGGGGGGTCATCAGGGAGAATACTGGATAGNATCTT 

Sequence 897 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTTTTTTTTTTTTTTTTTT

TTTTTTTCTTCATGGCTACATCTGAACAGCTACTGAGGGATATATATGCCAACTTTGGGA

GTTGCACAGCTTTTTTGAGGCCATTTT^MNATGACTAGGGACTGTAATTTCT 

TTGGAATAGCCACAAGTTGTTGTAGCCAAGGTTTGNGNGNGTTTTTAATACAATTCTTAA

AATTTTAGTAGGCTTCTCATCTGTANATAGATTTGAAGGGGNGGGGTTGCCCCTCCACAC 

CTGTGGGGTGTTTNTCGTAAGGNGGGACCAGAGACTTAGGA 

Sequence 898 

CCGCGGNGGCGGCCNAGGTACACCAAATGGATTACAAGCAGCATCCAGCAGAAGACAGAC 

CCCCCAACCCTGCCCACCAGGGCTCACACTCTACAAAACCCTGAGGGCCTAGAAATCTGT 

AAATGCATCGNCAAGCACTGGGGCTGATTTGCAGTAATTCTCTAAGCAAGGCAAACATGA

TCTAGCTTTGAAGGCAGCATGAAGGCAGCGGGTTGGNGAGAACAATCTNTCCTTAAGAGA 

AGAAGAAACCTGGGGCGGANGGAGTTTTCCCCGG 

Sequence 899 

AGCTCCCCGCGGTGGCGGCCGAGGTACATGTTANGGTCTTGAGTTAATTGCTCTGTGGCT

GTGGATTTTTATTTGATGTTCTGATCTCTTCCTTCCAGTTTGATAAATTAGTGTAGAAAG

TGGAAGAAAAACATGCCGGCGCAGCCTGTGCGCTTTGTCAGGTTAACAGAATGGAGTCCT 

GCTCTGGCATCAGTCAGTGCTGTTGTCCGAACCCTCTGTGGCTCCTTCCTCCCTCCCTGG

GGCCCAGAGCTGCAGACGCTAGAGGGGTA 

Sequence 900 

GCTNCACCGCGGTGGCGGCCGCCCGGGCAGGTACCCTAAATGTTAAACTGAGGGATGAGT 
GAAACAATATCAGGATTAATAAATAAACACATTCTTGAATTCCATCACTTAATAGAAGTG

GCCATTTGAATGCTGGCAGGTNGGAAGAAAAGAGGAGGACAAAGAACCCCAAAAGTTTGG 

CATCATAACTACTGCCACAAG 

Sequence 901 

ATAGGGCNAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTTTTTTTTTTTTTTTTT

TGTATTTTTAGTAGAGATGGGGTTTCACCGTGTTAGCCAGGATGGTCTCGATCTCCTGAC 

CTTGTGATCTGCCTGCCTCGGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCGCG

CCTGGCCTTTCCAGGGTATTCTTTrAACGTGGTCmTTTGCCTTTTTGMm 

ATCTATCAGCATCATATACCACCACTGGAATATAAATTTGAAAGAGAGTCCTGCAGATTA 

TATACATGAATCTACTTAGGCCTAATAACCAAGCAGTCCTCAGTGGCAGATCAATGAAAA 

GTGAAACTAAAGGCAAGTGAAGGGTAGGAGAGATTGGCCAGT 

Sequence 902 

GGCAGGTACCCACCTCCTCGGTCACCCACAGAGCCACCAAAGATTCCATGTCCCAGAGCT 
TCCCATAGCAGACCTGAAAAGTCCATGACCTGAGCTTTGGCCATGGTAGTGGAGTGGAAC

AGGAAATAGTCCAGCAGAGGAGTGTGGGGGAAGGGGGCAGGAGAGGCACAAGAATAAGGG

AGACCTGGACTCTGGCTTTTTGGGAAAAGGAACCAAGCTCATAGCAATTTGGCTGATNAC

ACAATCAGATTTTTCCAGGTTAAGCTTCCTTTCTG 

Sequence 903 

CCGCGGTGGCGGCCGAGGTACCATCTACTGAATGCCCAGTTTTGATCTATTTCTAAATGG 

AGCAAACCAATTCCATCTCCTAGAGCTGGAGACTGTATCCAGGCAGTGTGTGGACAGAAC

GGACAATCTTTTCTGCCAAGGGCCTATTTGAGTGGAGCACCCCCACACGGGTTAGACGGG 

TCGGCACGGGGCTGGTGGGTGAGGAACTCAGGGGTCAGTGCAAGCTGCAGACCCTCATTT 

GGGGAACGCTCTCAGCACAATGCTCTTACAACTACAGGGTGCACTCCAAAATGGAGTTCA 

AGGAAAAAAGGCTAATGAGAAATAAAATCTGAAAAAATAACTTAAAAAGTTTTGCT 

Sequence 904 

CCGGGCAGGTACGCGGGGGCCCTTTGGATACCTGCACTCCCCATCACCGCACTCCCCATC

GTGGCACTTCCCTTGTTGCAGTTTTATGGAGTGTGCGTCTGGCTCCCCAACTAGACTTGA 

ACCGCTTGGGTGCATAACTCGGGACTTGACCATTTGCGTCTCCCTACGGCCAGCTCAGCC 

TCCGCACACAGGGACCTGCAGAGAGTGGATGTAGCCACTGCCCCAGCGTCCCTGGGCTCT 

GAAGAGAAGCCATTGCCCTTCAAGAGCCACCCTCATTTCCTGGGCACTGGTTTGGAAAAA

ACGAAGAAAAAGAGACACCCAGCTCACCTCCA 

Sequence 905 

CTCCACCGCGGTGGCGGCCGCCCGGGCAGGTACGCNGGGCAACTCATTCATGATATTGGG 

AGAAAAGCAAAGCAAAAACTGCAACAAAATCTCAAACCCTTTCTGCAGCAGCAGATGGCA 

AACAGTGATCAGAGGAGAAGGACCCTTCCAGCATTAGAAGATTTCCAAAGGCTGTTCCAG 

TAGGGGCTGTGGGCTTCTGGGAGCCCAGATGCCCCCTGATGGTATATTTGAGTTTGTGAG

GTGGAGGCCAGGTGGCAAGANACTGCNNGCCAATGTCAATGAAAAGCCTGGGAGGAAAAA 
GAGATTTCTGGGA 

Sequence 906 

AGGTACTTTGCTAACCAGCATTTTGGCTGTGTTATTGGCAGTTTTCAAATTTGAATTCTC

TTGCCATCTTTTTGAGAGTGCATAGACATTTAATTTTAAGAAATTTATAGAATTGGACTT

TTTTGTCTCTATACATTTGTAGGTCAGATGCACATTTGTTTCCTGTTTCATCTTTCTTTA 

AGAGCAAAAATGTAAAGTTTTGTATGTAGAGGATAATTGTATGATGATGATAAACTAATT

AGGTATTACAGTTTTCTAACGACAGAAATTTGTAATAATTAGGTAACTTGGTTCATATTA

AAATATTTGATACATAGGCCGGGCATGGTGACTCATGCCTGTAATCCCAGCACTTTGAGA 
GGC 

Sequence 907 

GGCAGGTACCACACCCATCTTACCCTCTTCCCTCTAGGTTCTGACATTCAGCTATCTTGG 
TGGGAGGCTGGGGAGCACTATTGGGGATGAGGGTAAGGTGGAGTTTTATAAAGCTCTCCA 
GGTGACTCAGAGACCACCCTCATTCCACCTGGTCACAAATCCCTGAATGGGAAACAGGTA
CTTTTTTTTTTTTTTTTTTTTTTTGCAAAGAATTGTAAAATTTATTGTATAAGTATTGCA

GCTTTTCANAATGTCATCATTGCCACTAATGATTACTGATACACAACAAGCAGTTTCTTC 

AGGCCTGTGGATTGGCATC 

Sequence 908 

AGGTACTTCCCTGAGCAGTCGAAGTGGATGCCCAGACCAATGGCCAGTGCTAATATCAAT 

GCAATGATCCCAATGACGATGATTGGA-AAMCTTCAATGGCAGCAGTGACAGGATCTGT 

GCAGCAACAGCATCTGCATCTGGTGCAACAGGACTTATTTTCAAATCATCAAGGCCAAAA

AGCGATCGGAATGAGAAGGGGGCTTCAACAGCAGGCGGATCATTTTCCCCCATGGTGACT 

ATTTCAGGACCTCTGACATCCGGCTCCGCCTCCACCTCTACCTCATAATTCCCGAGTCCC 

AAAAATGTAGATGGCACCACGGAAGAGATAGTAGGCCACAGTGTTACTGGCTTCCCATA 
Sequence 909 

AGGTACAGCAAATTAAACCCAATAACAGGAGGAGGAAAACACCTAATTAATATAAAAATT 
TCAAGGATATGTTAAACAAACAAACAATTACAAGAAACTCCTGTCAAGTTAAACAGAGAG 
AGAGAGAGACAGGGAGAACCACTAACAGGAAAGAGAAAAAGATATACAAGCTATCCCGCA 
GAAATTAAAAAGAGACTAAGAATATTACAAACAACTTTATTTCATTTAGATGAAATGGAC

AAATTTATTTTAAAACACAATTCGCCAAAATTGACAGAAGTGGAAGTAGAAAATCTATTT 

CTGTATTTATTAAAGATACTGAATCTATAATTAAAAATATCTTCACACAGAAAATTACAG 

TTTCAAATG 

Sequence 910 

AGGTACGATCTGAAGAAATGAAAGGCATTGAACTTTGGTGGGTAAATTGGGTCTTTTCCA

GCAAAGGTATAAATCCTTAAAAGCXDAAGATCATATTGTTTGATTTCTCTGGGCTCTCTGC 

TGGATACAGTGCCAAGTCCATAACTGTATACCCCATGGACACTCTATGTTAAATGGAGAT

TAATGTGTAAGAGGTGTTTTTTTTTTTGTTTTGTTTTGTTTTTTAATTTGGAAAAGAAGC

TTAAAGACCACAATGGGTGTGGCATTGGCTCGACCCACAGATCTGCTTAGTCTCAGACAG

GCACTTTGAACCAGTCTTTTAAAATTGCGTCACAACAAC 

Sequence 911 

AGGTACAGATCACTATGGCTTGTCTTTTCTCCTAACTAATGTAAAATTCCCAATAATTCA 

TAACTTGTATGAGGACAACAGTTGTGTGAATCTACCCTGGTCCTTCTGATNATTTTTAAT

TTTTTNATTTTTTTTTTTTTGGGGACAGAGTCGTGCTGTTATCGCCCGGGCTGGAGTGCA

GTGGCATGATCTCGGCTCACTGCAACCTCCACCTCCCAGGTTCCAGCAACTCTCCTGCCT 

CAGCTTCCCGAGTAGCTGGAATTACTGGTGCCCACTACCACACCCGGCTAATTTTTTGTA

TTTTTAGTAGAGATGGGGTTTCACCATGTTGGCCAGGCTGGTCTTGGACTC 

Sequence 912 

AGGTACAAATTGTCGTTTTTATTCCTCTTATTGGGATATCATTTTAAAAACTTTATTGGG

TTTTTATTGTTGTTGTTTGATCCCTAACCCTACAAAGAGCCTTCCTATTCCCCTCGCTGT

TGGAGCAAACCATTATACCTTACTTCCAGCAAGCAAAGTGCTTTGACTTCTTGCTTCAGT 

CATCAGCCAGCAAGAGGGAACAAAACTGTTCTTTTGCATTTTGCCGCTGAGATATGGCAT

TGCACTGCTTATA 

Sequence 913

TGGCCAGNTCAAATNACAACCCCCCAACCCCCCCCCCCCCCCCCCACAAACAGACAAGGA 

CACAGNTCACCANACAATGGATGTNCAGGNANTNGATATCAGCAGATATMTTAGNCCTNT 

AGATAGGCTAATTTNANTNAGCAAAGGAAAGAGGAGGTANCATTAGNCAGATGGGNTATT 

NACCTCTGAATTAGATGGCACTTACCCANCTTCTGGNACAGNCCTGCTGGNGGCGTCTAG 

ACTAGTGATCCCGGCTGANGATCGATTAACTATCATCCGCGACCTCAGGGGGGGCCGGAC 
CCACTTTTGTCCTTA 

Sequence 914 

CGAGGTACGCGGGACACTGGTGGGGGAGAGTCCGACGCGCCTGGCTAGGAGCGCCGACCG 

CAGGGCCTCTACGGACCTTACTAGAAAAATGAAACCTGATGAAACTCCTATGTTTGACCC 

AAGTCTACTCAAAGAAGTGGACTGGAGTCAGAATACAGCTACATTTTCTCCAGCCATTTC

CCCAACACATCCTGGAGAAGGCTTGGTTTTGAGGCTTCATGCGAGAAAGGGGAATGGGGA

ATGGCTGCTTAACGGCATGTCTTTTTTTTTTTTGAGACGGAGTCTTGCTCTGT

Sequence 915 

CGCCCGGGCAGGTACGCGGGGACTTGACTTAAACTCTGGGGCCCGGGAGGCCGCCGGTTT 
TCTCCCCGCTTGCCGGGGTGGTCCTCTTCCCTTTGTCGGACCAAAGAAGTAAACACTGTG 
TGGAGAGGGACTGCGTGTTTGGAGGGAAATGGGAATGTACCT 
Sequence 916 

CCCCGCGTCCGCTCTCTGTCGGGGTCCCCTCCATCTCGCTGCTGCTGAAGGCCGCGAGGG 

CGGCGGCGATGGCGGAGGCGGCGCTGTTGCTGCTGCCTGAGGCGGCGGCGGAGCGGGACG 

CTAGGGAAAAGCTGGCTCTCTGGGATCGGAGACCGGACACGACGGCGCCGCTGACCGACA 

GGCAGACGGACTCGGTATTGGAGCTGAAGGCGGCGGCAGAGAACTTGCCGGTGCCAGCTG 

AGCTTCCAATTGAAGACTTGTGCAGTTTAACATCCCAGTCACTGCCCATTGAACTGACTT 

CAGTAGTGCCTGAATCTACAGAAGACATTCTCTTGAAGGGCTTCACTTCCTTAGGAATGG 

AAGAAGAAAGAATTGAAACCGCACAGCAGTTTTTTCTCATGGTTTGCAAAGCTGCAAACT

CAGATGGATCAAGATGAAGGAACTAAATATAGGAGCAGTGTGATGCTATATTGAATGATG 

TAAACAGTGCTCTTCAGCATCTGGAGTC 
GGCTGTGGCCAAGAAACGCAGGGACCGCTCTCTCCCCCGGGCTTTCGAAATCTTCACAGA

CAATAAAACCTATGTCTTTAAGGCCAAGGATGAGAAGAATGCAGAAGAATGGCTCCAGTG 

CATCAACGTGGCAGTTGCCCAAGCCAAAGAAAGGGAAAGTAGAGAAGTAACCACATATCT 

GTAGGGAATTTATAAGTCAGCCATGACAATTATACACCACAGGCATTGTATTATCATTGC 

CAATGTCAAGAAAAAGAGCTAAATTTACCAAGCCATGGTTGGNTTTTTACTAAATACCAT

GGGAATTTGTTGGTCCTTTAAGAAGAAGGGCCTTAAAATGGCAGGGATTTCTTAGTNAAA

TGNCAATACTCTAACAGCTTTAGTATTGACTTTAGAATATATCTGATGCCCACAAAAATT

AAATAAAAGGGNTTNGAGGAGGTTTGCCCNAAATAAGTGNGGGGCCCGGAGGGGAA 
Sequence 918 

AGTCNCCACGCGTCCGCGGACGCGTGGGCGAGTGCCCAGTGACCCTTTACGGGGGTAGCT 

TTTACTCCGCACTCTCAGCCCCTGCCTCACCCCTCCCTCAAGGCCCGGATTGACCATTTC 

CTGCTCCAGCACTCCATCCCTGGCTGCCACCTGCTTGGGAGAGCACAGACGGCATTGGCA 

GTGATCCCTTCTTCCATTGTTCTGCCCTCTCAGAAAAGGAAGATAGAGCAGGCTGAACAT 

GTCCCAGACAGTAACTTTGGTGTAAATGCTTCCTGTTTTTCTGCCACAAGCCCTTTGGTC 

TTACCCACTACCTCAGAGCACACTGCTAAGAAAATGAAAGCCACCAATGAGCCCAGCCTG 

ACACATATGGGACTGCTCGACAGGTCCACTGTCCCACGAGCAGAAGCTGGTCACAAAGCT 
TGGGAAAT 

Sequence 919 

GGGAGTCGACCACGCGTCCGCGGACGCGTGGGCGAGTGCCCAGTGACCCTTTCACGGGGG 

TAGCTTTTACTCCGCACTCTCAGCCCCTGCCTCACCCCTCCCTCAAGGCCCGGATTGACC 

ATTTCCTGCTCCAGCACTCCATCCCTGGCTGCCACCTGCTTGGGAGAGCACAGACGGCAT 

TGGCAGTGATCCCTTCTTCCATTGTTCTGCCCTCTCAGAAAAGGAAGATAGAGCAGGCTG 

AACATGTCCCAGACAGTAACTTTGGTGTAAATGCTTCCTGTTTTTCTGCCACAAGCCCTT

TGGTCTTACCCACTACCTCAGAGCACACTGCTAAGAAAATGAAAGCCACCAATGAGCCCA 

GCCTGACACATATGGACTGCTCGACAGGTNCACTGTCCCACGAGCAGAAGCTTGTCACAA 
AGCTTGGAA 

Sequence 920 

AGTCGCCCCGCGTCCGCGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGCGGACGCG 

TGGGGGATGGATGACAGTCCACCAGAAAAAAGTTAGTGGAGCGGGGACAAGCAGGGTTGC 

AGAGTGGAAGAAAAATGTTCCTGTGAGAAGAAACTGTCCAAAGAGTNTGAAGAGAAAAGG 

GAACAGGGTGAATTTGANGCCCTACAAGAAAAACAGGAGACCATTCAACAGGAGACGCCC 

AGGGAGCAGGTGGCTTTGTGGGCCTGATGTCCAAGAAAGAAGTNCTGGTGGTAAACAGAG 

ACTTGTGGATTGCAAGCTACTGTTGTCTTTCTATTGAAA 

Sequence 921 

TCGGAGTCGCCACGCGTCCGGCCAGGCGTGCCTGGAAATCCGCTTTCGCAGCGCCCCCTC 

GTAGCCCGCCTCCGCCCGCAGAAGGCGTTCCCTGGACAGAGAAGCGGGCGCGCGGGGGCG 

GGCGCGTGGGGCCTTGCCGGAGAACCTGACTCTCCGCAGCAGCAGTGGAAGCCAGAGTGA 

CGCGTTGTGTTGAACACCAGTTTTCTGGAGCGCTGTGTGTTCTCAACAGCTGAGCAGTCT 

GTTTCTCCAATCAGGTTTCAAAGCCACTTCAACTGCACTGGCCCCTGTGGGTCACTGCTG 

CACCGCCCTGGCCCATGTGGGTCCCTGAGGAGCGACCTGCCGGGGCCACCTGGCTGGACG 

AAAAAGACACACTTTGGGACTTAAGCCGTGAGAAAAAAACTTCATCAGTAAGAAACAAGT 

CAATAGACAAGTAAAAGACTAGGAGAAAATATGCATAAAACATAAAAAGTGACTTGGATT 

CCTGATCTTGGAGTATTTAAAGAATTCCTATAACTTANAAAAGGTTTCAAGTTTTTTNAA

ATGAGCAAAANGGTTTGGGTAAA

Sequence 922 

TCCGCAGGCTGGGGGATCCCAAAGGGTGCGCTCCAGCCCCCAACCCAGGCACTGGGACTC 

TGGTGGCACCCTGGGTGGCAGGCAAGCCTTGAAATCAAGTGCACGAGCCTTGGAAAGGAG 

GACCGGGAGAGTTATGGCATTTATGAATGAAGAAGAGAAAGAGAATCACTCGGATGGGAA 

AAGTTAACTGGATTGTTCCCACCTGCATGGATCACCCGGGTAACTGCAGTGGGACCGAGG 

GGGCGAGGCTGCGGGCTGGGGGATGTGCCGGGTTTCCTTGTGTTGCCACGAACCCAGAGA 
GGGAGAGGAAGAAGATGGAAAGAAAAAGGAAAAAGGGAAGGAAAGTAAGAGGGGAGAGAG

GGGGAAAACTTGAGGATGAAAGAAGAGACAGAAGAAAGAAGAAGACCTTGAGAGAGGGGA 

GGAGAAAGGAAAGGAAGCNGGAAGGAGGAATTGGAAAGTGAAGGAAAGGGGGGAAACCAG 

GCNGAGAAAGAAAAGAGAAAGGGGGAAGGGAAAGAAGAAAGGGGAAAAGNAAGGGGGGGN

GGTTGAAAATCAACNCGAAAAAAAGAGG 

Sequence 923 

CNCGCGTCCGGCTGTGATGAATGAGGTCTAGGAAATAATTTGCATGTGTCTTGGGGGACA 

CAACAGTAACNGAGAGGAAATACATTATTACAGCAACTTGCGACGTACTAATACCTGTCA 

GTGTTGGCCCCCGTAAGGTATGTAAGGCACCTGNGANGTGCCCAGTNAGTNCCTTGGTGN 

AAGGCCAACATGTACTAGTTATGTAAGTATTGGTGTCTGCTTTAAAAAAGGAGACCCAGA 

CTTCACCTGTCCTCTTTAAACATTTGAGAACAGTGTTACTCTGAGCAGTTGGGCCACCTT 

CACCTTATCCGACAGCTGACTGTTGGATGTGTCCATTGTCGCCAGTTTGGCTGTTGCCCG

GACAGGACAGGACCTCCATTGGGCGCAGCAGCAGGTGGCAGGGGGTGTGGCTTGAGGGTG 
GGTGGCAAGCGT 

Sequence 924 

CCCCGCGTCCGCACAGATCCTTGAGCTCCGCTGCAGGATAGTACAGTTTTACCGCAGAGG 

GAATCTGGAACAGTGGAATCATGTGTCTGCCCTGTGTATTGCAGTTTGTATTGCCACAAG 

CTATATTTATACCAGTGTCACCCTTTTCTTGTAGAATATACTAATAAATCTGTGCCAACT

CTACCTTCTCACTTTTACCTCTGACGTCATTCTTTTTTTCTGAAAGAGGTAATAATTCTA 

GTTTTGATAGACTCTGAGGATTATGTGAACAGGACATTTTTCATTTGTGAATTTAATGCT

ATACTGTCAAGGTACTTGCTTGTGTCTGAACTCTAGTGCACTTATGATTTTGTAGACCCA

TGTGAAATTTAATAAGATACGTTTTTTTTCCTTTCTTTGGTGTGGTAGTGCAGCAACAGT

TTGGTCTGCATTTGTTAGAAGTTTAACTCCTAACA^.CCCAAAGACCTATTTA 

Sequence 925 

GCGTCCGACCCCAAAGGGAGGGACCACATTGCACACACTGTAAGAAATGCACTTTCCGAG 

GAAGGGGATGGGGGAGCCCGGACACCCAGAGCTCCCCGAGTTGGGGGTGCCCGTCTGGAG 

CGCCCCCGTCAGCX3CCTGGCGGTGGGAGGTGAGAGCGAGTGGTTTAAGTGCCTGATTACC 

ACCACCCGCCCCCCCCTTTGTCCAGCTGGGACACGGAATGGCCGCGGGCCTCCTCCCCCT 

CCCCTCCAGCCTCTCCACCAGCCCCTCCAGTCAACCCTCATCGCCGTGCCCCCCCAGAGC 

TAGAGAGATGGGGCCCCTGCGTGGCCCGAGGGGCAGAGCTGGGCGTCACTTCGCAAGCGT 

CCTGCCCTGCCGGGGCGCGGGGGTGGGCTCTGGGGAAGCCGGTGCGCCCCCCACGCCTNC 

GCTGCCAGTGCCTTACATTCTGGAGCGACCCCCCTCCCTGGTGCCTCCCAGCGAAGGGGG 
ACCCGC 

Sequence 926 

AGACAGCTCAAGCCTTGCCACTTCGGGCTTCTCACTGCAGCTGGGCTTGGACTTCGGAGT 

TTTGCCATTGCCAGTGGGACGTCTGAGACTTTCTCCTTCAAGTACTTGGCAGATCACTCT 

CTTAGCAGGTAGGTGCCGCAGACCCTGCGGGTTAAGAGGTGGGGTGGGGGGCAGTGCTTG 

CCAAGGCCCTAAACTGGGAGCGCTGGGTGAGGGGAACAACCCACTTTGGAGGGTTCTCTG 

AGAGATAGATACACCCCATATCCTGGGCCCAGCTCGTGCACACAGCTGGAGGTCCAGAGA 

CCCAGTCCCCTCTGCTCCGTCAGCCAAGTTCCAAGAAGTTGAGCAGAGACCCTTCTGGGA 

GCCTGGCGGGGTGCAGCGGCCTCCCCTGCGGGGCCTGTCACCCGGCCGGGCGCGTGCAAA 

CGCCTCTGGCGCCTNTNTGCGCGGGAGGGGAGATAAGCGTCTGAGCCAGGGAAAGCCGCC 

GGGCTAAAACCCGCCTTTTCCGGGGGCCCC 

Sequence 927 

CGCGTCCGGTCATATACAATGTCATTGTTTGGGACCCGTTTCTAAATACATCTGCTGCCT 

ACATTCCTGCTCACACATACGCTTGCAGCTTTGAGGCAGGAGAGGGTAGTTGTGCTTCCC 

TAGGAAGAGTGTCTTCCAAAGTGTTCTTCACTCTTTTTGCCCTGCTTGGTTTCTTCATTT

GTTTCTTTGGACACAGATTCTGGAAAACAGAATTATTCTTCATAGGCTTTATCATCATGG

GATTCTTCTTTTATATACTGATTACAAGACTGACACCTATCAAAGTATGATGTGAATCTG 

ATTCTGACAGCTGTCACTGGAAGCGTCNGTGGAATGTTCTTGGTAGCTGTGTGGTGGCCG 

ATTTGGAATCCTCTCGATCTGCATGCTCTGTGTTGGACTAGTGCTGGGGGTCCTCATCTC 
GGTCANGTGACTTTCTTTACTCCACTGGGAAACCTAAAGAATTTTTTCATGATGATTGGG
TGTATTCTGGGTCACTTTCTCTTGCCATAAGCTATNCTCATTCCAGTAGTTT
Sequence 928 

CCACGCGTCCGGACGCTGCGTGGAAGCGGCGGAGCCGGAGGGAAGCAAAGGACCXaTCTGC 

GCTGCTGTCCCCGCCCCGCGCGCTCTGCGCCCCTCGTCCCTGGCGGTCGCTCCGAAGCTC 

AGCCCTCTTGCCTGCCCCGGAGCTGTCCCGGGCTAGCCGAGAAGAGAGCGGCCGGCAAGT 

TTGGGCGCGCGCAGGCGGCGGGCCGCGGGCACTGGGCGCCTCGCTGGGGCGGGGGGAGGT 

GGCTACCGCTCCCGGCTTGGCGTCCCGCGCGCACTTCGGCGATGGCTTTTTCCGCCGCGG 

CGACGGCTGCGCCTCGGTCCCCGCGGCCTCCCGCTTCTTCTCTCGGGACTCCTGCTACCT 

NTGTGCCGCGCCTTCAACCTAAGACGTGGACAGTCCTGCCCGAGTACTCTGGCCCCGAGG 

GGAAGTTA 

Sequence 929 

CGACGGCCANGGCGCCTCCGAGTTGCCCGCCAGGACTCGGAGGGCCAGGAGGGCGCGACC

TGGGTGGATATTTTTGTTGGACGGCGCAACTCTTGGGGTGGCCCGGGAGCGGCGGAAACC 

GAGCGAGAGAACCAGGAGGCGCTGCGCAGAAGGAGGCCCGGGGGCTCCGAGGCGTTGAGG 

GGCTCGATCTGCGTTCTGGGGTTGGCAGCCGAGAGGCCGCGGTCCCTGAGTGCCAGAGGT 

GGTGGTGTTGCTTATCTTCTGGAACCCCATGCAGCCAGATCCCAGGCCTAGCGGGGCTGG 

GGCCTGCTGCCGATTCCTGCCCX^TGCAGTCACAGTGCCCTGAGGGGGCAGGGGACGCGGT 

GATGTACGCCTCCACTGAGTGCAAGGCGGAGGTGACGCCCTCCCAGCATGGCAACCGCAC 

CTTCAGCTACACCCTGNAGGGATCATACCAAGCAGGCCTTTGG 

Sequence 930 

cgtccgctttnagaccggaagacatttaaaagccagtttacgtacangaagcatggtttt

agattaactgcctgttggtacagctagaaacattgcagccctatcgcttatttatcttgc

atgttgctctgctttgctatgaaamtatcgttttatgataamc 

atgtattcggttatactcttagggaaaataatagaaattagagtgagagaaagtgctatg
tatattaggctittcagattttataga™^ 

aagttgaatgactacttaaatttgttgatgtgmtttmgtttt 
tmctcttctctttgtc tttgc atttaccttcccagatgttccagcctatcattttact^ 
attctc attcttgtatt attttcatc actttcttac ac aacmtatttaaacttg n cttc 
ct7tttacactggtttttggtac 

Sequence 931 

CACGCGTCCGTGGAGTATGTGCCATCTGCCAAAGTGGAGGTGGTGGAGGAGCGCCAGGCC 

ATCCCTCTAGACGAGAACGAGGGCATCTATGTGCAGGATGTCAAGACCGGAAAGGTGCGC 

GCTGTGATTGGAAGCACCTACATGCTGACCCAGGACGAAGTCCTGTGGGAGAAAGAGCTG 

CCTCCCGGGGTGGAGGAGCTGCTGAACAAGGGGCAGGACCCTCTGGCAGACAGGGGTGAG 

AAGGACACAGCTAAGAGCCTCCAGCCCTTGGCGCCCCGGAACAAGACCCGTGTGGTCAAG 

CTACCCGCGTGCCCCAC 

Sequence 932 

GGTTCGCCCACGCGTCCGCCCTGCTACCCTGGGAGAAGCCTCAGCTTTCTGGGCAGAGTT 

TGTCTCCCTGTCATTTATACTCTCAGGCTTTATACATTTACACAGTAAGTTCTCCCTCCT 

GGAGGGTTAAAAGGAATAATTTCAACAGGGTGAAGGCCTGGCACGGTGGCTCACAACTGT 

AATCCAAGGACTTTGGGAGGCTGAGGTGGGTGGATCACCTGAGGTCAGGAATTTGAGACC

AGCCTGGCCAACTTGGTGAAACCCTGTCTCTACTAAAAACAAAAATTAGCCAGGTGAGGT 

GGCACACACCTATAGCCCCAGCTACTGGGGGAGGCTGAGGCAGGAGAATTGCTTGAACCT 

GGGAGGCAGAGGTTACAGTGAGCTGAGATGGCACCACTGCACTCCAGCCTAGGTGACAAA 

GCAGCAAGACGCATTCTNAAAACAAAACANCAACAACAACAAAAAACGGGAAAACA

Sequence 933 

CNCGCGTCCGGTCCAGTGTCATCTCCTGGGTTTTCTCTGCTCTTTTATTTGGTGATCCTG
GTTCTTTCGGCCGTTCACGTCATTGTGTGCACCTCAGCTGAAAGTTCGTGCTACTTCTGT 
GGCCTCTCGTGGCTGGCGGCAGGTGGGGTGATGGTGCTGGCCTCGGCGCTGCTGTGTGTG 
ATTGTGTCTGTTCTGACCAACGTGCTCGTGGGTGGAAACACCCCAAGGAAGAACCCCATG 
CATCCCAGCTCAAGGTGGTCAGAGCTAGACCTTCTTATTCTGTTGGGGACGGCGGGCCAC
GTCTTGAGCCTGGGCGCCAGCAGCTTCGTGGAGGAGGAGCACCAGACCTGGTACTTCCTT 
GTGAACACCCTGTGTCTAGCTCTGAGCCAAGAAACCTACAGAAACTACTTT 
Sequence 934 

TCGCCCCGCGTCCGGTTATTTTACCCAGAAGCCGGATAGAGAAAATATTACAGAGAAAAT 

CACATATCACATGGGCTCGAAAGATGTAGAGGTTTTTGACAAATGAAGAACAACCATAAC 

AGGTAGAGGGAACACCATGAACCAGGGCATGAAACTGAAAGTGCATAACATATTCTAGAG 

AGAGAAGGGTGTGGGCATGAGTTAGGGCTGGAAAAACAGGTTGGAAACAGATAAGTAAGG 

GTCTCAAATGCAATGTCAAAGAGCTTGCAGTTTATTTTCCAGGCAATGAGTAGGCAGCCA

AAAAAAAAAAGTAAGGATGTTTTTTTTTTTTTTCCCATGGCATCATATTTAAGAGGATGG

ATTTAAATTGTGTGAGACCAAAGCATAGAGACTAGATAAGAGGGCGATCATTATTTCAAA 

AAGAAATAATGAAGATCCAATGAAGGAAGTGGGAAATTAAAATAGGGGAAGAGAGGTA 

Sequence 935 

CCGTCCGGTTTTTTGTCTCAGAGTTCTTCAGGCTGTACAGGAAATGTGGTGCCGGCATCT 

GCTTCTGACGGAGTCTCACTCTGTCGCTCAGGCTGGAGTGCAGTGGCATGATCTCGGCTC 

ACTGCAACGTCCGCCTCCTGGGTTCAAGCCATTCTTCTGCCTCAGCCTCCCGAGTAGGTG 

GGACTACAGTGGCCATGTGTCTGAGATCTAACCAAGGGAACATGGGTGGAACTGATGTAA 

GCCACTTTGACACCACAAAACCTCCCATGGGTTCTCTCTCTTCTTCTCTGGTGTACTTGT 

TGGATGGAGAAGATGCTGAGAAATAGTGGGAAGTCCTAGGGGATGGAAGAACCCAGGATT 

CTGAATACTCCATTGGACCTTACGTTTTGGAATCAGGNATGATGCTGGCCTTCATAAAAT 

GAGTTATGGAGAAAGTCCCTCTTTTTCTGGTGTTTGGAACANGTTTTCAGAAANGAATTT

GNTACCCAGCTTCCNTCTTTTGTACC 

Sequence 936 

CCGGTGAGCGCCCCGCCCGCTCAGCCGCCAGATCAACCTTAGCGCTGGGGCGCGGGCTGG 

GGTCGCCAGGCGGTGCGTTCTGCCCGCGCGGGGCTGAGAGTTAGGGGCCGGGGCCGGATC 

CGGGGCCGGGGGTCGCGCCGCTAGCCGCCAGCAGCGCAGTCCGGGCXJGCCACCCTGCACC 

CTCCGCCCTGTTTCTGCACCCGTCTGGGTTCTTGTGCCGCCGCCCGCAAGCCTTCCCGAG 

CTCAGGGTGGTGAGCTGCGGAGACCCGTGATAATTCGTTAACTAATTCAACAAACGGGAC 

CCTTCTGTGTGCCAGAAACCGCAAGCAGTTGCTAACCCAGTGGGACAANGCGGATTGGAA 

GAGCGGGAAGGTCCTGGCCCAGAGCAAGTGTGACACTTCCCTCTTTGACCATGAAACTCT 

NGGGTGTCTGCATTGCTGATGGC 

Sequence 937 

GTCCGCCGGCATGAGCTGTCCATGAAGGATGAGCTGCTTCAGTTCTACACCAGCGCTGCG 

GAGGAGAGTGAGCCCGAGTCCGTTTGCTCAACCCCGTTGAAGAGGAATGAGTCGTCCTCC 

TCAGTCCAGAATTACTTTCATTTGGATTCTCTTCAAAAGAAGCTGAAAGACCTTGAAGAG

GAGAATGTTGTACTTCGATCCGAGGCCAGCCAGCTGAAGACAGAGACCATCACCTATGAG 

GAGAAGGAGCAGCAGCTGGTCAATGACTGCGTGAAGGAGCTGAGGGATGCCAATGTCCAG 

ATTGCTAGTATCTCAGAGGAACTGGCCAAGAAGACGGAAGATGCTNTCCGCCAGCAATGA 
GGAGATCACACA 

Sequence 938 

CCCGCGTCCGGAATTCCAGTTGTGGATGAAGGAAATGGTGTTATGACTGCCTCAAGGTTT 

TGTAGCAAGTCATAGGGAACCAAAAGAGGAATCTTGTTTTCCTCAGAGGTCATGCCAACT 

CCAACTCCCGTTCCCTAAACTGTCTCTGAGCCATAGACTAGTAATGGACTCTTCAAGCTC 

TACCATTAGGTATCTTTTAAAGAAAGCTGGTTATTACTATTTATTCATTTTTTTCTCTTC

TGTGCAGTGCAAAAGATATGAAACATCGGCTAGGTTTCCTGCTGCAAAAATCTGATTCCT 

GTGAACACAATTCTTCCCACAACAAGAAGGACAAAGTGGTTATTTGCCAGAGAGTGAGCC 

AAGAGGAAGTCAAGAAATGGGCTGAATCACTGGAAAACCTGATTAGTCATGGAATGTGGG 

CTGGCAGCTTTCAAAGOTTTCTTGAAGTCTGAATATAGTGAGGAGGAATATTGACTTCTG 

GATCAAGCTGTGNAAGAGTACAAGAAAATC 

Sequence 939 

CGTCCGGCCGGCGACGGCGGCAGTGGCGGCCCGGCCTGCAGGAGCCCGACGGGGTCTCTG 
CCATGGGGGAGTGACGCGCCTGCACCCGCTGTTCCGCGGCAGCGGCGAGACATGAGGAGA

CCCCGCGACAGGGGCAGCGGCGGCGGCTCGTGAGCCCCGGGATGGAGGAGAAATACGGCG 

GGGACGTGCTGGCCGGCCCCGGCGGCGGCGGCGGCCTTGGGCCGGTGGACGTACCCAGCG 

CTCGATTAACAAAATATATTGTGTTACTATGTTTCACTAAATTTTTGAAGGCTGTGGGAC

TTTTCGAATCATATGATCTCCTAAAAGCTGTTCACATTGTTCAGTTTCATTTTTATATTA

AMCTTGGGACTGCATTTTTTATGGTTTTGTTTCAM 

ATTACCAAACACCAGATAATTGGATCACTAAAAATTCCTGGTAGAAAAGAATTTAAAGAC 
AAA 

Sequence 940 

TCCGAAAGNGTACTGCCATGANCCGAGATAGGAGACACATAAGAGGACAGCAGAAGCCCT 

GGCCCTGGGGAGGCTTCTCGGAAGGCCTGGCTTCACAGGCAGGCCACAGAAGGATATCGC 

GGGCACCGTGCACCCAAAGCAAGATAGTGGCTTCCCTTTTATATCCAATCTAATCCTGAT

TGGATGTCCCTGAGGCCCCTGCTGGAAACAGCCATAGGAGAGGGCCCATGGCAGTAGGGG 

AAAGAAGGAAGAAATTCCCTGCAACAAAACTTCAGCTAAACTTTGATTTGTGTATTGTTT

ACATAATAATTTTAAAGGGTACATAATGTGTAAAGAGTTTGGATAGAACCTCTCTTCATA

CTATGGTTTTCGTAMGGATCTGTTGTTGTTACGGATTCATTTTTTCCCTCTAT^ 

AAAGAGCAGCAGAGTTGTCTTCTCAAAACGGCTGCCAAGCTCTGCTTCTTGGGAAGAT

Sequence 941 

CCCGGTCGTGCGGGTCGGGCGCGGGCGGGCGCGGCGGCAGTGGCGCTNTCAGGTGATTGA 

CTGGCCAGCTGCCTGAAGGAGCGCCAGGTCCTCCTTGCTGGCAGGTGGCGAAGCCCATTG 

GGGCGGCGGTGCAGACCCGCGGCGGCNGCTGCGGCGGTCTGGCTCGGGAGGCGTTCCTGG 

GGCCAAGGCCATGGCCCCGCGGCTGCAGCTGGAGAAGGCGGCCTGGCGCTGGGCGGAGAC 

GGTGCGGCCCGAGGAGGTGTCNCAGGAGCACATCGAGACCGCTTACCGCATCTGGCTGGA 

GCCCTGCATTCGCGGCGTTGTGCAGACGAAACTGCAAAGGAAATCCGAATTGCTTGGTTG 
G 

Sequence 942 

CACCCACCCCAGATGCCGCCTGGCACCAAGCGCAGCCGCCAGCTGCCGCACTTTCCACTT 

GTATTGATCACCTATNANNCCCGCGCANAACGGCTACGNCCGAGCGGACCGCGGCCAGCG 

CGCCAGCCCTTGGCACNCCCTNGGAGCAGAAAGGGCTCCGGGAGGAAACTCCTTGGGAGC 

GCCCTGTCCGGANTGCCCTTTGCTCTCTGCAGTGTGATTTCTTTCTGTTCTGGGAGGAGG 

AGGAGGAGGANGAAGAGGAGGANGAGGNAGAACGANANNCTGCCCTTCCAGAGGTTGGTG 

AGGGAGATCGCGCATGGATTTNAAAACCNACCTGAGGTTTCAGAGCGCAGCCATCGGTTG 

CNCTGCANGAGGCTAGCGAAGCGTACCTGGTGGNTCTGTTCGAAAGA 

Sequence 943 

GTCCGGTTTTGAAACAGAAATGTAGGCATTAGACTTCCTGGGCGGCAGACAAACCAAAGA 

GCGGAAATTCATGCAGCCTGCAAAGCCATTGAACAAGCAAAGACTCAAAACATCAATAAA 

CTGGTTCTGTATACAGACAGTATGTTTACGATAAATGGTAAGCTTTCACATTTGATTTCT

TCTGTTTTTCCAGTAACTGTGAAGGGAAATTGGTAGGAGGTGTTGTAACAGGGCAGGACC 

CAAATGGGAACGGGGGGATGACATTGGTTTGTCAGGTACCGAGCAAAGAGTGAGGATTTT

GGAGTCTCCCTTCTGCTGCTCTGATGTTTTCCACATGCTTATTTCTTTGCCAGGCACTGG 

AGATGCAGTCAAGAAGTGGGAAGTGGCTCTTACTTCTAGTCTGTGTGTGTATAAGTCACT 

TAAGATGGCCGTGTTGACTGCTTCTTTGGGAAATGCCCTGAATAGGAGCATGTAGGGGAT 

GCTTACCGAGGCTGGGGAAGG 

Sequence 944 

GCGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGCGGGCCCGACCCCGAGCCCGACC 

CCGAGCCCAAGCCCGAGCCGGAGCCCGAGCGAGACCCCGAGCCCGAGCCCGAGCGCGACC 

CCCGGTGCGGCGCGGCTACCCCGGCGGAGGCGGNGGGCGCGGGGCGCGCTCTGAGGCCCG 

GGGGATGCGCCGCCGCCTCGACCATGGGCGCCGCCGCCTCCAGGAGGAGGGCGCTGAGGA 

GCGAGGCCATGTCCTCGGTGGCGGCCAAAGTGCGAGCAGCCCGAGCGTTTGGAGAGNACC 

TGTTCCCAGAGTCACCCTGAGAACCGCAACGGCCGCAGATCACCTGCTGGCTGATGCCTA 

CTCTGGCCACGACGGGTCCCCCGAGATGCAGCCGGCCCCCCAGAACAAGCGCCGCCTGTC 
CCTCGTTCTCCAACGGCTGCTACGAGGGCAAGCCTTCTCAGAGGAAGCCCAAGCATTAGG
AAGCCCGCAGGC 

Sequence 945 

CGCGTCCGGCACGGGGGAGTCTGTGGTGGCCNGTTTACCTGGGCATCTGGCTGAGAGGAA 

GAAAGGCCAACCTGATCCTGAGGGGACCCAGACATATCCTTTGCACTGTCCCTAGAGGGG 

CGATGAGCTTTGCAGCATTAAAAAATGGTGAAGGGGGGAAATATTTTGAACCAAAGACCA 

AATGTTAGGCCGCCGTTATATTTGCAGAAGCTTTGAGAACCATGCGTATAGCCTCCTGCA 

TTCTCCCCTCTCCTAGGAGCTCTTTTGTCTCTGTCCTTACGAGGCGTCATACAGAGGCAG 

TGGGGTGGGCACAGATGAGCAGAGTGGATGGTTCGGTGGGTCCCCACGAGGGCGAGTGGT 

GGTCATATGTGATGGCACCGTGTTCACACACCCTCCTGTGTACCCCCCCAGGGTCACCCG 

AAGTCCCCACACGCTGGCTCTCCACACCCCTCCTGTTCCAGAAAGCATGTCCCG 

Sequence 946 

TCGACCNCGCGTCCGGCACTCCCTCTGGCCGGCCCAGGGCGCCTTCAGCCCAACCTCCCC 

AGCCCCACGGGCGCCACGGAACCCGCTCGATCTCGCCGCCAACTGGTAGACATGGAGACC 

CCTGCCTGGCCCCGGGTCCCGCGCCCCGAGACCGCCGTCGCTCGGACGCTCCTGCTCGGC 

TGGGTCTTCGCCCAGGTGGCCGGCGCTTCAGGCACTACAAATACTGTGGCAGCATATAAT 

TTMCTTGGAAATCAACTAATTTCAAGACAATTTTGGAGTGGGAACCCAAACCCGTCAAT 

CAAGTCTACACTGTTCAAATAAGCACTAAGTCAGGGAGATTGGAAAAGCAAATGCTTTTA 
CAC 

Sequence 947 

ACCCCGCX3TCCGCTTTTGCATCTGGATCATTTTTCTCTTGCCCCACCATGTAAGAAGTGC 

CTTTCACCTCCCACCATGAACCTGAGGCCTCCCGAGTCATGTGGAAATCGCCCCCAGCCA 

CCCCACCCCAGAGGGCTACGTCCTGGCAGAGCTGGGTTTGGTTAGTTCTGAGGGCTGAGC 

TGGCCCAGCAGCTCCAGACCTCCAGACCTTGCACTCACCTGTGAACTTGACTCTGCAAAC 

TCCTCCAAGATGCGCCCACCACACTCCCAGTGAACAACACCTACAGGAGCTTGGAGTTCT 

ATTCTCAGATACATCAGCTTCCACATTCCTGTGTGTCCCAGCTGGAGAAGCAAGAAGTCC 

CAGACCATGTGCTAAGCACACGTTGGGGTGGGGATGAAATCCAATTGGTGGTGTGTGAAT 

CCATGCTGGATTCATGAAGCTGAGGCCCAGAGGAGGAAGCTTTCTTAATCAACTTCTTAA 

CATG 

Sequence 948 

TAAAAGCCATGGTNATTTGTGCACTGTGCAGTTTCTTATTAGCAAAGGTGCCAATGTTAA 

CAGGGCTACAGCCAATAATGATCATACAGTAGTGTCGCTGGCATGTGCAGGAGGCCACCT 

GGCAGTTGTTGAGCTTCTCTTGGCTCATGGGGCTGACCCTACTCATCGACTCAAGGATGG 

TTCAACAATGCTCATTGAAGCTGCAAAGGGTGGCCATACTAATGTAGTTTCTTATCTGTT 

GGATTATCCAAATAATGTTCTGTCAGTTCCCACCACAGATGTGTCTCAGCTCCCTCCACC 

TTCTCAAGATCAGTCTCAGGTGCCACGTGTGCCAACGCATACACTTGCCATGGTTGTACC 

TNCCCAGGAACCTGACAGAACTTCACAGGAGAACTCTCCTGCCCTTTTAGGAGTGCAAAA 
A 

Sequence 949 

CCACGCGTNCGGTCGGCCTGTGCGGCGCTGCGGCGGAGCGGGCCATGGCAGTGGGGAGGG 

GGCGAGTGTAGTGCTGCGCGGGGCAGGCGGGAGGTGATCGAGAGAGGCAGGGATGGGGGC 

GCCGGAGTGGAGCGGTTGCGGCGGNCTGGGCTGCTGACTGCGCACTTGGAATAGTAGCAG 

GCGGCG GCGG CGGAACGCCAGGCAGTGTATGTTTTAACTGGAAAAAGTCTTCCATGAAAA 

CCGTCACTTTTAAMAMTAAGGTAATGCCCATTCTGTTTTTTCCTAAAAAAGACCTGAA 

AAT GGGGG GGCCGMCACATTCCTTAGGGGCCtXCGGTGGI^ATTGAAATGTGCCTTTC 

MGTTTTTCATTAMTGCNCTCCTGGCTTATTGGGCAGGACCATTCCCTTTGAACAATCC 

TGGGGGCCGGGCTGGGATTTCAACAAGAATTAGGCAATTCTTGGAATGGGCCTTCCAATA 

ACCCTGNTGGGGMTTTTTCCCNTinrrNGCCCCCAACCTTGGGGGMTTTNATTATTTNC 

MGNTTTGGGGAAGGGTTACCCTTCNGGGGGAAANGCTTAACCCAATTTTTC 

Sequence 950 

TTNGGGAGTCGCCACGCGTCCGGCCGGCGACGGCGGCAAGTGGCGGCCCGGCCTGCAGGA 
GCCCGACGGGGTCTCTGCCATGGGGGAGTGACGCGCCTGCACCCGCTGTTCCGCGGCAGC

GGCGAGACATGAGGAGACCCCGCGACAGGGGCAGCGGCGGCGGCTCGTGAGCCCCGGGAT 

GGAGGAGAAATACGGCGGGGACGTGCTGGCCGGCCCCGGCGGCGGCGGCGGCCTTGGGCC 

GGTGGACGTACCCAGCGCTCGATTAACAAAATATATTGTGTTACTATGTTTCACTAAATT 

TTTGA AGGCT GTGGGACTTTTCGAATCATATGATCTCCTAAAAGCTGTTCACATTGTTCA 

GTTCATTTTTATATTAAAACTTGGGACTGCATTTTTTATGGTTTTTGTTTCAAAAGCCAT 

TTTCTTCTGGGAAAAC 

Sequence 951 

NNTCCGGAGTCGACCNCGCGTCCGCGGCTGCTGCCTGCTCTGGAGGCAGGCTGGGCGGTG 

GCGGCCGAGACTGGCGGGGGTGGACGCCCGGGCCGGGCTGCGCCCGCTTCTTGCAGCTGT 

GMTTCCTTT GGAC AATTGATGATATTTATCATTGTGCCCAGTTTCTACAAATAAAAGAT 

GGGTGGATTATTTTCTCGATGGAGGACAAAACCTTCAACTGTAGAAGTTCTAGAAAGTAT 

AGATAAGGAAATTCAAGCATTGGAAGAATTTAGGGAAAAAAATCAGAGATTACAAAAATT 

ATGGGTTGGAAGATTAATTCTGTATTCCTCAGTTCTCTATCTGTTTACATGCTTAATTGT 

ATA TTTGT GGTATCTTCCTGATGAATTTACAGCMGACTTGCCATGACACTCCCATTTTT 

TGCTTTTCCATTGATCATCTGGAGCATMGAACAGTAATTATTTTCTTCTTTTCCAAGAG 

AACAGAAAGAAATAATGAAGCATTGGATGGA 

Sequence 952 

TCNCCCCGCGCCGGTTTTGATACAGMTGAMGTGCGTAGTATTTTCATTTTGTTTATTT 

TTGCCTTATACATATAGCAAGCCCTCAATAAATAAATATTGAATGAATGAATGAGTGAGT 

GMGMTTTGTTTA TAACA GTCTGTCATCTTGATAACACTGGAATGTCTTTGGTTCTTCC 

ACTTCATCCTTTATGTTTTAA ACTTACA CACACCATTCTTACACGTCACTAAAGGAAAAT 

ACCAGTATATATTGGCTAAAAI 1 1 1 1 1 1 1 1 I GTTGTTCAAAACTGAAACTCAAATGCCTA 

ATTGGGCTAGGGGTCCTCTTAAAGGAGGTTGATGTTTGTCAAATGGGTTATTTTTTAAAA 

GCAGTAG ATAATTG CTTATTTCAAGGCAAGTAAATGAATTTAGACTAAGCTGTTCATAGG 

ATTCATCATTTTTTCCCCTCTCCCAAAGTAATTTGTAAGCCGTAAAC 

Sequence 953 

TCGCCCCGCGTCCGTGATTTCTCAGTGTTCTCCTTAGATACCAAATACAAAGGACGAGGG 

ATCAAGCTCAGCGAAAGTATCAGGCATTTAAGGTATCAGGCAGCAATGCGGGGAAAGGTG 

AATTTTCTTCMTCAGCATAGGATGGTTAGGGMGAGCATTTATCACTTTGGTTCTTATC 

CTTCAAGCCAGGGGAAAAGCAACAGTGAGGACATCAGAGACAAAAGCATTTATAGAACTA 

ACAAACACAAACGTTTGACAAGTGAGAAAGCTTTATTAAAGCACACATACATGTCAGGGG 

GGTGGGAAACAAAAGAGCAAGTTACAGCCCGGGATCCCAAGTTATGCCTTCCATTACAAT 

TGCAATCCACACCAAATCAATCTTTGAAAACATTCCTCCATTCGGTTCATACATACAGTA 

GAAACCACTGTGGCTGCCCTTAATCCAGTGTGCTTATAGGAAATCAGTTAGCAGCTGACT 
CTGTTGAAAG 

Sequence 954 

CGTCCGGACCCTTATTAAGAATATCCCAGGAAGATGGTGATGAACAGCCTCAGTTTACTT 

TTCCACCAGATGAATTCACTAGCAAAAAAATTACAACAAAAATATTACAGCAGATTGAGG 

MCCATTGGCACTGGTGTGMCMTTAACCAGCAAATGTCCTTTTCTAATACCATTTGAA 

ACTAGACAGCTTTATTTCACATGTACAGCATTTGGCGCCTCAAGAGCAATAGTATGGTTA 

CAGAACCGACGTGAAGCCACTGTGGAGCGAACGAGAACCACAAGCAGTGTTAGGCGAGAT 

GACCCTGGAGAGTTTCGAGTTGGTCG1CTCAAGCATGAMGAGTAAAAGTTCCACGTGGC 

GAGTCACTGAT GGAATG GGCTGAGAATGTCATGCAAATACATGCAGATCGGAAATCAGTT 

CTTGAGGTTGMTTTTTAGGAGAAGAAGGAACTGGCTTGGGACCCACATTAGAGTTTTAT 
GCTCTGGTG 

Sequence 955 

ACCACGCGTNCGGGCAGAAATACGGCGGCATGTTCTGCAACGTGGAGGGCGCCTTCGAGA 
GCAAGGACGCTGGATTTCGATGCCCTCAGCGTGGGGCAGCGGGGCGCGAAGACTCCTCGG 
AGCGGCCAGGGCAGCGACCGAGGATCGGGGAGTCGGCCCGGGATCGAGGGGGACACCCCG 
CGCAGGGGCCAAGGCCGGNAAGAGAGCAGGGAGCCCGCGCCCGCCTCCCCCGCCCCCGCC 



GGGGTAGAGATCCGGAGCGCCACCGGCAAAGAGGTGTTGCAGAACCTCGGCCCCAAGGAC 

MGAGTGACCGTCTNCTTATCMGGGAGGCAGMTCGTCMTGATGATCAGTCCTT7TAT 

GCTGATATTTACATGGAAGATGGCTTAATAAAACAAATTGGAGACAATCTGATTGTTCCT 

GGAGGAGTGAAGACCATTGAAGCC 

Sequence 956 

CCCGCGTCCGCTA CTGTA CTTTGCAGTTTGATGTTTATTMCATTCTTTGGGCACCTAGC 

TACMTATMCTCMTTTTCTGTGAAAMCTATTMTCATCCTATTTTTTCCTTGTCTTT 

AATATOAGATAAATTTTATACCACTGTTTCTCAMCX)ATCTGTTGT 

CTTTTTMTTTCCMTTGTCAGAGACCMTACTTTTGTAAAATATAATTAAAAACAAACA 

TAAAAATAAACTTATTAGAAAAATGAMTAAMGAGAAATGAMTAAGAATMTTTTATT 

ATTAGATTTMCAGATCA MTATTAT TTCAATACTCAGATCAAATGTGCAATAAGACAGG 

GTTGCAAAAMTGCACACTTTTTTTATTAAAATCATTTAT^^ 

TMTATTACAGTTGCAACTTTCTGGTGNTTCTCAACTATGACCAAACAGGAGGGTACAAG 

TAAAGGAGCAATCCCAA 

Sequence 957 

GTCGACCACGCGTCCGCAGCAAAAGTGCCTGGCTGAAGGACACTGTTGACCCAAAACTGG 

TGACCCTCAACCACCGCATTGCTGCCCTCACAGGCCTTGATGTCCGGCCTCCCTATGCAG 

AGTATCTGCAGGTGGTGAACTATGGCATCGGAGGACACTATGAGCCTCACTTTGACCATG 

CTACGTCACCAAGCAGCCCCCTCTACAGAATGAAGTCAGGAAACCGAGTTGCAACATTTA 

TGATCTATCTGAGCTCGGTGGAAGCTGGAGGAGCCACAGCCTTCATCTATGCCAACCTCA 

GCGTGCCTGTGGTTAGGAATGCAGCACTGTTTTGGTGGAACCTGCACAGGAGTGGTGAAG 

GGGACAGTGACACACTTCATGCTGGCTGTCCTGTCCTGGTGGGAGATAAGTTGGGTGGCC 

AACAAGTGGATACATGAGTATGGACAGGAATTCCGCAGACCCTGCAGCTNCAGC 

Sequence 958 

GTCGACCACGCGTCCGCGCCAACTCCGGAGGCGCGGTGCTCGGCCCGGGAGCGCGAGCGG 

GAGGAGCAGAGACCCGCAGCCGGGAGCCCGAGCGCGGGCGATGCAGGCTCCGCGAGCGGC 

ACCTGCGGCTCCTCTAAGCTACGACCGTCGTCTCCGCGGCAGCAGCGCGGGCCCCAGCAG 

CCTCGGCAGCCACAGCCGCTGCAGCCGGGGCAGCCTCCGCTGCTGTCGCCTCCTCTGATG 

CGCTTG CCCTCTCCCGGCCCCGGGACTCCGGGAGAATGTGGGTCCTAGGCATCGCGGCAA 

CTTTTTGCGGATTGTTCTTGCTTCCAGGCTTTGCGCTGCAAATCCAGTGCTACCAGTGTG 

AAGAATTCCAGCTGAACAACGACTGCTCCTCCCCCGAGTTCATTGTGAATTGCACGGTGA 

ACGTTCAAGAC 

Sequence 959 

CCACGCGTCCGAGGGTGGGGAAAGGAGGAGAGGAAGAGCACTCCCTTCCCTGGCCCCTCA 

TCCAGCCTCCGGTGCTGTAAAACGCAGGCGCTGGGCCGCGGGCGGAGCTGAGGACAGGCC 

TTGGCTGGTCCCAGGATGAGCGACGAGTTTGGTTTTAGCTGGGGATTGTGCTGGCATCCT 

GCGAAGCTCCTCCCAGCCGGTCTCTCTGTGCTCGGTTGTCTTGGGGTGGGGCCCATCCGC 

CGAGGTGGGGACCGATAGGAGAAGCCGGTGGGTTGTACCCTTACACTTGTGGAGTCTCCT 

CTTGCCTCTACCTACTCCGCCTTTGTCCTTAAGGTTTTTGCAGGCCAGTGCCAAACACAC 

ACTAACTGTCCTGGCCTCTCCGTGACACAAGTCTCTTCCCAGCCTTCCTC 

Sequence 960 

CCACGCGTCCGCGGACGCGTGGGGCCGGGACAACTGGTCTTATCACGGAGGCTGGGGCCA 

NGGCAGCCCTTCGGTTCGGGTGGGCCCATGGACCCCAGTCCAACGCCGAGGGAATAGGAC 

CATCCAAAAGCGGAACCTTCGCCTCAGAAAAAGGGTGCGGGACCCCTCCTCACCGTGCGG 

TCACGCGTGGACCCTGCCAGCAGCCAGGCCATGGAGCTCTCTGATGTCACCCTCATTGAG 

GGTGTGGGTAATGAGGTGATGGTGGTGGCAGGTGTNGGTTGGTNGCTGATTCTAGCCTTG 

GTCCTAGCTTGGCTCTCTACCTACTTAGCAGACAGCGGTAGCAACCAGCTCCTGGGCGCT 

NTTGTGTCAAGCAGGCGACACATCCGTCCTNCACCTGGGGCATGTGGACCACCTGNTGGG 

CAGGCCAAGGCNNCCCCGAAGCCAACTGA 

Sequence 961 

NCCCCGCGTCCGGGAGGCTCCATGTTGTCCCCTCAGCGAGTGGCAGCAGCTGCCTCAAGA 
GGAGCAGATGATGCCATGGAGAGCAGCMGCCTGGTCCAGTGCAGGTTGTTTTGGTTCAG

AMGATCAACATTCCTTTGAGCTAGATGAGAAAGCCTTGGCCAGCATCCTCTTGCAGGAC 

CACATCCGAGATCTTGATGTGGTGGTGGTTTCAGTGGCTGGTGCCTTCCGAAAGGGCAAG 

TCCTTCATTCTGGATTTTATGCTACGATACTTATATTCTCAGAAGGAAAGTGGCCATTCA 

MTTGGTTGGGTGACCCAGAAGMCCGTTMCAGGATTTTCCTGGAGAGGGGGATCTGAT 

CCAGAMCCACTGGGATTCAAATCTGGAGTGAAGTTTTCACTGTGGAGAAGCCAGGTGGG 

AAGAAGGTTTGCAGTTGTTTCTGATGGATACCCAGGGGGCAT 

Sequence 962 

GCCCCGCGTCCGCTTCTCCGAATATAGCAACGTCCAGCAGTGTCCACACTGTGGGAACCT 

GGACTACCACTTCGTGMGCCATTTTCCTCCTTCAAAGTTCTCGAAGCTTATTGATGAAA 

GCTTTGCTTTAGTAATAGCTATTTTATTGATATTATTACTTTATTACATATCTT^ 

GGAMCATTCTGTGACATTMTTTCCTTTCTMTTTAMGGAGAGTTACTTTGTTGTATG 

TGTGCCACTAAAATAGGGGCTGCCCTTGCCCTGTCTTGATTCCCGAGTGTTAATCTGTGG 

TTTTGACCAGAGCCCAGATGGGTMTCCTGTGCATTTGGGTTGGGGGTTCACTCTTACCA 

AGMTCTTTGATGCAGCTTTAAGATGGTGGGGAGATGGGGGTTGAATTTAGGGAAAGAAT 

NTTTGTGGGTTATAAACTAAGAGCTTGATAGGAGTTGGAAGGAAACTCTTACTAAAATGT 

TMCTTTCTAAAAACCTTCTTTTANATCTTNCTTGGGCCTTTGGAAAA 

Sequence 963 

GTGTTTTGGGGATGCCTTTCCTTACCAGATTCTTCTAAAGCCCAGCTC 

AGTGGGAGATAAGGCTTCTGCCCGCGGGCTCTGCGTTCGTCCACCCGGCCCCACGTTTGC 

TGTGGACTAAACAGGAGCCACTGGACTAGAGTACACTTGACTCTCGGCTCTGCGGACCAA 

AAATTCCAGGACTAAGGAATAGCAAGGTTAGGCTGAAACAGTCCACACAGGGCTTGCGGT 

AMCGTCTTTTCAGGAGCCACTCGCCCAGTGCAGTAAGTCGTGTACTTAGTTGACTCGAG 

CGCTCCAGGGAGACGCCCGACCCTACTCTGCGCCGCCCCGGGGCACCAGCTCTGCTTCCT 

CCAGGTCCACTGAGGCAGGCACGCCCAGCTCTGGGACAGGTCAGTAAACAAGCCACGAAC 

CGCGCCAGGGATCAGAGAACCCANAGTCCCCGCCAGCTGCCGGCACAAGCCAATCGCAGC 

GCANCCAGGCGGC 

Sequence 964 

GTCTAAGGGATCCAGGTCCTGTGTCTCAGGGACCTCTGATGGGATTGAATCCAAGAGGAA 

TGCAGGGGCCTCCAGGCCCNCGGGAGAACCAGGGTCCTGCTCCCCAAGGGATGATTATGG 

GCCACCCGCCTCAAGAGATGAGAGGACCTCACCCTCCAGGTGGACTACTGGGACACGGCC 

CTCAGGAAATGAGAGGTCCTCAGGAGATCCGAGGCATGCAGGGGCCTCCACCCCAAGGAT 

CAATGCTGGGACCTCCCCAGGAATTGCGAGGGCCTCCAGGCTCACAAAGTCAGCAGGGGC 

CGCCCCAGGGCTCTTTAGGACCTCCACCCCAGGGTGGCATGCAAGGACCCCCCGGACCTC 

AGGGACAGCAGAACCCAGCAAGAGGGCCACATCCATCTCAAGGGCCAATACCATTCCAGC 

AACAGAAAACGCCTCTGCTAGGTGATGGGCCCCGGGCCCCC7TCAACCAGGAAGGACAGA 

GCACAGGCCCCCCACC 

Sequence 965 

TGCGCATGCGCGGAGCGCGGCGCGCGCGGCGGTTGGGCCGTTGGCTGTTCGGCCCTGGGA 

TCCGCCGCCACTCCGCGATCAGACCGCTCTGTGCCGCGAGCCGCCGTGAGCACTCGGATT 

CAAGCCGGCGCCAACGAGTCCGGGGGCATCGCCCGCAGCGGCCAAGCTCATGGCCGGCTG 

AGCGGGACGCCGCCTNCGCCTCAGCCACCGCCGCCGCCGCCGNCTTCTTCTCCTCAGCCG 

GCGGCGGCCCGGGCCCAGCAACCATGGCTGAAGACTACTGGGACGGGCGCCTGCGGCGAA 

CAGGAGGAGAAAGGGAGGTCGCGCGGCCTCATTCCGGGCCGCCGCCCCAGGCGCCGCCGC 

GCCGGCCCCGCGGCTCTGAGGTTGCTCGCGCGCCCCC 

Sequence 966 

TGGAAMTNTTTTGGAAAAAATTACCCTTGGGACCTTGNTTTTNAANCCCNAGGTTCCCN 
GTTNNGGCAMTAAAANAATGNNNGACCCGGGATTTNGGGNTTNNAAACCGGGGGTTTTT
AATTTCCCCNNNN CNNGGGNC CI I I 1 1 1 1 1 I NCCNCCCCCNCAAGGGGGNTTTGGGAAAN 
NAAAN CCCCCCCCI I 1 I I I I I I NGGGGGNGAAANTTCCCCGGGTNNNNGCCN I II I 1 II I 
TTTTTAAA

Sequence 967

GTCCGCGAGGCTCCGCACCAGCCGCGCTTCTGTCCGCCTGCAGGGCATTCCAGAAAGATG 

AGGATATTTGCTGTCTTTATATTCATGACCTACTGGCATTTGCTGAACGCATTTACTGTC 

ACGGTTCCCAAGGACCTATATGTGGTAGAGTATGGTAGCAATATGACAATTGAATGCAAA 

TTCCCAGTAGAAAAACAATTAGACCTGGCTGCACTAATTGTCTATTGGGAAATGGAGGAT 

AAGAACATTATTCAATTTGTGCATGGAGAGGAAGACCTGAAGGTTCAGCATAGTAGCTAC 

AGACAGAGGGCCCGGCTGTTGAAGGACCAGCTCTCCCTGGGAAATGCTGCACTTCAGATC 

ACAGATGTGAAATTGCAGGATGCAGGGGGTGTACCGCTGCATGATCAGCTATGGGTGGTG 

CCGACTACAAGCGAATTACTGTGAAAGTCAATGCCCCATACAACAAAATCAACCAAAGA 

Sequence 968 

CGTCCGGGAACTCAGCAACGGTTTCTTCATCCAGGACCCGATTGCTCTGGTGGAGAGGGG 

GGGCTGCTCCTTCXrrCTCCAAGACTCGGGTGGTCCAGGAGCACGGCGGGCGGGCGGTGAT 

CATCTCTGACAACX3CAGTTGACAATGACAGCTTCTACGTGGAGATGATCCAGGACAGTAC 

CCAGCGCACAGCTGACATCCCCGCCCTCTTCCTGCTCGGCCGAGACGGCTACATGATCCG 

CCGCTCTCTGGAACAGCATGGGCTGCCATGGGCCATCATTTCCATCCCAGTCAATGTCAC 

CAGCATCCCCACCTTTGAGCTGCTGCAACCGCCCTGGACCTTCTGGTAGAAGAGTTTGTC 

CCACATTCCAGCCATAAGTGiACTCTGAGCTGGGAAGGGGAAACCCAGGAATTTTGCTACT 

TGGAATTTGGAGATAGCATCTGGGGACAAGTGGAGCCAGGTAGAGGAAAAGGGTTTGGGG 

CCGTTGCTAGGCTGAAAGGGAAGCCACACCACTGGCCTTCCTTCCCCAGGG 

Sequence 969 

GATTGGAGGAGTCACATCCCCTCTTCAGCCGCAGCACCCCTCCCTCCCATCCTCTAGCTC 

TTCCCGCGGTGGTCGCCTCCCTCCGACCCTGCTCTCCCCTCCTGGGCCCCGCGCAAAGCC 

CCCTCTGTTCCAGCTCCCGGGCCTCGGCTGCCTCCCCGCCCTCCCATCCCTTCCTCTTCC 

CAGGGCCTGGAGCGCTCCCTTACATTCCTGAGATGCCCTTCCTCGGGGCTGTCCCCCTTT 

GCCTCCCCAGCATCCCATTTCTAGGCCTTTTTCAAGACCCTTCCAGAGCGGCCCCTTTCC 

AGCTCCCTTTCTCGTTTCCATTTCCAACTTTGCCTCTTTTGCCTCTTTGTTCACTTTGCT 

TCCMGCTCCCCTCCCTCTTTCCC1TCGTCTTACCCTGCTTTGATCTACGCAGCCCCAAA 

CTCAAGCTCCCCGCTTTCAAGGTGGTGCGAGGTTGTTGGGGGGTGCGGAAGGGCCTGCCA 

AGTCCATTTTTCGAGGGG 

Sequence 970 

GTCCGAGATCGCGAGCCGCCGGCC I 1 1 1 1 1 1 1 1 1 I I 1 1 I ATAAGATTATTAGTATAAAAN 

GGGGAGACGAGGTTAGGGCCCTGGGAAAGGTGGGAGATCAGCCAGAGACAGGTTTCCCAG 

AACAGAATGTCTGGCCTTTGTGGTGAGGAGGGACTGTGGTATGAGCCGCAGAAGCGGGCC 

AGGGGTAAACCCTCCTGTGCGTCCTTCCTTCAGCCTGGTCCTGAGGGTGACCCTTTGATC 

CTGGGTTCTCCAGGTAGGGCTGTGAGCTGTGAGTTGGATCCTTTTGGTGAAATGGTCTCT 

CTCATCTGGCCTGTCACTCAATGTGGAATAGAGTGAGTGAGTTCTATGGGTTCTAAGTCC 

TGCTCTGGMCCATAAGTMGTTATCCTCTCTGGGCTTCAGTTTTTCATGGAAAGTTGCG 

TTAAGAATCTAGTTTAAGGCCAGGCATGGTGGCTCACCGCCTTGTAATCCCAGCACTTTG 

GGGAGGCCAAGGAAGGTGGATCATGANGTCAGGAGATCGAGACCATCCTNGCTAACATGA 

TNAAACCCGTGTCTTTACTTAAAAAATAC 

Sequence 971 

CCTGCCAGTGGTGAGCACCTTCGGCCTCCAGGTGCCTTTCTTCTTCTTCGCGGCCATNTG 

CTTGGTGAGCCTGGTGTTCACAGGCTGCTGTGTGCCCGAAACCAAAGGGACGTCCNTGGA 

GCAAATCCGAGTCCTTTTTCCGCACGGGGAGAAGGTCCTTCTTGCGCTAGGTCAAGGTCC 

CCGCCTGGAGGGGGCCAAACCCCCA 

Sequence 972 

GCGTCCGCGGACGCGTGGGCGGACGCGTGGGTCAGCCTCCACCTGGAAGAGAGCTANGGG 

CCGGGCAGGCCGGGCAGCTGCCACCCCGCCCGGCCCGACGCCCCGCATGCCCCGAAGTCC 

CTGGCGCCCACCCGGCCGCGGCCCTGCGTGTGACCCGCGGGTCGATACCTGGCAGCCCCA 

GTGCTGGGGCGCCGCGGCCCTGCTCGCCCAGGAGGAGAGCGAGGGCCCCACACTGAGTCT 

CTTGAAGCCTCACGTTTCCCTGGGGGGGTGCTGCATCGTCGGGTGTCCCTCACCCCACCT 
GGGGAACCTCTGTCTTCAGGTCACCCCTTTTCAGGGGCCTGG

Sequence 973

CGTCCGGGACCCTGCTCATGGAGAACATCAGCAGCTGGCTGCTCCTTCGCTGACGCCCTG 

GGCTACGTGAACCTGCCGCTCACCTTTTTCTGCCGGGCAGAGCTGGATAGTGAGCCCGAG 

CGGGTGGCGTCCGTCCTGGAAAAGCTGAAGGAGGACTGNAACAACACTGAGAACAAAGAA 

CGGAAGTCCTTNCAGAAGGAGCTTGTGATGGCCCTACTGAAGATGGACTGCCAGGGCCTG 

GTGGTCAGACTCATCCAGGACTTTGTGCTCCTGACCACGGCTGTAGAGGTGGCCCAGCGC 

TGGCGGGAGCTGGCTGAGAAGCTGGCCAAGGTCTNCAAGCAGCAGATGGACGCCTACNAG 

TCTCCCCACCGGTGACAGGAACGGGGTTGTGGACAGCGAGGCCATGTGGAAGCCTGCGTA 

TGACTTCTTAC 

Sequence 974 

TCACCACGCGTCCGCGAAGCGTGCACCGCTGCGCCCCCGCCGGTGAGCGCGGGGAGCGCC 

GCAAGCCCAACGCCGGGGGCAGCCCCGCTCCGGTGCGCCGCCGGNCGGAGGCCTCGCCGG 

TGCAGAAAAAGGAGAAGAAGGACAAGGAGCCGGGAAAACGAGAAGGAGAAGAGTGCCCTA 

GCCCGGGAGCGCAGCCTCAAGAAGCGCCAGTCGCTGCCCGCCTCCCCACGTGCCCGCCTC 

TCTGCCAGCACCGCCTCTGAGCTCAGCCCCAAATCCAAGGCCAGGCCATCCTCTCCCTCC 

ACATCCTGGCACAGGCCTGCCTCCCCCTGCCCCAGCCCAGGGCCAGGCCACACTCTGTCT 

CCAAAGCCACCGTNCCCCCGAGGCACCACTGCATCCCCCAAGGGGCCGGGTTCGGAGGAA 

GGAGGAGGCAAAGGAGAGCCCCAGCGCCGCANGGCCCGAGGACAA 

Sequence 975 

TCCGCAGAAACGGACTTTCTCATCATGCTTTCCTATGGTGGGTATGAGGGGCCAGCTGAT 

ACCAACCAACTGGCCTGTATCTATCTATCTGGATTTGACTTGAATTTTTAAAATGTGTAT 

CGTTTAAAAAAAAAMTGTTTGCAAATTTTGCACATAGGATCTTGCACTGTTCATTTTCA 

GTGGGGTGAGTCTTCACACTAAAAACACAAGCAGAGCTCCTGGGAAAAGAGACTGGAAGT 

GGTTCAGGATAAAGAGATCCATGGTGGGCAGGGCTCTTAGGTCACAGAGCTCTAGAAGCA 

GCTGGACTTGAACCCACAATGGCTTGTGTAAATTCGTAAATTTCATGGTTTCTAGGAAAA 
GCTGCATTG 

Sequence 976 

GCGT CCGG MGMCTGTGGMCTGCGGTTGGCAGACMTTTTGTTACAAATATTAAAAGG 

AAAGATTTTGCCAATATGACCAGCTTGGTGGACCTGACTCTATCCAGGAATACAATAAGT 

TTTATTACACCTCATGCTTTCGCTGACCTACGAMTTTGAGGGCTTTGCATTTGAATAGC 

AACAGATTGACTAAAATTACAAATGATATGTTCAGTGGTCTTTCCAATCTTCATCATTTG 

ATACTGAACAACMTCAGCTGACTTTMTTTCCTCTACAGCGTTTGATGATGTCTTTGCC 

CTTGAGGAGCTGGATCTGTCCTATAATAATCTAGAAACCATTCCTTGGGATGCTGTTGAG 

AAGATGGTTAGCTTGCATACCCTTAGTTTGGATCACAATATGATTGATAACATTCCTAAG 

GGGACCTTCTCCCATTTGCACAAGATGACTCGGTTAGATGTGACATCAAATAAATTGCAG 

AAGCTACCACCTGACCCTCTCTTTCAGCGAGCTCAGGTACTAGCAACCTCAGGAATCATA 

AGCCCATCTACTTTTGCATTAAGTTTT 

Sequence 977 

NCTCCMCAATTATGGCTCATCCTTCCTTTTACTCTGTCTCACCTCCTTTAGGTGAGTAC 

TTCCTTAMTMGTGCTMACATACATANACGGAACTNGAAAGCTTTGGTTAGCCTTGCC 

TTAGGTMTCAGCCTAGTTTACACTGTTTCCAGGGAGTAGTTGAATTACTATAAACCATT 

AGCCACTTGTCTCTGCACCATTTATCACACCAGGACAGGGTCTCTCAACCTGGGCGCTAC 

TGTCATTTGGGGCCAGGTGATTCTTCCTTGCAGGGGCTGTCCTGTACCTTGTAGGACAGC 

AGCCCTGTCCTAGAAGGTATGTTTAGCAGCATTCCTGGCCTCTAGCTACCCGATGCCAGA 

GCATGCTCCCCCCGCAGTCATGACAATCAAAAAATGTCTCCAGACATTGTCAAATGCCTC 

CTGGGGGGCAGTATTTCTCMGCACTTTTAAGCAAAGGTAAGTATTCATACAAGAAATTT 

AGGGGGAAMAACATTGGTTAAATAAAAGCTATGTGTTCCTATTCAACAATATTTTT 

Sequence 978 

CCCCGCGTCCGGGTCCCCGCGACTCCCGGACTGGAGAAAACGGCTCTTGCGATGGGGCGA 
AGTCCGAGCTGCGGCGGGCGTTGGTCCGTGCAGGGAAGTGGGAATCGTTAGGTTCGTTCT 
GGACCCGCCGCCCCATGGCCCAGGCGTCTCGCTCAGGTAGCCTGCCTCCACTCGTTATCG

TGCCCCCGCTGAGGGCGCAACCCGGGGGCACTGGGGAGGAGCAGTGGGAGAGAAGTCGAA 

CGGNCGGNCTTCCGCTGGGCAGAGCTCAGCAGTACTTGGCAGCATGGGACCCAGCTTCCT 

TCCTGCTCCTGATCCAAAAGGACTTACCTNCTCTGTTGCATGAGGCAGAAGCTTTGTATA 

GCCTGGCCTCAGAGGAAAGCTTAGCTCTGGAAGTGGAGCAGCAGCTGGGCCTGGAGATCC 

AGAANCTGACTGCACAGATCCAGC 

Sequence 979 

AGGCTGNTACGAAGCGAGCTTGGGAGGAGCAGCTGGCCTGCGGGGCAGAGGAGCATCCCG 

TCTACCANGTCCCAAGCGGTGTGGCCCGCGGGTCATGGNCAAAGGAGAAGGCNCCGANAG 

CGGCTCCNCGGCGGGGCTGNTACCCACCAGCATCCTCCAAAGCACTGAACGCCCGGCCCA 

GGTGMGAAAGMCCGAAAAAGAAGAAACAACAGTTGTCTGTTTGCAACAAGCTTTGCTA 

TGCACTTGGGGGAGCCCCCTACCAGGTGACGGGCTGTGCCCTGGGTTTCTTCCTTCANAT 

CTACCTATTGGATGTGGCTCAGGTGGGCCCTTTCTCTGCCTTCATCATCCTGNTTGTGGG 

CCGANCCTGGGATGCCATCACAGACCCCCTGGTGGGCCTCTGCATCAGCAAATNCCC 

Sequence 980 

ACCCCGCGTCCGGAAAAGAAGAGTGGCCNGTTCCAGGGGTAGCTCCAAAAGAGACTGCAG 

AGCTGTCCGAGACCCTGACAAGGGAGGCCCAAGGCAACAGTTCCGCAGGAGTGGAGGCAG 

CAGAGCAGAGGCCTGTGGAAGATGGCGAGAGGGGCATGAAGCCAACAGAAGGGTGGAAAT 

GGACCCTGAACTCCGGGAAGGCTCGAGAATGGACACCCAGGGACATAGAGGCTCAAACTC 

AGAAACCAGAACCTCCAGAGTCAGCAGAGAAGCTTCTGGAATCTCCCGGTGTGGAGGCTG 

GAGAAGGGGAGGCTGAGAAGGAGGAGGCGGGGGCTCAGGGCAGGCCTCTGAGAGCCCTGC 

AGAACTGCTGCTCTGTGCCCTCCCCCCTCCCACCAGAGGACGCTGGGACTGGAGGCCTGA 

GACAGCAGGAAGAGGAAGCAGTGGAGCTTTCAAGCCCCCACCACCAGCCCCTCTGTCTCC 

CCCACCCCCAGCCCCAACTGCCCCCCAACCTTCTGG 

Sequence 981 

GCCIXGCGTCCGAAAAGAATGGGTGAACCAATCGGCCTTTGTGAATTTATTCAGTGCCTT 

CTCTGTACCAAGCACTGGGTAAGGCACTTTTGTGGAGCATTAGACAGTAACCCTCAAGGA 

GCTA GAGAA C CGGATGGGA GACATGAGCGGTAATTAACTCACTTGTTCCCCAGAGTTrCT 

ATTTGTTTTGATTTTCTTTTTCTGTGACTTATTTTCCTATTTTCTTTCCTCCATGTAATT 

TTCACTATGGCCCAACTAATATAAACACCTGGAAATTACAAGGAAAAAAAATTCTTCCTC 

TMTMCTTTCCAMTTTGTGGMTATTTATTTGTAATAGCAGTTATCAGTTATGCTTAT 

ATAGCATTAAAAAT TCTCCT CCTTTGACTACACACACAACCACAGTGTGGTTCTAATCAT 

GGAGATATCAGTAATTTTTAGTMCTGMTTTTGAGGACATTTCTNTGTTTAGCATGTAT 

GCAMCTGATATGTAATCTGAGGTTCCAAAGTCMTTTTTTTCTTTTTT 

Sequence 982 

TNGGGAGTCGACCCCGCGTCCGGTTTTTGTGAGGCAGTGAGACCTAAGGTAACCTTTATC 

AAAAGGATGGAGTTGGGAAAAGGAAAACTACTCAGGACTGGACTGAATGCGTTGCATCAA 

GCAGTGCATCCGATCCATGGCCTTGCCTGGACCGATGGGAATCAAGTTGTCCTAACTGAT 

TTGCGGCTTCACAGTGGAGAGGTCAAGTTTGGGGACTCCAAAGTCATTGGACAGTTTGAA 

TGTGTCTGTGGGTTGTCCTGGGCCCCACCTGTTGCAGATGATACACCTGTTCTACTCGCT 

GTCCAGCATGAGAAGCATGTCACTGTGTGGCAGCTGTGTCCCAGCCCTATGGAGTCAAGC 

AAATGGCTTGACGTCTCAGACTTGTGAGATTAGGAGGGATCACTACCTATCCTTCCCCAG 

GGCTGTGTGTGGCACCCAAA 

Sequence 983 

GTGTCGACCCCGCGTCCGCGCCCTGCCTGCAGTTGAGATTCAGATGCCTTCTGACAGAGT 

TCAGCCTCTTGGAGAGTCTTGGGGATTGTTGGCACCTAAACAGAATCAGNGACCCGGGTG 

CTTTGTGGCCAGCAGCACAGAATCAAACCCGCATCCCAGCATTGGGCCACCCATCTGAGG 

GAGGCCAAAATCATCACAGATGCTGCTGTGCTGCAGACAGATACATGCTAGTCCAGAGAG 

CCGCCCCTGAGATGGCTGTGAGAACCATGTGTCTAAGGCGTAAGATAAGGATGGAAGGCT 

GTCCMGTTATTTGGAAGGCCTCGGCAGCTTGGGATTAGCTTGGGAGCGCAGCGCTGCAA 

AGTGGAAAATATGAAAAGACCACACAGGCCCAAGCAGTCCAGAAACTGGGCAAAAATATT 
CTGCAGTGGGGATTT A I 1 1 1 1 1

Sequence 984

CACG CGTCCG GAGTACGGAGTTGTTCCTTTACTGGCTGAAAGATATATTCGAATTGTAAA 

GATGCTT TTTCTCATGCATTGAMTTATACATTATTTGTAGGGMTTGCATGCTTTTTTT 

I I 1 1 1 I ICTCCCGAGACAGGGTCTTGCTCTGGCGCCCAGGCTGGAGTACAGNGGCATGAT 

CTTGGCTCACTTCAGCCTTGACTTGGGCTCAAGTGATCCTCCTACCTGAGCCTTCTGAGT 

AACTGGGACTACAGGTGTGCACTCCTCGCCTGGCTAATTTTTTATTTTTTGTACAGGCAG 

GATCTTGCCACCTTGCCCAGGCTGGTCTTGAACTCCTGAGCTCATGCCATCTGCCTGCCT 

TAGTCTCCCAAAATGCTGGGATTACAGGAGTGAGCCACCATGCCCGGCTGGCAGTTGCAT 

GGMGAGAACACCThrrTTATGGCTTACCCTCTAGAATTTCTAATTTATGNGNNCTGTTGA 
AATTTTTGG 1 1 1 1 1 1 1 ACCT 

Sequence 985 

GTCGACCACGCGTCCGCTCGGCTTCCTGCTGATGGTCAGGGTTTTGGCAACTCCCCGGTG 

TGAGAGGGGTAGGGAGTGCTCCCGGCGGCGACGGGGCCGAGTTCACCAGCCGCCGGGGCA 

GTAGTCGAAGGCCCGGCGCGGCATGTCCTGGGTGCCGCGGTGCGGGCAGTGAACGCGCGC 

CGGGCGGGATGGGCCGGCGCCGGGCGCCAGAGCTGTACCGGGCTCCGTTCCCGTTGTACG 

CGCTTCAGGTCGACCCCAGCACTGGGCTGCTCATCGCTGCGGGCGGAGGAGGCGCCGCCA 

AGACAGGCATAAAGAATGGCGTGCACTTTCTGCAGCTAGAGCTGATTAATGGGCGCTTGA 

GTGCCTCCTTGCTGCACTCCCATGACACAGAGACACGGGCCACCATGAACTTGGCACTGG 

CTGGTGACATCCTTGCTGCAGGGGCAGGATGCCCACTGTCAGCTTTCTGCGCTTTCAGGC 
ACATTAACAGCA 

Sequence 986 

CGCCACGCGTCCGCGTACGCGTGGGCGCGACCGAGCGTGCGGACTGGCCTCCCAAGCGTG 

GGGCGACAAGCTGCCGGAGCTGCAATGGGCCGCGGCTGGGGATTCTTGTTTGGCCTCCTG 

GGCGCCGTGTGGCTGCTCAGCTCGGGCCACGGAGAGGAGCAGCCCCCGGAGACAGCGGCA 

CAGAGGTGCTTCTGCCAGGTTAGTGGTTACTTGGATGATTGTACCTGTGATGTTGAAACC 

ATTGATAGATTTMTMCTACAGGCTTTTCCCAAGACTACAAAAACTTCTTGAAAGTGAC 

TACTTTAGGTATTACAAGGTAAACCTGAAGAGGCCCGTGTCCTTTCTGGAATGACATCAG 

CCAGTGTGGAAGAAGGGACT 

Sequence 987 

GGTCGCCCCGCGTCCGTAGCAGTTACATCTACGAGGCTATTATGGATTGGAGGATGAGAA 

GGGAACT GCATGT ACCTCAACAAGGCGTCGGTCAACACCGCGAAGTTTGGCAGGCTTGAC 

AAGTGGAGTTTTTGAATCTATAATGGTTCAAGTTTTGAGACAGGAAGAACAGCTGAGAGC 

AAAAGAAGAAAAAAGGCTTCGGGAGCAGGAAAGAAAAGAAGCAGAAGAAGCTAGTCAAAA 

GGAAATAGAAGAATGGGAAAGAAAACTTCTAGCTCAAGCAGCTCCAACTTGTATGGAGAC 

CATGTGGGAMTTCCAGCTATTGGGCATTTCCTTTGTTTAGCTCAGCAAATTCTAAATTT 

GCCAGAAATAGTCTTTTACCGAACTGGAACCGTTGTCTTCTGATGCCTCAGTGTAATGCT 

TTTCTATCGAAAATAATGACTTCTTATTTAAGTCCTCCCCATCGCAGA 

Sequence 988 

NCCCCGCGTCCGAGTCCCCTGTCTGTGGCACCAGACACTCCCGACTGTGCGCTGACTCTC 

CCCGCCCAGCCAGCAGCCTTTTCCAGAGAGGCTGTGGTCCATAGCCTCTGTTCGTTTTCA 

CTGCAGGACCAGGCACGAMGTTMMCAAMTGAAGATTTTTTCTGAATCTCATAAAAC 

AGTGTTTGTTGTGGATCACTGCCCTTATATGGCAGAATCTTGCAGGCAGCATGTCGAGTT 

TGATATGCTGGTGAAGAATAGAACCCAAGGAATCATTCCTTTGGCCCCCATATCTAAATC 

ATTGTGGACTTGCTCAGTAGAATCTTCCATGGAATATTGTAGAATAATGTATGATATATT 

TCCTTTCAAAMGCTGGTGMTTTTATTGTGAGTGACTCTGGAGCACATGTTTTAAATTC 

TTGGACTCAAGAAGACCAAAATTTACAGGAGCTAATGGCAGCATTAGCCCGCTGTTGGGC 
CTCCTAATCCTCGGGC 

Sequence 989 

GTCGCCC ACGCG TCCGTTCGTTGTCTGATGGACCTGCTTGCAAAAGGCCAGCTCTGTTGC 
ATTCCCAATTTTTGACACCACCTCAAACACCAACGCCCGGGGAGAGCATGGAAGATGTTC 
ATCTCAATGAACCCAAACAGGAGAGCAGTGCTGATCTGCTTCAGAACATTATCAACATTA

AGAATGAATGCAGCCCCGTTTCCCTGAACACAGTTCAAGTTAGCTGGCTGAACCCCGTGG 

TGGTCCCTCAGAGCTCCCCCGCAGAGCAGTGTCAGGACTTCCATGGAGGGCAGGTCTTTT 

CTCCACCTCAGAAATGCCAACCATTCCAAGTCAGGGGCTCCCAACAAATGATAGACCAGG 

CTTCCCTGTACCAGTATTCTCCACAGAACCAGCATGTANAGCAGCAGCCACACTACACCC 

ACAAACCAACTCTGGAATACAGTCCTTTTCCCATACCTCCCCAGTCCCCCGCTT 

Sequence 990 

GTCCGGCTGGGACCTCCTCCTGTTGGGGTCCCCATGAACCCTTCCCAGTTCAACCTTTCA 

GGACGGAACCCCCAGAAACAGGCCCGGACCTCCTNCTCTACCACCCCCAATCGAAAGACA 

ATGCCTGTGGAAGACAAGTCAGACCCCCCAGAGGGGTCTGAGGAAGCCGCAGAGCCCCGG 

ATGGACACACCAGAAGACCAAGATTTACCGCCCTGCCCAGAGGACATCGCCAAGGAAAAA 

CGCACTCCAGCACCTGAGCCTGAGCCTTGTGAGGCGTCCGAGCTGCCAGCAAAGAGATTG 

AGGAGCTCAGAAGAGCCCACAGAGAAGGAACCTCCAGGGCAGTTACAGGTGAAGGCCCAG 
CCGCAGGCC 

Sequence 991 

NCGCGTCCG CTTAA ATGACTCGTTATCATTTTGCAATGAATGGAAAATCATTCTCAGTGA 

TACTGGAGCATTTTCAAGACCTTGTTCCTAAGTTGATGTTGCATGGCACCGTGTTTGCCC 

GTATGGCACCTGATCAGAAGACACAGTTGATAGAAGCATTGCAAAATGTTGATTATTTTG 

TTGGGATGTGTGGTGATGGCGCAAATGATTGTGGTGCTTTGAAGAGGGCACACGGAGGCA 

TTTCCTTATCGGAGCTCGMGCTTCAGTGGCATCTCCCTTTACCTCTAAGACTCCTAGTA 

TTTCCTGTGTGCCAAACCTTATCAGGGAAGGCCGTGCTGCTTTAATAACTTCCTTCTGGT 

GTGTTTAAATTCATGGCATTGTACAAGCATTATCCAGTCTTCCAAGTGTTACTCTGCTGT 

ATTCTATCTTTMGTMCCTAGGAGACTTTCCAGTTTCTCTTAATTTGATCTGGCAATCT 

TTTGGGTAAGTGGGTATTTANAAT 

Sequence 992 

TTTrCACTGCAGGACCAGGCACGAMGTTAAAACAAAATTGMGATTTTTTCTGAATCTC 

ATAAAACAGTGTTTGTTGTGGATCACTGCCCTTATATGGCAGAATCTTGCAGGCAGCATG 

TCGAGTTTGATATGCTGGTGAAGAATAGAACCCAAGGMTCATTCCTTTGGCCCCCATAT 

CTAAATCATTGTGGACTTGCTCAGTAGAATCTTCCATGGAATATTGTAGAATAATGTATG 

ATATATTTCCTTTCAAAMGCTGGTGMTTTTATTGTGAGTGACTCTGGAGCACATGTTT 

TAAATTCTTGGACTCAAGAAGACCAAAATTTACAGGGAGCTAATGGCAGCATTAGCCGCT 

GTTGGGCCTCCTAATCC 

Sequence 993 

CGCGTCCGGGCAGGAGCACCACTCAAGGAGCTACACCCCTTGCATCGGCTTGACCGCCTT 

ACCTCAGGGGTGCTTATGTTTGCCAAGACAGCTGCAGTCTNTGAGAGAATTCACGAGCAG 

GTTCGGGACCGGCAGCTGGAGAAGGAGTACGTGTGCCGGGTGGAAGGGGAGTTCCCCACT 

GAGGAAGTGACCTGTAAAGAACCCATCTTAGTGGTGTCTTACAAAGTAGGGGTGTGCCGT 

GTAGATCCCCGGGGCAAGCCCTGTGAGACAGTGTTCCAGAGGCTAAGCTACAATGGCCAG 

TCCAGTGTGGTACGGTGCCGGCCACTCACAGGCCGCACACACCAGATTCGAGTCCACCTT 

CAGTTCTTGGGCCATCCCATTCTCAACGACCCCATCTACAACTCAGTTGCCTTGGGGTCC 

TTCTCGAGGCCGGGGCGGCTACATTCCCAAGACAAACGAGGAGTTGCTACGGGACCTGG

Sequence 994

ACGCGTCCGCGACCGCTGGGCATGCGGGTGTTGGCGCGGTATCCCCGCCCTGCCCAGCAT 

CTGCCCCACGTTTCTTCAGGCTAAACTACCGGGATCCCGGGCTTCTTCCTAAAGTAAAAC 

TCGCTCCGGAAAGGCCAACAGTCCAGCGGCCAGACGGGCACCTGGGAACGCGGGCCTAAC 

GCGTACTGGAGACGGAGTGGCGCCCGGCACTGCGCGCCTCCTCCCCGCCGGGAGACTGCG 

TGCTAAGCTCAGCAAAGCCCCGCTGTGGAGACGGAGCCATGTCGCCCATTACCTAATGAA 

ACTGAGAAGGGAGACTCAGTCTCTCTTCTAGCCCCGAGCGCAAGCTCTGCTGGACTTGGC 

ATCGTCCGCCCTCCACGATCCCACACTCCGGGTTTTCCCCATTCCCAGCTCGGCTGCAAC 

CGAGAGACAGACGGAAGAAAC 

Sequence 995

TCCTCCTGGCCCTGTTAATGTCGGGGCCNGGCCGGGGGAGGATGGCGCCCTAGAACCCGG

CCTTGCTGGGGTAGGGGCGGGAGGGGACGGGGTGGGGACCGGCCATGTCGGAGGTGACCC 

GGAGTCTGCTGCAGCGCTGGGGCGCCAGTTNTAGGAGAGGCGCCNNACTTCGACTCTTGG 

GGCCAGCTGGTGGAGGCGATAGACGAGTATCAGATATTAGCAAGACATCTACAAAAGGAG 

GCCCAAGCTCAACACAATAATTCTGAATTCACAGAAGAACAAAAGAAAACCATAGGCAAA 

ATTGCAACATGCTTGGAAT 

Sequence 996 

CGCGTCCGGCCTGAGCCGGCGGGTCCCCTGTGTCCGCCGCGGCTGTCGTCCCCCGCTCCC 

GCCACTTCCGGGGTCGCAGTCCCGGGCATGGAGCCGCGACCGTGAGGCGCCGCTGGACCC 

GGGACGACCTGCCCAGTCCGGCCGCCGCCCCACGTCCCGGTCTGTGTCX5CACGCCTGCAG 

CTGGAATGGAGGCTCTCTGGACCCTTTAGAAGGCACCCCTGCCCTCCTGAGGTCAGCTGA 

GCGGTTAATGCGGAAGGTTAAGAAACTGCGCCTGGACAAGGAGAACACCGGAAGTTGGAG 

AAGCTTCTCGCTGAATTCCGAGGGGGCTGAGAGGATGGCCACCACCGGGACCCCAACGGC 

CGACCGAGGCGACGCAGCCGCCACAGATGACCCGGCCGCCCGCTTTCAGGTGCAGAAGCA 

CTCGTGGGACGGGCTCCGGAGCATCATCCACGGCAGCCGCAAGTACTCGGGCC7TATTGT 

CAACAAGGCGCCCCACGACTTTCAAG 

Sequence 997 

GTCCGGCCAGGAGCCAGGCCGAGCGGGAGCTGACCANGGCTTGACTCGGGTACAGAACGA 

GGCACCAGTCCCCTTGCGAACCGAAGGGCCTCGCAGTGGATGGAGGAGGCCCAGCCCTGA 

GGTCAACGCCAACCAGGCTAGCCTGGCACGGGGCCTACAGGGTGGGTAGGCGGGCGTGCC 

GCAGCCGTCCAGGGCCTTCCCTCAGGTCCCGGGCCGAGGGGCCTACGCTGCGGCCCGGCA 

ACAAGGCCCGACTCGGCCCCTCGGGACCAGAGCCCCACCCGATCGGAAGGCGGATCCTTT 

ACCAGGGCCATAGGCCAGTGACTGGGCGGGCCCCTTNGGGCCTCCCATTCGGGGCCCGGA 

CTANGGMCNAGGCCCGGNNGAGGCCCCTTGGCCTACCAGACCCTTTTNTNANGCCGACA 
GCCGNCANGGAAAGAT 

Sequence 998 

CGTCCGGCCCAGAGCCCGCAGCACGCCGCCGCCGCAGCCTAGGTCACCTCCAGCATCTAG 

CACAACGTCTGCMTGGAACAGGCGAGCTGTGAATATTTGTGGAATGCATGGGTGGACTA 

AAGACCTATCACCTCACTCTAGAATGCCCAGCATGTTGGAGCATGAGGACCAAGAACCAT 

GGTGTTCCTCCACTCATCAGAGCCGTATCATTTTGATGCATGCGCCAAGAAAGAAAATTC 

AATCATCAGACTGAAGCAATCAAACTCCAAATGGTGCTGTAAACTGAACCACACATAGAC 

ATGCCATTCTTCTAAGGACCCTTAAGATCCACCCCAGGAGGAGCGCTAGCTGCTGTTCCC 

CATTCGATGCCCCTTTTCGGCCCGGAAGTAGCCGGAAAGATTCGCCCGCCAAAATTCCCC 

TAACCAGCAAGTTAGGTGTGGCATCTTCCACAAGCANGGAGCCGTTGTAGGAAAAAGGNG 

GTCTTGGGGAAGGTTTTTCG 

Sequence 999 

CCCGCGCCGGCAGTTTCNATGGTGTGTACTAATTTGAGAAAATGAATGTGTATACATACA 

AGAGTMGTCAGATTGTTAGACTCATCCCTCAGTATTCCATATGTTTTGTGACTGATTTT 

ACAGTTCTCTCTACCTTTCTCATTTACAMMAAAAAMGAAMGAAMTTTGATTCAGC 

AATTCCTAAMGTATT GTAT TCAGTGACATCTTTGGAAACACCAGTTTCTGTTATCAACT 

TCAMTAA TAGTC MGTTTTATGTATGATCTAMGGGMMCAAGTTTGTTTTTCAATCC 

TGTGATAAT TTTTC TTTTAGAATGAGGTGTTGCAAGAAATGGAAAATTAAAAAAACCACT 

CTGTMCAATmGCTGTGCTTCTTTGATTTTTCTCTGTTm 

TATTTGTACCTTTACATATTGMTTCATGAGGAGAGGTTATGCACAGCCTAGTTATTTGA 

CATTCCAGGGGGTTTAAAAAAAA 

Sequence 1000 

CCCGCGTCCGGCGGTGGCGGTGGTGGCGGTGGCGGCGGTGGCGGCGGCGGCGAAGGGGGC 

GGAGAGGAAGGAGCGCGGCGGGACCGGGCCGGGACAGCGCGTACTTTGGGCTCCGGGAGT 

CGCTCCGCGCCCGCGGTTGTAGCAGCTGCCGCTGCAGCCATAGCAGCAGGTCAGTCATTG 

GCACCATGAACTGGAATAAAGGTGGTCCTGGCACTAAGCGAGGATTTGGCTTTGGAGGTT 

TTGCCATCAGTGCTGGGAAAAAGGAGGAACCCAAACTCCCACAGCAGTCCCACAGTGCCT 
TTGGGGCMCCAGCTCTTCTTCTGGATTTGGAMGTCAGCTCCACCACAGCTTCCTTCTT
TCTACAAMTTGGATCTAAGCGGGCCAACTTTGATGAAGAAAATGCCTATTTTGAAGATG
AGGAAGAAGATTCTAGCAACGTTTGATTTACCTTACATTCCTGCT

Sequence 1001

CCGGCCGCCGCCGCCGCCCCGCCGCCACCGCCTGGGGGTTGGTTGAGGCGGACGGCGGGG 

TCCGGGCCGGAGTACGTCGTTCCCGCTGCGCTAGGGGAAGCGGGCAGTCAGAAAAATGGG 

TAAGAAGAGTCGAGTAAAAACTCAGAAATCTGGCACTGGTGCTACAGCAACTGTGTCACC 

AAAGGAAATCTTGAACCTGACCAGTGAGCTGCTGCAGAAATGCAGCAGTCCGGCGCCTGG 

CCAGGAAGAGTGGGAAGAGT ATGT GCAGATCCGGACTCTGGTTGAGAAAATACGAAAAAG 

CAAAAAGGTCTTGTCCGTTACTTTTGATGGAAAAAGAGAAGATTACTTTCCTGATCTAAT 

GAAATGGGCCTCTGAAMTGGGGCTTCTGTCGAGGGTTTTTGAAATGGGTTAACTTCAAA 

GAAGAGGGGCTTTTG 

Sequence 1002 

GTCGACCACGCGTCCGACGCACCAAAGGGCAAATACTCGGTAGCGACTCAGAGGGAAAGT 

GGGGTCTCTCCTGGGAGAGCAGGAGGCTGCCAGAAAAGAACTCAGGTCAGGGGTGCATAG 

GCGGCTGAGGAGTGCGGGACGGGCTGAGAGTTGGGGTGCCTCCCCGCCCGCAGGTGGGTC 

CGCAGATCCCGCGGGCCGCATTGGCCCGGCTGCTGCGGGATGCCGAGGGGCTGCAGGAGC 

TGGCACTGGCGCCGTGTCACGAATGGCTGTCAGACGAGGACCTGGTGCCGGTGCTGGCGC 

GGAATCCGCAGCTGCGGGAGTGTTGGCGTTGGGCGGCTGCGGGCAACTGAGTCGCCGGGC 

GCTTGGGGCTTTGGCCGAGGGCTTGCCCACGCCTGCAGCGCCTGT 

Sequence 1003 

CGCGTCCGCTTTNCCTTCTTGGTTCCACCTCAAACATCCCTTCCGAAGTGAGGCTTTCCC
TGACTGGGGAGCATAAAGTAGCATCTCTCACATNCCATACACCCCTACAACGAATCTATG 
CAATGGCCCTGCTCTGCCATCGCCACCTGAAACCATCTCAATAAACACATTTTGGATAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGG

Sequence 1004

ACGCGTCCGTTGGCTGCGAGGAGCGCCCGAAAGGTCAGAGGAAGGAGCTGTGGGAAGCTC 

GCAGCAGGTATCGGAGCTTAAGCCAGTGGATTTGGGGGCCCTGGGCTCCCTAGCCGGCTG 

CGGTGTGAGAATGGAGTGGGCAGGAAAGCAGCGGGACTTTCAGGGCTCTGGGTTTGCAGA 

GAGCCGAAATGACCATGACTGCCAACAAGAATTCCAGCATCACCCACGGAGCTGGTGGCA 

CTAAAGCCCCTCGGGGGACTCTGAGCAGGTCTCAGTCAGTCTCTCCACCTCCAGTTCTCT 

CCCCACCAAGGAGTCCCATCTACCCGCTCAGTGATAGTGAAACCTCAGCCTGGAGGTACC 

CCAGCCACTCCAGCTCCCGGGGTGCTCCTTAAGGGACCGGCACCCCCCACT 

Sequence 1005 

NCCACGCGTCCGGCAGCGCTGCGACGGGACCGCGCGATTCCTCTCCCACGCATCTGGCCC 

GCGTTCCTGGGCCTCGGCACCGGATCCCGGCGGGGGTGTGGACCCAGGGCCCACTCTCCC 

CGGCGCGGCCAGGGCCCCCCAGCGTGCGAGCGCCTAGGGGATGCCGAGCTGCTCAAGATG 

AGGAGGTGCGCGGGGCCGGGGCGGAGCAGTCGCAGTTCCCCGCGTGTGAGCCCCCACCCA 

TCCCTGGCGCCAGCGCTTTCCCGACCACTCGGGTTCGGCTATGCGGGAGCCGNGAGGAGG 

AGGCTTGCAGTCGTGGACCTGAGACCTCGGGAGGTCATGCTGTCTTGCTTTTAAGTGGCT 

TNGGGGAAAGTGAAAGAAAACNCCNAAAATTGGAGGACTTTGCTACCAGGGACCTAACGG 
CACCAGTGG 

Sequence 1006 

ACCACGCGTCCGGGAAAAGCCCGGAAGTGCCACGGGACTTCCTGTCTAAGGAAGAGCCTC 

GTGAAGCTCCTCCACTGGGGAGTCAGTGGCCTTCGTTGTATCTGCCCCGCTTGTCCACCT 

CCTAGAGTGAATCCCCGCCTGGAGGCTGGGACACTAACCAAGAAGTGGCACATGGCATAT 

CACGGGAGCAATGTTGCCGCTGTACGGAGAGTGCTGGACCGAGGGGAGCTGGGAGCAGGT 

ACTGCCTCCATCTGANGCCGTCCTTTGAAGGGAGAACCTGGGGTAGGGTTCGAGGAGCCN 

GCGAGAACTGTGCACCTCCTCGGGAGGAGCAGCCCCCTCCTGTGCTGCTTTCCCCCTCCC 

TTCAATATGCTGGGGGCGGAGACCCTGGCCTCCAAAGTGCAATTCCGGGACCCCAAATCC 

CAGCGGACGCACCAGGCTTTAGGTGGGCGTCAAGTTGNTGTGTGCCCCCTGGCTTCTACA 
CCCCGGGACCCCCTTCCG

Sequence 1007

TCGCCACGCGTCCGGAAAMTTTATGCCTTTTTATTCATAACCCAGCTGTGGACCACTGC 

CTGAAAGGT TTGT ACAGATGCATGCCACAGTAGATGTCCACATAATAAAATTCATAGTTA 

CCMTGCAGTTTTGATATATCATTGGATTCTGTCTTTGAGTTGTAGGTTATTTCTTAGCT 

GCATGTTTTAMCTGMTTTGCATAGAGTTGTATGTTAATGTTTCAGTTAAGAGAAAAAC 

TTAAGATACATGAGTCATTACATAATGGGTATGAAATCTTTATAATCACCCTTCCACCCT 

CTATGGTGTCAGTACACATCACGTGTCATAGATACTTAAAATGTAAATGTTAACACTTTT 

CCTTCCTGCTGAGGATGTTTAGAGCCTAGTGCCAGACCCATTCATTTCCTTTTGATT

Sequence 1008

GCGTCCGGGCNGCGGAGTTTTGTCCATAACGTGGGCAACCGCGCAGCTGGAGGATGGCCT 

CACTCGGGCCTGCCGCAGCTGGGGAGCAGGCGTCGGGGGCTGAGGCGGAGCCGGGCCCCG 

CGGGGCCGCCGCCGCCGCCCTCACCGTCCTCTCTGGGGCCCCTGCTCCCCCTGCAGCGGG 

AACCTCTCTACAACTGGCAGGCGACCAAGGCGTCGCTGAAGGAGCGCTTCGCCTTCCTCT 

TCAACTCGGACTGCTGCGATGTGCGCTTCGTACTGGGCAAGTTGGCGGNGCCGCCGCCCG 

CTGGGGGCCCGCAGCGCATCCCCGCCCACCGCTTCGTGCTGGCGGCCXaGCAGACGCCGTC 
TTTG 

Sequence 1009 

gcnccccgcgtccgttagagctcaggaagttattaggtgcagcctctggagccatactca 

cgctgcagtgcataatgggaaaattaggagcattaataagaaatttcagtagtgtttgta 

aggaaaataagctacttactgagatctgtttcttctattgcatgtttgcttttgagggac 

agctt ctgtca aaagtgaaatcatcaccagaactgggcctgttaggaagaataqggtttt 

atttactttttatgtcaattaacttcaacaaaaaggccacgctggctgctgtcatgccat 

ctgggtatgcattamcattmtgatgatcagccttgaggttctatttatcttgatttgg 

ctrtataaagtttgtcagaatggtgctggaggnccagaagtgctaaggagaaagaagcta 

tgggccaagttaaagaatttgaatgcaaaggccaggnatgggagttttcataa 

Sequence 1010 

CGCCNCGCGTCCGGTGAGCCCCAGCAAGGAGATCAAGATCGTGTCTGCCTGAGGAAGCAG 

AGCCATGACAATCGGAAATCTACCAGCTCAATGTCCTGCATGTAGACTACCGGACCGTGA 

GCAATCTGATTCTGACGGGCCCACGGACGATTGTCATGGAAGTCATGGAGGAGTTAGAGT 

GCTGAGCTCCTGGGCCTCCCAGCCCTCCAGTGGCCTGTGGGTGAGGGAAGCCAGAATGAC 

ACAAAGCAATGCAAAGACAAGATTGCCATGCAAATGGATGGTTTTGGACATACGAGTCTT 

CTCCGC ACATAC ATGTCTAAAGTTGAGTTTTATACACTGGAATGTGGAAGAACCCGGGTA 

TCATATCTTTTTTAAAAAATGTCCAGTGTAGAAAACATTTGGGAAAC 

Sequence 1011 

ATTTTTCTMCATGGGTTTGMCGCTTATMCCAGTTTTATAAACCCCTTGAACACTGCA 

GTGAG7TATCAAAGCCACTGCCTGCAMGTGGATGATTTAAGATTTTACACGCATGAAAA 

TGAGTGTGCCA TCTCC TGACCAGTGCCTTTTGACTTAGGTACCCAGATGCCACTTGTCAG 

CAGCAGGATACTTTTTACAACACGAAAGCATAATTATTTTAGAAGAAGAGAGTAGAAGGG 

CAGAATAGAATTCAACTTACAGAAGCACCGGAGTAGTGTTGTGGTTGGCTGTTATCTGTC 

CCCCTGGGAGGAGGGACTGT7TTGCTCCCTTGTTTTNGATGTTAAACAGTAGCTTAAAGG 

CTTTCCCCCCCATAC CAACTT ACAGNCAAATGACAAAGAACCGGTGGNGGTTTTCAACAG 

ATTCTACAAACATGCATTTTTCCCTTCCCACTAAATGGG 

Sequence 1012 

GTCACCNCGCGTCCGCTCGTCCTCCGTGGGCACTGATGTCACCGAGGGCCCTGCTCACCC 

AGCCCCCCACACTAGGCTGTTCCATGCAAATGAGGAGGAGGAGCCAGAGAAGAAGGAGGT 

ATCGGAGCTGCGCTCTGAGCTATGGGAGAAGGAAATGAAGCTTACAGACATCCGCTTGGA 

GGCCCTCAACTCTGCCCACCAACTGGATCAGCTTCGGGAGACCATGCACAACATGCAGTT 

GGAGGTGGACCTGCTGAAAGCAGAGAATGACCGACTGAAGGTAGCCCCAGGCCCCTCATC 

AGGCTCCACTCCAGGGCAGGTCCCTGGATCATCTGCATTATCTTCCCCACGCCGCTCCCT 

AGGCCTGGCACTCACCCATTCCTTCGGCCCCAGTCTTGCAGACACAGACCTGTCACCCAT 
TGGATGGCATCAGTACTTTGTGGGTCCAAAGAGGGGAAGTGACCTTCGGGTGGGTGGTGA
AGG 

Sequence 1013 

CGCGTCCGAAGAAAATGGGATCCATGAAGAACAAGACCAAGAGCCACAGGATCTCTTTGC 

AGGGGATGGTATGAAT GCATA TGTAGCCTACAAAGTTACAACACAGACAAGCTTACCATT 

GTCTCAGMGCAMCAGTTTTGCAGGTAAAAAAGNAAAGATNTTAGTGACTTTCTTGGGT 

CTTTATTGAGAAGCTTTCCNGAGNMGCCACTCTCAGAMTTGGCTTCATATGTTCCCTC 

CCGCCCCCGGAGTAAGNAGCCCTCATAGNGGGATTGACATAAAAGTTGAAAAGNTTGNGG 

MCGGAAAGAATTCNTTCTTTCTGGCAGNAATTTTCTTTGAAAAAAACCNGAGGGGCCCG 

CTTTAGAAAMGGGTACCCTTTCAGGAGGGATTTGTTAAAATTCANTCCTACCCCATGGT 

TTAACAAGGGACTCCTTGACNGTTCAGTMGTAGGTTTCTTTGNNAAAAAAGGAAAGGAA 

GCTGCCCACCGTGCCCG TNGG GGATACCCCNAAGACAATTGANGTTGGGTGGCTTGGNTC 

CTTTNCTCAMGGAATGGTTTTCAAANCAAAAAGCCCACCMGATTGCCCNGTCCANGCC 
AAAAAAT 

Sequence 1014 

GTCGCCCCGCGTCCGCGGNCGCGTGGGGTGCTNGTCACCAGACTGCACCCTTGCCAGCAG 

CTTCGCAGCTCTCGAAGTAANTTATCGCANGATGGCCGGCGCCTCACCTAGGAGAACCAG 

GMGGCAGGCCNCGCTAGMCGACGGNATTGMTTTTACTATTGNCAAAACAATCACATT 

CAAATTCATTCCACTTAAAACCTGAAAACATTGGACCACACAA 

Sequence 1015 

AGTCGACCACGCGTCCGGGCGGAGGGAGCGTGACTGCGCTGCGCAGGGCGCTAGGAGGCA 
TTGTCGCCGCTCAGGCCCTTTTGTGAGAAGCAGACCAGCCTGGGGGCTGGCGGCAGGACA 
CCTGTGTCTGCATGCTGAAGAAGATGGGTGAGGGCGTGGCCAGAGTAGCAAGGAAGGTCA 
ACGAGACGGTGGAGAGCGGCTCTGACACTCTGGACCTGGCCGAGTGCAAGCTGGTCTCCT
TTCCCATTGGCATCTACAAGGTCCTGCGGAATGTCTCTGGCCAGATCCACCTCATCACCC 
TGGCTAACAACGAGCTTAAGTCCCTCACCAGCAAGTTCATGACCACATTCAGTCAGCTCC 
GAGAGCTCCACCTGGAGGGGAACTTCCTACACCGCCTCX)CCAGCGAGGGTCAGTGCCCTG 
CAGCACCTCAAGGCCATTGACCTGTCCCGGAAACCAAGTTCCAAGGACTTTCCT 
Sequence 1016 

CGCGTCCGCTTTCAGTGAAGAAAAGGGAATTACACATNGAATCGACACATCAGTAATACC 
GATA CAGTGAM TGGGCCTCTMTMGMTTTNAGCGNGTTTTCTGATGTGCCATTTTTT 
TTGTCI I I 1 1 I AAAMTATACCATANTTATAAAANTGGNAAATANNTTTTTGNACACCAT 
TTAMTTGACCCCCTTA NAGNACN CTTGCCGTNATGNTGAAANGCTAGACCTATNGAAGC 
TGNCTTGANGATATNTGTTTTTTTAAMAMTTTTTTACMCTNTACTTGT^ 
TAATATGCACTATAAAATATGATCNTATATCCTATTATCTATNATCTAAAAACACTTCCT 
T GGACNC ATTTANACGTAAAATTA AAAATG GGTCTTTAANGAAGANTAATGGGGAGGCCC 
CI I I II IMMCCTATGGNNCMTCTTTTTATGNCAMGGGGNGGACCATTTTATTAAAA 

Sequence 1017 

GCGTNCGCTGCGCCCGTGGGACCGGTGAAGTTCTGGCGACCCGGTACAGAGGGGCCAGGT 

GTAAGCATCTCTGAAGAGAGACAAAGTCTGGCTGAAAACTCTGGGACAACGGTTGTTTAC 

MCCCTTATG CTGC CCTTTCCATAGAGCAGCAGAGGCAGAAGCTGCCGGTATTCAAGCTT 

AGGMTCATATTTTATACTTGATAGAAMTTATCAGACAGTGGTGATTGTTGGTGAAACA 

GGATGTGGGAAGAGCACACAGATTCCTCAGTACCTTGCAGAAGCCGGCTGGACAGCTGAA 

GGAAGAGTGGTAGGAGTGACCCAGCCTCGAAGAGTGGCTGCTGTTACACATGATCTTTCT 

TNCCAAAGGTTGCAGGGAGAGTAGCTGAAGAAAGGGGTGCAGTGCTGGGCCACCAGGTGG 
GCTACTGCATCCGCT 

Sequence 1018 

AGTCGCCCCGCGTCCGGTGGGAATCTTTCNACTTCTTGATCCATCTGGGAGAGAAGACGT 

ACCATTATGTGCCCGAATTCCGAAAAGTGTCCATAGCAGCTACCATCATCTATGCCTATG 

CCTGGCTGGTTCCTCTTGCACTCTGGGGTTTCCTCATGTGGAGAAACAGCAAAGTTATGA 
ACATCGTCTCCTATTCATTTCTGGAGATTGTGTGTGTCTATGGATATTCCCTCTTCATTT
ATATCCCCACCGCAATACTGTGGATTATCCCCCAGAAAGCTGTTCGTTGGATTCTAGT 
Sequence 1019 

GGAGTCGCCACGCGTCCGGTGGCACGATCTTGGCCCACTGCAAGCTCCGCCTCCCAGGTT 

CACGCT AT TCTCC CGCCTCGGGAGCTGGGACAACAGGTGCCCGCCACCACGCTCGGCTAA 

TTTTTTGTATTTTTAGTACAGACGGAGTTTCACCGTGTTGGCCAGGATGGTCTCGATCTC 

CTGACCTCGTGATGCAC CTGCCTT GACCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCA 

CTGCGCCCGGCCMTAATTTTTTTAGTTTMGTT^ 

ACAGCAGCTGCCCAGTGCCTCTGTCCACACCCACCTCCCTGGTGTACTTGCCCCCTACAG 
CAGCAGCCCAGATCCTCCCCTGGATTAAAATTGCAACTGGTGCCCTAACCCAAAACTTGA 
GAAAAMTTCTCTATCACATCCACTCTTCTGGCATTTGTAGAATCTC 
Sequence 1020 

GTCCGGTNGATATATATATTTACCTCCTTAGTMTGCMGMGTGTTTGTGGGAAGCAGA 

GAAGCMGCAACT GTAT TTCTTGTTCTCACCTAAGCATTACTGGAGGATAAGCCACATCA 

GTCTACAAAGAGGTTTTCATACAAACATAATAAGATGTAAAATGGACCAAAAGTTGAAAG 

CAC ATTCTT GCMGTMGCACCTGTTACTCTCCAAGCMCCATGGGTTTACCATATTTGG 

GGA TTTTTT GAACACTTAGNCACTTTCTTGCTCCCNAAGGGGACNTTTACAAAAAGTGNA 

NACATTTTTGTANTG TNNCCC GTTATTAAAAAAGCTMCNTTTT 

CAAAAGGGCTTGGTNTTTTTGGACAMTTCAAAMTGGAAAMTGGATTTrCCACCGT^ 

TMGCCTCAATTTTAAGCCCCAAGGTTCCCAAAMTTTTTTAANGAAAAAAGTTATTCGG 

GTATTAGGTNGGNCCTGGGTTAAAAAACCAAGAANAAAACCATTNAAAACAAAGCCCATT 

CAAAAATTCTTNTGGAAAAAAA 

Sequence 1021 

CAGGGAGTCGACCCCGCGTCCGAGCATCTTGGGGMTTTATATTCCTTTGTGAGAAATGT 

TTTGATCATAAGCCTAGAATGATAAGTAAAAGAAATAAAGATAATTCTACTGCTTGTTCT 

CACCCGGTTACAMGCATGAGTTTGMGACAATMGTGCCTTGTCCACATTTTGCGAGAG 

ACMCAGTAAMTACTCCMAATACGTTCTTTTCATGGTCAGTGTCAGCTTGATTTATGT 

CGACATGMGTTCGGTATGGCTGT7TAAGGGAAGATGAGTGCTTTTATGCCCATAGTCTT 

GTGGAACTGAAAGTCTGGAf AATGCAAAATGAAACAGGTATCTCACATGATGCTATTGCT 

CAAGAGTCTAAACGATATTGGCAGAATTTGGAAGCAAATGTACCTGGAGCGCAGGTCTTG 

GTAATCAAATAATGC 

Sequence 1022 

CNCGCGTCCGCAAAGCAAACGACAGCTCAGGGGGCTCCAAAGACCTCATTCATAGCAGCA 

AAAGGAACTCAGGTAGTCAAAATATCAGTACACATGGGACGTGTCAGTTTAAAACAGGAG 

CCCCGGAAGAGTCATAGTCCCAGCAGTGACACATCAAAACTAGCAGCTGAAAGGGACTTG 

AATGTGACCATCAGTCTTAGTACTGATAGACCAAAGCAGCGATCACAGGCAGTAGCAAAC 

GAGAGGGCACACCCTGCCAGCACAGCAGTGNCGAAGTCTGGGGAAGCCATGGCCTTAAAC 

AA AACT AAGACTCAGAGCAAAGAAGTCAATGCAAATAAACACAAAGCCAATACGAGTCTT 

CCTTTTCCTMGTTCACTGTCAATTCAAATCGCTTAAGGAAGCAATCTATTAATGAGACA 

CCTTTGGGAAGTTTGTCAAAGGATGATGGAGCTAGAGGGGCTCATGGGG 

Sequence 1023

CNCGCGTCCGGCCAACCGCCGAGGAGCAGTGCCCTATTCAGCACAGTGCGCCAGGGCCAC 

TGGCAGATTGTTGATCTTTTACTCACCCATGGAGCTGATGTCAACATGGCAGACAAGCAG 

GGCCGCACTCCCCTGATGATGGCTGCTTCCGAAGGCCATCTAGGAACCGTGGACTTTCTG 

CTTGCACAAGGTGCCTCCATTGCTCTTATGGACAAAGAAGGATTGACAGCCCTCAGCTGG 

GCTTGTTTGAAGGGCCATCTCTCAGTAGTACGTTCTCTGGTGGATAACGGAGCTGCCACA 

GACCATGCTGACAAGAATGGCCGTACCCCACTGGATCTGGCAGCTTTCTATGGCGATGCT 

GAGGTGGTCCAGTTCCTGGTAGATCATGGGGCCATGATCGAGCACGTTGACTACAGTGGA 

ATGCGCCCTTTGGATAGGGCAGTGGGGTG 

Sequence 1024 

GTCGCCCCGCGTCCGAGAAGTCAGGGAGTGGAGGTTCTATAAGGAATTAACAGCTGAGGA 
CGGAA GGGTTTGTTTCCCGTTTGAACCTAAACGCAAGTGGAAAAGAATACTCAGAATGTA

I 1 1 1 ICTACTTTACATCTGCTGGGGAAGGAAATGTGTCAGGAAGCCGCTGCATCTGGTCA 

TTTCATCGCATCAGAATCACAGCAGACGTGGAAGATTCCATGTGGTGGGGAATAAAGAAA 

TAACTTTATGCTCTCCTGAAAAACAGCGGGAGCCTATGTGTGTGTGCGACACTGTAATCT 

CAA GGAGA TTCACTCAGAGCTGTCTCAGTCCAACTCCTGCATGACCAGATCTTCCCTTAG 

CATCTTTTCTGTGATGAAATATTATCTTGTGTTAGAGTTAGGAATAGGAACTAACCTGTA 

GGAGCATGTCCCCAAATGGACATTTGAATGGACTAACAAAAACAACTGGAAAGACTGAAT 

TTCCGACACAMGGAATGATGGGATCAAAAAGAAAGC 

Sequence 1025 

GGAGTCGACCCACGCGTCCGGTCGACAGCCTCCGCCACATCCTCCACCTCTCTTGGTCCA 

GCGAGCGTTGCCGGGCCAGGGTCAAGCGGAGGGCTCCGACGGCGCGGACGGAGCGAAGCG 

CCGAGCCATGGCGCACCAAACGGGCATNCACGCCACGGGAAGAAGCTGAAGGAATTCTTT 

GCCAAGGCACGGGCTGGCTCTGTGCGGCTCATCAAGGTTGTGATTNGAGGACCGAGCAGT 

NTCGTGCCTGGGTGCNCTTCGCAAGGGAGCCAGNTAAGGCNCGCTNGGGGATCAGGGACT 

ATTGAACAGGNGGCCCGTGGCTTGCNCACCTGCNTGGGACCGCCCCAGGCAGGCCCTGCN 

TACCTGGCTCTACCGCGCTTCGACTNACAAGAAATGGCTCAGGGGNCTTTCGAAATGGGG 

CTTCTTTCCTTCGCGCTTGGTTCCGCNCTNGAATAAACCTCCCCCCCGTGGCGGCTTGAA 

AGANTGCCTGTACCGCCGGGNCAATGCNGGGCCCCACAAGTGGAAAAAAGGGAAG 
Sequence 1026 

AGGGAGTCGACCCACGCGTCCGCTCCCGCCAGGCGCTTTCTCGGACGCCTTGCCCAGCGG 

GCCGCCCGACCCCCTGCACCATGGACCCCGCTCGCCCCCTGGGGCTGGNGATTCTGCTGC 

TTTTCCTGACGGAGGCTGCACTGGGCGATGCTGCTCAGGAGCCAACAGGAAATTAACCGC 

GGAGATCTGTTCTCCTGCCCCTAGGACTACGGACCCTGCCGGGCCCTACNTTCTCCGTTT 

ACNTACTACGACAGGGTACACCGCAGTAGCNTGCTCGCCAGTlfTCCTGTTACNGGNGGGC 

CTGCNGAGGGGGCAACCGCCCAACCAATTTTCTTACACCCTGGGGNAGGGCTTGCCGAAC 

GMTGCCTTGCTTGGGAGGGATTAGNAAAAAAAGGTTTCCCCAAAAGTTTGCCCGGCTGG 

CAAGG ATGGAA GTGTTGGGACCGAACCCAGGATGTGAAGGGGGGGTTCCACCAGAAAAAA 

AGGT TAI 1 1 1 1 CTTrTAATCTTAAAGTTTCCANTGGAACATGGTNGAAAAAAAATTTCTT 

TTTTNCGGGTGGGGGTGGTTCACCCGGGAAC 

Sequence 1027 

CGTCCGTAGTCTCTCTCGTGGCCCGGAAAAAAAAAAGAAAGAAGGTTGGGGCCAGTCACC 
CCCACATCCCTTTATGGAGGCTTCCAGATCATGGATCCTGTCACGCGCATCCCGGTGAAG 
AGAGTCCACCAACAGGCTTTGTATGGGGTCTCGCTCTGTTGCCCAGGCTGGAGTGCAGTG 
GTTCGATATGGCTCACTGCAGCCTCAGCCTCCCTGGGATCAAGTGATCCTNTCACCTCAG 

CCTCCCAAGTGGCAGGGACCGCAGGAAGGCCTGGACGACGGCCCGGACTTCCTCTCAGAA 

GAGGACCGCGGACTTAAAGCAATAAA 

Sequence 1028 

CGCGTCCAACCCCTTCCTCAGCCTGTCGGGAGCAGAGGGCAGTGGCGGTGGCCCCAAAGG 

AGGGACCGCTGACAAAGGAGCCTCAGCCAACCAGGAAAAAGGCTAAATCCACCCTTACCC 

CTCCTGACCCCCCCAAGTGGAGGGAACAGATCCTGGCCTGAGGGGTCCTAGCCTGGAGCA 

GGCGCCTGCGCCCAGACCCTGGAGAGCCTTGACCCAGAGCCTGTGCTGAGGTCCAGGGAG 

TGTGGAGAGCTCCTGGTGTCGAGGACTGAGACTGACAGGGGAGCCCCCTCCATCTGGCCC 

CCTTCCCTTTCCGCACTGTCCGCTTTGTGAGGCTCAGAGGAAGGACAGTCTGCAAGCCCG 

CCTAGGAGGTCCATCCCCAGCAAATGTTTTGGAGGTCCCCCCAGAGAGCAAAGTGGGCCA 

TGGCAAGAAGTAGGGGGTTGGTTGGACCTGTCACATGAAATGGATCAACACTTGAATGGG 
GA 

Sequence 1029 

CGTCCGGAGAAGATGGGCCTCCCGGGCTCAGACTCACAGAAAGAGCTGGCCTGACCACCA 
GGCACCTCACTGGCACTGCTGACCCATCCCAGAAACACAATCTCAGGGACCCGAGCAGCT 
CCAAGGACGAGAGGATACAGCAGACACAACCTAATAGAGAGGGCGCCTGCAGCCTTAACC 
TCCACGGCCTTCGATACTTATGCAAGCCTGGTGTTGCTCCTGTCCTCAGAGTCATCCTGC 
GCTCATGCCTTTTCCCGAATGGGTTCACCTCTGGCAGTTGCCGCTTCAGTCTTGGCCTTA
GCCTCATCTTGAAGTGGGTAGCTGGCGGGAGAGGGTGGGCTGCGCCCCCTGCTGGCCCTG 
AGGCTGCAGAGTTGGGAGCAGGACACCTCACCTGAGTTTCAI 1 1 1 I I I IC ATGTCCAAAC 
CATGCACATACTATAGTCCAGAATCAAAGCACTTTTGAAAA 
Sequence 1030 

CCCCCCCGCTTTTTTNNANNCNCGGGGGGTGNGGCAAAACTTTTTCCTAGAATTNGCGCC 

ATGTTGGAAACNNTNTCNCAGCANCCCGGCTGCACAGGGNGTAGCCCANCCTCCAGGTCC 

ACGGAGTGGTGTGGACCTCCCACCTCACAGCTGCCTCTGGCAGCCAAGCCTCTTTTCGCC 

CGGCCCCAGCCCCTCTGGTTGATAAACGGGTGGGCCTCCTCAGCAGCGTGGCTGCCTTTC 

ACCTTGATTTCCCCAGGGCTCTCGGCAACATCGATAAACCAAGCCTCGCCCACCAGCTGG 

GCCCTCCXJCCACCCAGTCTGCCAGGCTGGGAGCTGGAGCTTGCTGAGTCTTGAATGCCCT 

TCT AG ATGGCTTCTCTAGAGGCTCTCCTGGCAAGAGAGGGTCCCAAGGGGAGCCCTGCAA 

AGCAAAGGCTCCTTGTCTGGGGCGGGATAGAGAATCTCGCCTCTGTCTGGTGTGTTACCT 

ACTGGGGGCACAGGAACAATTTCCTCAAGGAGACAGTGGCATGGAGCTTTGAAAGACGAA 

GTANGTGTTAGCAAGGAAATAAGGAGGAACGGGGGTTACGGGCAGAGGAGAAAGCACATG 

CCAAGTCAGCAAAGAAAAGTAGAATTCGAAAAACTTTTTTA 

Sequence 1031 

GGCCAGAGCTACTACGCCGGCCGATGGCGAGGAGCCCGCCCCGGAGGCTGAGGCTCTGGC 

CGCAGCCCGGGAGCGGAGCAGCCGGTTCTTGAGCGGCCTGGAGCTGGTGAAGCAGGGTGC 

CGAGGCGCGCGTGTTCCGTGGCCGCTTCCAGGGCCGCGCGGCGGTGATCAAGCACCGCTT 

CCCCAAGGGCTACCGGCACCCGGCGCTGGAGGCGCGGCTTGGCAGACGGCGGACGGTGCA 

GGAGGCCCGGGCGCTCCTCCGCTGTCGCCGCGCTGGAATATCTGCCCCAGTTGTCTTTTT 

TGTGGACTATGCTTCCAACTGCTTATATATGGAAGAAATTGAAGGCTCAGTGACTGTTCG 

AGATTATATTCAGTCCA CTATG GAGACTGAAAAAACTCCCCAGGGTCTCTCCAACTTAGC 

CAAGACAATTGGGCAGGTTTTGGCTCGAATGCACGATGAAGACCTCATTCATGGTGATCT 

CACCACCTCCAACATGCTNCTGAAACCCCCCTGGAACAAGCTGAACATTGTGCTTATAGA 
CT 

Sequence 1032 

TCG CCCCGC GTCCGCMTTTCTTTTGMTTTCGATCACTTCTACATTCCCAGCTTGCCAC 

ACTCi I 1 1 i iGATGAAGTTGTGAAGCAGATGGTAGCTGCCTTTGAAAGAAGAGCATGTAA 

GCTGTATGGTCCAGAAACAAATATACCTCGGGAGTTAATGCTTCATGAAGTCCATCACAC 

ATAMGGCAAAAMGMCTGGTGCCACCTGCTTCTGACTTTAGTTTGTTCACTTTTAGGA 

AGTATTTTCATGACATGTTTTCAGAAGCCAGAAAGCATTTGTTAAACGCAGCTTTGGTTA 

TAAACCTGCACCATTGAAAATTTGCACATAGAATATAGACTCACTTGTACATAGAATTAT 

TTCTTCAAGTATAATTCAAAATAATATGGACATTATCATGTTCTGCATTACAATAATGGG 

ATGTCATCACCATTGCTAGAATACTGGCATGATTCTTCTGAGCAGAAGTTGAAACTGTAA 

ATTTAAACCTTTTAATTATCACCTTACCT 

Sequence 1033 

NCGCGTCCGGCCTTTTGTTCAGCTTGCCGGGCAACCGGCCCCTGCTGGGGACTACAAGTC 
CCGTAAGCCTCCGCGGCGGCACGTCCTACCCTACACTGTCCAGCCGGCTCCCTTTTTCCC 
CCTCCCCGGGGGCCAAGGGCTCCGGCTGCTGCCTGGCGGCCAACGGGCCAGGTAGGATTT 
CCGGGAGAGGCTGCTGTGGAGGCTGAGGAGGCGGCGGCGGAGATCTGGAATGAATTTTCT 
ATCTGTGAAATTGTGAAGACGAAAAAAGAAAATTGTGCTAGTTTTGCTGTTGTACTCCAC 
ATTGCAGAAGTTACAATCACAGTGCACAAAACAGTATCTCACCTCCCTAAACTGGTTAAT 
AGTGGCATGGAAGATCCATTTGAGGAAGCAGACCAGCCCACTACAGAGCCAGGCATGGTC 
CTGGACAGTGTGGAAGCAGGAGACACAACACCTTCTACCAAAAGGAAGAGCAAGTTCTNA 
GGCTTTGGCAAGATCTTTAAGCCCTGGAAATGGAGGAAAAAAAAAAGTAGTGATAAATTT 



Sequence 1034 

CGTTTTNNTTCTTCCCTTGGATNAATATTTTCTACTGAAAAGAAATGAAATCTCAGTTCC 
ACTATGATAAAAGAAACGAATCACCAGTGTCTGTCAGTGCCCCCACTGCTCCTCCTCATA 



A 
CCAGCATTGACACTGGTAGCTGAGGAAGGGGACTGCTCAATTTCTAATGTGATCTATTCA

AGAAGCCACATATAAAAAGGCATTGAGGGCTCACTGTCCAGAGAATTGCTTTTGTAAATG 

CTCCACAACGTATGGGCAAAAGTTTAAGGACTACTGCCTGATGTACCAGGAATCCTGAGT 

TCTGTGAAAATCTGTTCATTCCAAATCTGTTAAGCATTTCCAAACATCCAGAGAATGGTT 

TCACTGTTGAGGGTGCAT GTGG CAGAATCTGTCTCTCTCTGCACCTGTCTTCTGTTACCA 

TCCCTGGACAGTGACAGATTTTMGCCAGCCACCAGMTTCTTTTCGAGTTAACCATTTC 

TTAGTTTCTGTAAA 

Sequence 1035 

GTCGACCACGCGTCCGCTCTGACCGCCGCCGGCTTTCACCCACCTGCCCGGCTCATCACC 

TCTGACTCCTGCGGGCCTTCCAGCCGGCGCTATCTCGCAGCCCCCGACCAGCTCGGCCTG 

CTGTCGGGCATCTCTGCTGTCCTCCGGCTGTAGGGCTACCTGCCTCCCTCCCGGGGGGTG 

CACTGCGAGTCCGACTCTAGTGGGGGCAGCGCCTAGCGGACTCXJCAAACCCGCTCAGGTC 

CTGCCGGCCTCACCGGCACCGCCCACCCTCCAGCCGGCCGCGGCTCTGCGCTTGCGCGCC 

GGCAGCACACAGTGTTAGGGGCGCGCCCTCTGCTCTGGACATGCGCCGCCGTCG 

Sequence 1036 

GTC CGCC CCGCGTCCGG GAAAA TGCCGCAGTTTGTTGCCTTGAAACCTAAGAGCAATCCT 

TGGTTTTGTTGCTACATTATTTTTCCAGACCMCACATCTACCMGTNGMTTTTATNNA 

CTTTMTTTCATAATAAAGTTAGTAGAGTCACTCAACTTACAAACTTTATTTATGNTGGC 

TTT GGGC AAAAAAATCACTTATMGGCAGCTCTAMTTTGCCTTGATAAGCTAAATAAAT 

TACTTTTATAACTTACTAAAGC AGAAC AAACAGTGAAACTTTCTAAAATATTCTATNCTG 

GAAATAGN GGACAGG GG GATCTT TTATTTATAATNCTCATCAAGATGAGTGAGGTTGTTC 

ACCAGGATATTTTTTATGG 1 1 I 1 1 1 1 1 IMTTTTTCTCCAAAGNAATTATTTTTAATAGG 

AATTCCCAAMGNAATCAAGAAATTAGTTTTTCAAAAATAAATTTTTTCCAGNTNGATAA 

AMGGAAGGTNGTTTGTAAA ATTAM TCCATTATTTTACCACCTTAAAAAATTTGGGGGG 

MTACCATTCTAMGGGAACCTTTTMTTCTTTACCTAATTCANGNTAGGGGNGTCTTGG 

CMTTTNGAATANTTTTCNTTTT 

Sequence 1037 

GCGTCCGAAAATATTTAGGTAATGTCATAAAATTTATTTTACCTTTCTCATTTTCTGAGA 

AMTAMTGAAAAAMCCCTAGATATTGCTTTATTACCAACAGTGTGTAGGTTTTTGTAC 

ATATGGAAATTTGACACAMAAAATAGGGAATTTGTATAGAGAAGTTTCCCTCTTATAAA 

AGGACTCCCATTTGATTGTTCGAAACTATAAMTGCACTTTTACTTTACCATATCTTGAA 

ATGACAAAATATCGCCCTTTGGAA AACC TGACTCTTTGCACCGTTGTAAATTCCCAGAGT 

CTACCTCAGTTAACCAGGCCTTAGTTTTAGGCAAGGAATGAATTGAATTAAAATTTCAGT 

TCMTCATTTTATGCCAGMTTTTGTTTTTCTTTTTMGGCACCATCCTTCCCTCCTTGG 

CTGGNTGGCCCTTCCCTCCCCATTTMCTTTTCTTTTTTTMTNCTTTGAAAAATTGGGT 

TTAAAAATATTTCCMTCCTTTTCCTTTTCCTCTAGCCAMGTNGGTTTTGTNATTTNCN 

AAATAAAAGGGCCCTCTGGTGGAAAANGGGNCTGGAAATTAACCTT 

Sequence 1038 

CGCCNCGCGTCCGCAAGACTTTGAAAAATNNGATCATGGTTCTTCTCAAAATACCAGCAT 

GTCTAGCATCTATCAGMTTGTGCAATGGAGGTTTTGATGTCCAGTTGTTCACAGTGTAG 

AGCTTGTGGAGCTTTAGTTTATGATGAAGAAATTATGGCTGGATGGACAGCAGATGACTC 

AAATTTGAATACAGCTTGTCCATTCTGTAAAAGCAACTTCTTGCCTCTTCTCAATATAGA 

ATTCMAGATTTGAGAGGTTCTGCAAGCTTTTTCCTGAAACCAAGTACCTCTGGTGACAG 

TTTACAAAGTGGAAGCATTCCATTGGCAAATGAATCCTTGGAGCACAAACCTGTATCCAG 

TTTAGCAGAACCTGACTTGATCAACTTTATGGACTTCCCAAAACATAACCAGGATCATAA 

CTGAAGAAACAGGCTCTTGCAGTTGACCCAAGTGATGAAATAAAGAGAGCXJAGTGGGAGA 

TGTCCAAACTATTGAAAATTTCATCTGTGNCTTAATAGGTTTATC 

Sequence 1039 

TGANATTAGCATCACTTCGTCTACTAAGAATCTTAATAGATGTAAAAATATCTTTTAAAA 

CATATGGTAGGATGGGTAAAATTTGGCAATACTATCCAGGAAGTCACTAAGTACAGATGA 

ACTGATTTAGTCCTAATTCCAAGAAGTGTGATTCCACCTACTTGACTAGAAATTTATACC 
TGGTAATMCTCCTTGTCCTTGAAGATTTTCAACTMGGAAMCTGTTTTTCAGCAGGAC

CTGATTATGC ACTGCTA TCTAGGTAGGGTCACTTATGGTTTTATAATATATTTAATTGGA 

TTATAATATTCC 1 1 1 I 1 1 CTTGTCTCTTGGACAAAATCCTAGCTTTACTGTAATTTAAAA 

AGATGAGTTTAAMTTTCAGGCTTTAAAAACATACCAAACATTGATAAAAATGAAATCTA 

GATAAAAGTATTTTATCAATGTTCAGTTGCCTGGATTCMTAACTGTATTATGGTTAfGT 

AAGGATAATATCTTAGGAAATCACATTATGGTATTAA 

Sequence 1040 

GTCGACCCCGCGTCCGGTAGCTTAGTTGAGTAGATMTCTTTTGTTGTTTCCTCCTTGTA 

ATATACAAGCCTTGGCTTCTGTGACATCATACTCTCCTAGATTTCCCCCTGTCACTGTGG 

CTTCTTCTCAGTCTCTGTCCATCCCTGGTGCTCCTGMGGTTCTGTTCTCAGCCTTACAC 

ACATTACCTGGGTGATCTCATTCTCTGCCATGACTTCACTTGCCATATATGTGCTGATTT 

TCCCCAMTTCCTATTTCTCCCGACCTTTACATCTATTTTATTTGCAGGTCATATATCTA 

ATAAGGAATTGATATCCAGTGTACATGTAGAACTCCTGTAATTCAATACAGAAACCAAAC 

AGTCCAATTMTAAATGGAGAAGAGATTTGAATGAACATTTTTCTAAAGAACATCTCAAG 

CTCMGATTTCCCAGATMCTTTTCTTTCTTCAAAATCTGCTTCTGTGTTTCCTCATCTG 

TAGGTGGCACAGCATACATCTGATTTCCCAAGCCAGAAACCTCATAGTTATTCTrGACTC 

CAGGAAGAAATATTATTGAGTTTTTAAAAACTC 

Sequence 1041 

CGACCCCGCGTCCGTGCTGMCTGAGCTCAGGTGTGTTTTTCTTCCAAGCTTTCTAGCAA 

GGTTTCTACTTAAAATCACCTGTGTGCAAGCCCAAAGGACATTTCATCTATTCTAAGCAG 

AAAGGCTGTTTTGTTCATTACAGTGAGTGCTGTTCATCTCATGGAGTGGGAGGAGCACTA 

AACCAGGAGACAGAGGACATGGATTTGGTTTCCAGCTTAACCAGTTAGGACTCTGTCCTC 

TGCATTCTGGAACCATGAT GCCTG CCTGCCTGCCTCACAGGGCTGTTGTGAGGACCAGAT 

GAGATGATGTATGTTCATACTTTTGGAATCTCTAATTTAMGTCTTMTATTTTGTCTTC 

TGAGTGTGAGGGGATAAACCTGGATGTAGACTATTAAGCAGCATAGGAGAAAAGAACAAT 

AGAATCTAATGGACTGGGTTTGCAATCTCTCTCTAAATGCACTGCTTCAGACAAAGTGAA 

ATCCAAAGGTGTGAAAAAGTATAGCTGCAAATTGGAAAAATGTGTTTCAAGAGT 

Sequence 1042 

AGTCGACCACGCGTCCGCTCTGACCGCCGCCGGCTTTCACCACCTGCCCGGCTCATCACC 

TCTGACTCCTGCGGGCCTTCCAGCCGGCGCTATCTCGCAGCCCCCGACCAGCTCGGCCTG 

CTGTCGGGCATCTCTGCTGTCCTCCGGCTGTAGGGCTACCTGCTCCCTCCCGGGGGGTGC 

ACTGCGAGTCCGACTCTAGTGGGGGCAGCGCCTAGCGGACTCCCAAACCCGCTCAGGTCC 

TGCCGGCCTCACCGGCACCGCCCACCCTCCAGCCGGCCGCGGCTCTGCGCTTGCGCGCCG 

GCAGCACACAAGTGTTAGGGGCGCGCCCTCTGCTCTGGACATGCGCGCCGTCGCGAGCGT 

CTCTGGGACCGGAAGTGCGGGCGAGCGCGGNTCCCCGGGTCTGACAGGAGCAAGCTGTGG 

GCACCGNGGCGGTAGTTGGAGGCGGNAGAGGGTNCGTAGCCGNGCCGNCTGCCCGNCATG 
GGCCTNC 

Sequence 1043 

AGTCGACCCCGCGTCCGCAGGGGCGTGTTGGCCCCGCACAGATTGAGCCGAGTTGTCGCC 

CCGGTGGGAGAAGTGACCCTCCTGCGCCTGAAAAGAAATTTCATTTCAATATAGGTGACT 

ATGCAGCCTGCAATTCAAGTATGGTTTGGAGAAGATCTGCCTCTAAGTCCTCGGAGTCCT 

CTGACTCCCAGACACGGACCAGGATTGGCTAATGTTTGTCAGTACGATGAGTGGATAGCT 

GTGAGGCATGAAGCCACTTTGTTGCCCATGCAAGAAGATCTGTCAATCTGGTTATCTGGT 

TTATTAGGTATTAAAGTTAAGGCAGAAAAATTATTGGMGAACTTGATAATGGAGTACTA 

TTATG TCAAC TGATTGATGTTCTTCAAAACATGGTGAAAACATGCAACTCTGAAGAATCA 

GGGMTTTTCCMTGAGAAMGTGCCCTGTAAGAAAGATGCTGCATCAGGTTCATTCTTT 

GCTCGGGACAATACCCGCAAACTTCCTTCACTGGTGTAGGGACATTGGGGTTGATGAAAC 
TTA 

Sequence 1044 

ACGCGTCCGCCCACGCGTCCGGAGTCCTCCTCCCCGGGTGCCTGCCCGCAGCCCX3CTCGG 
CCCAGAGGGTGGGCGCGGGGCTGCCTCTACCGGCTGGCGGCTGTAACTCAGCGACCTTGG 
CCCGAAGGCTCTAGCAAGGACCCACCGACCCCAGCCGCGGCGGCCG6ACTTTGCCCGGTG

TGTGGGGCGGAGCGGACTGCGTGTCCGCGGACGGGCAGCGAAGATGTTAGCCTTCGCTGC 

AGGACCGTGGTGMGCCTCTGGGCTTCCTGAAGCCCTTCTCCTTGATGAAGGCTTCCAGC 

CCGCTTCMGGCACACCAGGATGCACTTGCCACGGN7TGCCGTGCCCCCTNTTCAGCAGT 
CCCT 

Sequence 1045 

GTCCGCAGAATTGACNAATTCAGGAGGTGTAAAAATAAACAGTGTTCTCTTCTCTACCCC 

AAAGCCACTACTGACCAAGGTCTCTTCAGTGCACTCGCTCCCTCTCTGGCTAAGGCATGC 

ATTAGCCACTACACAAGTCATTAGTGAAAGTGGTCTTTTATGCCTGCCAGCAGACAGACA 

TCAAGGATGAGTTAACCAGGAGACTACTCCTGTNACTGTGGAGCTCTGGAAGGCTTGGTG 

GGAGTGAATTTGCCCACACCTTACAATTGTGGCAGGATCCAGAAGAGCCTGTCTTTTTAT 

ATCCATTCCTTGATGTCATTGGCCTTTNCCACCGATTTCATTACGGTGCCACGCANTCAT 
G 

Sequence 1046 

ACCACGCGTCCGCCCACGCGTCCGGGCGGGGGCATGGACTACTGACCCATGCGGGGCAGC 

GTCCCTGTGACCTGGCCGATGAGGAAGTACTGAGCCTGTTGGAGGAACTGGCCCGGAAAC 

AGGAGGACCTTCGGAACCAAAAAGAAGCTTCCCAGAGCCGGGGCCAGAGCCCCAAGCGCC 

CTCTAGCAGCAAACACAGAAGGAGCTCTGTGTGTCGTCTGAGCAGTCGCGAGAAGATTTC 

CCTCCAGGACTTGTCCAAGGAGCGCCGGCCTGGTGGGGCTGGGGGGCCCCCCATCCAGGA 

CGAGGATGAGGGGGAAGAAGGTCCCACCGAACCACCCCCTGCAGAACCCAGAACCCTCAA 

TGGCGTCTCCTCCCCGCCGCACCCCAGCCCTAAGAGTCCCGTGCAGCTTGAAGAGGCCCC 

CTTCTCCAGGCGCTTTGGCCTCCTGAAGACAGGGAGTTCTGGTGCCCTGGGTCCCCCTGA 

AAGGCGGACAGCGGAGGGAGCCCCTGGGGCTGGGCTTGCAACGCTCGGCTTTCTTCTCCT 

GGCTGGAAGGGACCTTCACTTANGCCAAGGAGCTTCGTNTTGGCAGAATTACCCCGACCC 

CCTTCCCGAAGCTTGCCGGAGCCCTTNTGTCCTTGTCTGAGGGTCACCAAGCCTTCTTCC 

TTTGCTTGGGAGMCTTCCTTGNCTTNCCTTCAGGAATTCCNGAGCCTGGATTCCCAGCG 

AANCCNAACGTTCCCACAGGCTTTCACGGGGC 

Sequence 1047 

TGTCGACCCACGCGTCCGCTCCCCGCCGAGGCCTCCTGCACCACCTAGAGCCCCACCCCC 

GACCCCACCCCGGGAGGGCAGAGCCAGAAGAAGGCTCATTAGACCTGGGGGACCCAAAGG 

GTCTGGCCTCTTTGGGCAGCCCCAGAGATGAGGGGTCAGCAGAGGAGAGCTCTGGGGTTG 

GGGATGGGTTAGGGACGCAAGCTTGAGTTCTAGCCCTTGCTCTCATTCAGCTGTTGTGTG 

ACCCTGGGTAAGACCCTTCCTTGTTTGACCCTCAGCTTTCCCATCTGTTTAATGGTGGCT 

TTGGCCAAGGCAATCCACAAACGTCAAAATTCCCCTTCCCATCAGTACACACACCGATGC 

ACACACACTCTCTCTTTCTCTCTCTCTCTCTCTCTCTCACACACACACACACACACACAC 

ACACACACACACACTAGTTAGTGCCTTGGATGAGGCGGGGCAGTGTGTATATGGACCCCT 

GGACTTGCTACCTTCAGGGTTCCATACTCGTCCCTCCCCTCCTGGCTCTGCTGTCTGGAG 

TCTGGCAAGCGGG 

Sequence 1048 

CGCGTCCGCCACGCGTCCGCCGNCCGCCGCCTGCCTGGGCCGGGCCCGAGGATGCGGCGC 

AGCGCCTCGGCGGCCAGGCTTGCTCCCCTCCGGCACGCCTGCTAACTTCCCCCGGTACGT 

CCCCGTTCGCCCGCCGGGCCGCCCCGTCTCQCCGCGCCCTACGGTCGGGTCCTCCAGGAG 

CGCCAGGCGCTGNCGCCGTGTGCCNTCCGCCGNTCGCCCGCGCGCCCGCGCTNCCCGCCT 

GCGCCCAGCGCCCCGCGCCCGCGCCCAGTCCTCGGGCGGTCATGCTGCCCCTCTGCCTCG 

TGGCCGACCTGCTGNTGGCCGCCGGGCCCGGGCCGAGCCTGGG 

Sequence 1049 

NCCACGCGTCCGCA AGGCG CTACGTTTATTGCCTCGTCTTATTCACTGACGTTTGTAATG 
ATACACAGTGMTTCTTTTTGACAAAGAGAAATGCAGTGTAGTATGCAGAGCTGCTGTTT 
TAATGC CTATGCAT TTACTCTTTCCTGATTTAGGCAGAGGTGGCATTTTCTTTATTGCAT 
TTCTCTA'I 1 1 1 1 I IAAT GTAC CCTACCTTCAGTATTCTCTTTGTAAGTTGGTGACTTGCA 
TCTGTGGCCTTGMTATTTTATTATCACATGTGGCATAACAGTATCCACACTTTTTAGTT 
CTTTAI I I I I I I I I I I I I ATTTTGAGCAATTCTCCTGCCTCAGCCTCCCAAATAGCTGGG

ATTACAGGGTGCATGCCACCACACCCAGCTAATT 

Sequence 1050 

CGCCCCGCGTCCGGGATGGACMGACAAATCTCTTGTAAAAAAMTTACAAGTMTTTTT 

ATAGAAGCTCTGCCCTGAGGGAGGGGGAGCGTGACTTCTCACTCCTTCAGTGTGGGCTGC 

ACAAGTGACTTCCTTCCATATGGGATCGTTATAACAAAAGACTGTAACAAGGGCTATGGG 

AGTTATAAGACAGGAATTGTGGACAAAAACCAGTGTATATCATAACATCACACCTTGTAA 

TGTTGGCAGTACAGTCACTGACCTTTGATAAATGTTGATGACATGTTGAGTAAAGGAATG 

AGAGAAAGAGGATTGTTrATCTCTGTTTTTATCCTTCTCAGAGAAACTTAGAGTAACAAG 

GTGTGTTATCAGCCATGCTGATGCCTTTGGTAMCTATTGTGTGANATNGGGTGGTTTGA 

ATTGGTCAAGTAGAAACTGGGGCTGCCAGGCGCAGCCGGTAAGCATTTNCATGTGAGCCT 

TAGGGAANMGTGCATTTTGGTAGGAGCX^ATCAAAAATAGCTTCTTGATATTTCATTAAA 
AG 

Sequence 1051 

GACCACGCGTCCGGGGCTCCTGGGTGTGCCCGCGGCCTCTGGCGCGCAGCGACTCAGAGA 

ACGTCTACGAGGTCATCCAGGACTTGCACGTCCCGCCGCGGGAGGAGAGCGCAGAGCAGG 

TGGACGACCCACCGGAGCCCGTGTACGCGAACATAGAGAGGCAGCCCCGGGCCACTTCAC 

CGGGCGCCGCTGCAGCCCCCCTTCCCAGCCCGGTGTGGGAGACGCACACGGACGCGGGCA 

CCGGGCGCCCCTACTACTACAACCCAGACACGGGAGTTACCACCTGGGAGTCGCCCTTTG 

AGGCTGCCGAGGGTGCCGTCAGCCCAGCCACCTCCCCTGCCTCGGTGGACAGCCACGTGA 

GCCTTGAGACCGAGTGGGGCCAGTACTGGGATGAGGAGAGCCGCAGGGTGTTCTTCTACA 

ACCCGCTGACGGGCGAGACGGCCTGGGAGGACX5AGGCCGAGAACGAGCCCGAGGAGGAGT 

TGGAGATGCAGCCGGGCCTGAAGCCCTGGCAGCCCAGGGGGACCCGNGGNCCC 

Sequence 1052 

CGGCTTTGCCGCMCATGCTCAATTCCCGATCATCGCTCAMGTGCTAMTTTTCAGGAG 

TGAAAAGAAAAAGAGGAAGGAAGAAACCCCTCTCAGGCAATCATGTACAGCCACCCGAAA 

CAATGAAATGTAATAC ATTCA TAAGACAAGTGAAAGAAGAGCATGGCAGACACACAGATG 

CMCTGTGAAAGTTCCTTTTCTTAAGAAATGCAAGGAAGCAGGACTTCTTAATTACTTAC 

TTGAAGAAATATTAGACAAAGTTCATTCAATTCCAGAAAAACTCATGGATGAGACTACTT 

CAGAATCAG ACTAT GMGAMTCGGGAGTGCACTTTTTGACTGTAGATTGTTCGAAGACA 

CATTTGTAMTTTTCAAGCAGCAATAGAGAAAAAAATTCATGCATCTCAACAAAGGTGGC 

AGCAGTTGMGGMGAGATTGAGCTACTTCAGGACTTAAMCAAACCTTGTGCTCTTTTC 

AAGAAAATAGGAGATCTTATGTCAAGTTCTACATCAATATCATCCCTGTCTTATTAGGGA 

TTACCGTTTCCTAAGCCAAGAGTCATGTCAAATTGCAATCAGGC 

Sequence 1053 

GACCCCGCGTCCGGGAGGTTGAACGTTCAGGCTAAGACCGTTACTGAATTGGTTACTAAG 

AAGAAGCCAAAGGCTGAAGGCTATGCTGAGGGTGACCTCACTCTCTATCACCGTACCTCA 

GTCACTGACTTCCTCCGAGCTGCCAACCCTGTTGACTTCCTCTCCAAGGCCAGCGAAATC 

ATGGTAGATGATGAAGAGTTGGCACAGCATCCAGCTACCACTGAGGACATACGGGTGTGC 

TGTCAGGACATCAGAGTGTTGGGGCGCAAGGAGCTCAGGTCGCTACTAAACTGGAGAACA 

AAACTTCGGCGATATGTGGCCAAGAAGCTGAAAGAACAAGCAAAGGCACTGGACATCAGC 

CTCAGCTCTGGAGAGGAAGATGAAGGTGATGAGGAGGACTCAACAGCTGGAACCACAAAG 

CAGCCCTCTAAGGAGGAGGAGGAAGAGGAGGAGGAGGAACAACTGAACCAGACCTTGGCA 

GAAATGAAGGCCCAGGANGTGGCGGAATTGAAGAGGA 

Sequence 1054 

GTCGACCCCGCGTCCGCAAGGACCATGTTGTACCACAGCCTCTGCTGAGCTGAGGGACAC 

ATGTCCTTGGTGMGACCTGCACCCCTGGAACCTCCCACCATCATCACMCTGNAGTCTC 

ATTTGCAGTGGAGAAMGAACCCGACGTCCCACAGCCAGATATACACCCAGCTCCATGCC 

AGCCCTTCATGTTTACCTTTTGCTTTGTTAATTACATGTCAGACTCCTAGAGGGCCTCCA 

GACTAATAGGAAGCATTTCTGTAACCAACCTGCCACCCACTGATTCAGAAATGGAAATCA 

CATTCCACAATCTATGGCTTCCACCAGCTAGCCCAGGAAATACTTGAAATCAGCATTCCT 
NAAAAAAAAAAAAAAAAAAAANAAAAANNNANAAAAANNAANAAANAAAANNAAAAANGG

GGNnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnGGTNCAAAT 

AAAGCAATAGCNTCCAAATTTCCAAATAAAGCA I 1 1 I I I I I C ACTGNATTCTAGTTGGGG 

GGTTGG 

Sequence 1055 

CCACGCGTCCGCTGTTCCCGACAGCATGGATTAGCTTCCGTGTTCTGAAGTTGTTCTTTT 

CATGGTGTCTGACACCGAGGGCCGTTGTTCGTCCATCAGGCGGGATTGGATGGAGTCTTG 

GTGTTTTGCCTTCTCAGGGACCAAAAATGTATCATTGACTCCTTAACAGTGACCTTCCTC 

CCMGGACATATCCGTGTTCATTTTTCATAGGTTTTACTCATATTCATAGGTAGATTCTG 

TTAATGTGAGTTGGAAA GAAAAG ACCAATTTGTACACCAGTCACACCACAAGACAGTTTA 

TCATATAAMTACCTCAATTTTTTGTATTCCTCATTTCCACCTCACAATTGTACTGGTGA 

TGAATTTTAAGGGTCTG 

Sequence 1056 

TCGCCNCGCGTCCGGGCAAGGAAGGCTCTGTTAATTTATAGCTGTTTAGAGGGGAAAGCA 

GTGCAGACCACTTATTAAGCCCGCTGAGGACTAGCTTTCTGTCTTTCATACATTTGGGAA 

AGATAGGAATGAC TGTTT CAAAGAAGAGAGGTGCACATAATTTATGCAGGCAAGTATGAT 

ACTTATTTCATTGGTTTTGTGAGTCATATATATACATATATATATATACACATATATATA 

TGACTTGAAAATTAAGATTTMTACTTTAATGTTTTAAGTGTGGGGGGTTTGGGAAGGAA 

GGAATGTMTATNTGGGATTTAGCCTTAGGCTTTAAGTTTTAGGCTGGCAAAAGAAATGT 

TATTCAGTGGGTTTGAGGTTTGGACTACTTTCTTCAAACTTAGAGAATTATAACAGGATG 

GTGTTTACCTTTGTTCACCTGGGATGTCCCCAAGACTCTAGCTTCTTCTATCAAGTGGTT 

GGGTCTGATAGAAGAGGTAAAAATTGCTCTTGAAAATNNTCTACAATTATGCAGGTTCTT 

TGATAAAATTTTCTGGTTAAATCC 

Sequence 1057 

CCNCGCGTCCGCCNCGCGTCCGCCCAGGCCTGGGCCGGGCCGTGACGGCGGCGCTAGGAC 

CCGGCGGGCCGCGGGTGCGGCGAGGCCTGGGCGGCCTGAGGAGCGCGGACCCCGGCGCTC 

GGCTCCCGGCGCCATGTGAGGGGGCTCGGGGGCCGCGGGGGGCCGGGCGCTCCCCGCCGG 

AGGTGTGAACCCACATCCCTGCCCCCAGGGCCACCTGCAGGACGCCGACACCTACCCCTC 

AGCAGACGCCGGAGAGAAATGAGTAGCAACAAAGAGCAGCGGTCAGCANTGTTCGTGATC 

CTNTTTGCCCTCATCACCATCCTCATCCTNTACAGCTCCAACAGTGCCAATGAGGTNTTN 

CATTACGGNTNCCTGCGGGGCCGTANCCGCCGNCCTGTCAACCTCAAGAAGTGGAGCAT 
Sequence 1058 

TGGGGCCGGGAGGAAGTCTAACCTTTGGGAGACTCCAAGACCGCAGCTCCGAGGTCGGCG 

GGGGTCTGGGTGGCCATGGAGGAGCCCCCTGTGCGAGAAGAGGAAGAGGAGGAGGGAGAG 

GAGGACGAGGAGAGGGACTAGGTTGGGCCCGAGGGGGCGCTTGGCAAAGAAGCCCCTTCC 

AGCTGACCGCCGAGGACNGTGTATNACATCTNCTACCTG 

Sequence 1059 

TGGCGATCGCTGAGAGGCACGGAGGGCCGAGGCGGNCCTGGGAGGCGGCCCGGAGGTGGG 

GCGCCGCTGGGGCCGGCCCGCACGGGCTTCATNTGAGGGCGCACGGNCCGCGACCGAGCG 

TGCGGACTGGCCTCCCAAGCGTGGGGCNGACAAGCTGNCGGAGCTGCAATGGGCCGCGGC 

TGGGGATTCTTGNTTGGCCTCCTGGGCGCCGTGTGGCTGCTCAGCTCGGGCCACGGAGAG 

GAGCAGCCCCCGGAGACAGNGGCACANAGGTGCTTCTGCCAGGTTAGTGGTTACTTGGAT 

GATTGTACCTGTGATGTTTGAAACCATTGATAGATTTAATAACTACAGGCTTTTTCCCAA 

GACTACAA AAACT TCTTGAAAGTGACTACTTTAGGTATTACAAGGTAAACCTGAAGAGGC 

CCGTGTCCTTTTTGGAATGACATCAGCCAGTGTGGAAGA 

Sequence 1060 

CCACGCGTCCGGGGTGTTGGTGCGGCGCTGTTGGGGTCCTCCGCTGGCTCATGGCGCCAG 
GCGTGGGAGGCCGAGTCCCCAGTGGCGAGCACTGGCCCGACTCGGCTGGGAGGACTGCCG 
GGACTCCAGAGTCCGCGAGGCGCTGCGGGCGCTGCACGCCGCCAGGGAAAACAAAGAAGA 
AGAGTTAATCGACAAACTGGAGGTGGTCACAATGCCTTCCCCATCACCAAAAGGACTGCC 
AGTGAAGCAATATGCTGTGCAGTCTCAGCTTCCCGTATATGAGTGGCCGGATGTGGGATC

TGGAGAATATGATGTTGGAGTAGTGGCTTCGTTTGGCCGACTTTTGAATGAGGCTCTTAT 

TCTTAAATTTCCCTATGGCATATTGAATGTTTCATCCCAGTTGCCTCCCGAGATGGCGTG 

GCCCAGCCCCTGTAATCCATACAGNTGCTTCACGGAGACACAGTTACNTGGAGTAACAAT 
TTATGCAAATTAGACCTAA 

Sequence 1061 

GCCGGTTCTTAGGGAGGCAGGTGCTGGCCTGGCCTGGATCTTCCCCATGTTCCTGTTGCT 

GCCTTTTGATACGCCTGATTGTCAACCTTCTGGGCATCTCCCTGACTGTCCTCTTCACCC 

TCCTTCTCGTTTTCATCATAGTGCCAGCCATTTTTGGAGTCTCCTTTGGTATCCGCAAAC 

TCTACATGAAAAGTCTGTTAAAAATCTTTGCGTGGGCTACCTTGAGAATGGAGCGAGGAG 

CCAAGGAGAAGAACCACCAGCTTTACAAGCCCTACACCAACGGAATCATTGCAAAGGATC 

CCACTTCACTAGAAGAAGAGATCAAAGAGATTCGTCGAAGTGGTAGTAGTAAGGCTCTGG 

ACMCACTCCAGAGTTCGAGCTCTCTGACATTTTCTACTTTTGCCGGAAAGGAATGGAGA 

CCATTATGGATGATGAGGTGACAAAGAGATTCTCAGCAGAAGAACTGGAGTCCTGGAACC 
TGCTGAGCAGAACCAATT 

Sequence 1062 

CCNCGCGTCCGCTTTGAATNCTTATCTTTGATTTMTTTACACGCCAGCATTTTGCCACG 

TTCTAAATAATATTTAGCTCAACTGATTCATACGTATTAATGACCATTCTAGCAAAGGCC 

TACAAGTGGTGTGGGAATCAGGGAAAGGCTGCCTCTTTGGTATCTCAACTGGTATTGATT 

ATTGCTATCAACTATTTGGGGAGAAAAAATCAAAATGAAGCCCTGTCAAATTTTAGAAGT 

ACTATCTTTGGTCCTTCAAACACTTTGTGATGACACCTTAAGAAAAATAAAGTTGAAGTT 

CAGGTCTTGCCATTGCCATTACAGACAAATTAGGAGACTTGGTTTACCTGGGAACAAATT 

TACTTGAATATTCAGTACCTGAAACTATGCCAAACCAAAGAGCAGCTGCAGTACATTCGT 

TATT TTAMTGMCAAGGTTTACAAAGNTTATTTTCATCTATACCGTAAGGNTGGATTTT 
TTTTNAA 

Sequence 1063 

GTCACCACGCGTCCGCCNCGCGTCCGGCGTGCATGGAGGAACGCTGGGCACGGGCCCGGC 

GCGGGTCGGGGGGCGCCCGAGGGGCCCGGGCCGAGCGGCGGCGCGCAGGGCGGCAGCATC 

CACTCGGGCCGCATCGCCGCGGTGCACAACGTGCCGCTGAGCGTGCTCATCCGGCCGCTG 

CCGTCCGTGTTGGACCCCGCCAAGGTGCAGAGCCTCGTGGACACGATCCGGGAGGACCCA 

NACAGCGTGCCCCCCATCGATGTCCTCTGGATCAAAGGGGCCCAGGGAGGTGACTACTTC 

TACTCCTTTGGGGGCTTGCCACCGTTACGCGGNCTTACCANAACTGCAGGCGAGAAGACC 
ATTCCCCGCCAAA 

Sequence 1064 

GTGCCACGCGTCCGCCACGCGTCCGCCTGCCCCTCGCCGCCCGCCGCCTGCCTGGGCCGG 

GCCGAGGATGCGGCGCAGCAGCCTCGGCGGCCAGGCTTGCTCCCCTCCGGCACGCCTGCT 

AACTTCCCCCGCTACGTCCCCGTTCGCCCGCCGGGCCGCCCCGTCTCCCCGCGCCCTCCG 

GGTCGGGTCCTCCAGGAGCGCCAGGCGCTGCCGCCGTGTGCCCTCCGCCGCTCGCCCGCG 

CGCCCGCGCTCCCCGCCTGCGCCCAGCGCCCCGCGCCCGCGCCCAGTCCTCGGGCGGTCA 

TGCTGCCCCTCTGCCTCGTGGCCGCCCTGCTGCTGGCCGCCGGGCCCGGGCCGAGCCTGG 

GCGACNAAGCCATCCACTGCCCGCCCTGCTCCGAAGAGAAGCTGGCGCGCTGCCGCCCCC 

CCGTGGGCTGCAAGGAGCTGGTGCGAGAGCCGGGCTGCGGCTGTTGCGCCACTTGCGCCC 
TGGGCTTGGGGA 

Sequence 1065 

CGCGTCCGAACGGCATCATCACGCCCGCCACCATCCCCAGCCTGGGCCCCTGGGGAGTCC 

TGCACTCAAACCCTATGGACTACGCCTGGGGGGCCAACGGCCTGGATGCCATCATCACAC 

AGCTCCTCAATCAGTTTGAAAACACAGGCCCCCCACCGGCAGATAAAGAGAAAATCCAGG 

CCCTCCCCACCGTCCCCGTCACTGAGGAGCACGTAGGCTCCGGGCTCGAGTGCCCTGTGT 

GCAAGGACGACTACGCGCTGGGTGAGCGTGTGCGGCAGCTGCCCTGCAACCACCTGTTCC 

ACGACGGCTGCATCGTGCCCTGGCTGGAGCAGCACGACAGCTGCCCCGTCTGCCGAAAAA 

GCCTNACGGGACAAGAACACGGCCACGAACCCCCCTGGCCTCACTGGGGTGAGCTTCTTC 
TTCTTGTCGTCATCGTCCTTCTTCAAGCTTGGCX5AGCAACGAGAACGCCACAAGGAAACT

NGTGAGCCCACGTTNGGCCGTCGGGAAAACACGGGGN 

Sequence 1066 

CGCGTCCGGCCTCCAGCACATCCTGCCTGCAGAGGGTCTGGCTAGCTGCCTTTTCAGCTC 

TCGAGGGATAGAGATTCTACAACCTCCCTCTGTCATCAGTTCCAGAGCCACTCCCCTTTG 

CACTAGAAGTTCTTGCTTTCAAAGAATGAGGGTGTGAGGGAGGGAGGGGTCAAGAAACAG 

AGTGACAGGGGAAACAGGCAGAACAAGTCAGGGCAAAGGACCCCAGCATGAATAGTTGTG 

GAGGTGGAGGTGGGGAAGCAGCCTCACATCTCACACTTCXJTTCCTTCTCTTAAATGTGAG 

CAGCTGACTCCAAGCCTTGTGGAAACTCTAGAAGGTAGAACCAGCCATCTGGGGAAGCTG 

GCCTTACAGATGCCCCGTCTGGCATAGTGGGAGGTTCCTGTGCTCTGAGAACCCCAGTGT 

GAATCTAGACATTCCACTGCAGCCTGGGAAGAAGCCTGTG I mTCTl I AAAAAGTCT 

Sequence 1067 

GCGTCC GGTTC TTAGGGAGGCAGGTGCTGGCCTGGCCTGGATCTTCCACCATGTTCCTGT 

TGCTGCCTTTTG ATAG CCTGATTGTCAACCTTCTGGGCATCTCCCTGACTGTCCTCTTCA 

CCCTCCTTCTCGTTTTCATCATAGTGCCAGCCATTTTTGGAGTCTCCTTTGGTATCCGCA 

AACTCTACATGAAAAGTCTGTTAAAAATCTTTGCGTGGGCTACCTTGAGAATGGAGCGAG 

GAGCCAAGGAGAAGAACCACCAGCTTTACAAGCCCTACACCAACGGAATCATTGCAAAGG 

ATCCCACTTCACTAGAAGANGAGATCAAAGAGATTNGTCGAAGGGGNCNNAGTAAGGCTC 

TGGACMCACTCCAGAGTTCGAGCTCTCTGACATTTTCTACTnTGCCGGAAAGGAATGG 
A 

Sequence 1068 

TCGACCCCGCGTCCGGCTGGTTTTCCGTCTGGTGAGGGGTTACTTCCGGGTCGGACGGCG 

CTAGCTGCAGCATCGGAGTGTGGCAGTGCTGGGCTGGCCGGCGGGCTGGGCTGCGGCCCG 

CGCGCGGCCGGCGATGCANGGGGGCAACTCCGGGGTCCGCAAGCGCGAAGAGGAGGGCGA 

CCGGGGCTGGGGCTGTGGCTGCGCCGCCGGCCATCGACTTTCCCGCCGAGGGCCCGGACC 

CCGAATATGACGA ATCTG ATGTTCCAGCAGAAATCCAGGTGTTAAAAGAACCCCTACAAC 

AAGCCMCCTTCCCTTTTGCMGTTTGCAMCCAACTCTTGCTGGGTTTCTTTGCTGGAA 

GCACNTTGAGCCCACTGTGCATGAACCA 

Sequence 1069 

CCGTCCGGGAGGTTGAAGTTCAGGCTAAGACCGTTACTGAATTGGTTACTAAGAAGAAGC 

CAAAGGCTGAAGGCTATGCTGAGGGTGACCTCACTCTCTATCACCGTACCTCAGTCACTG 

ACTTCCTCCGAGCTGCCAACCCTGTTGACTTCCTCTCCAAGGCCAGCGAAATCATGGTAG 

ATGATGAAGAGTTGGCACAGCATCCAGCTACCACTGAGGACATACGGGTGTGCTGTCAGG 

ACATCAGAGTGTTGGGGCGCAAGGAGCTCAGGTCGCTACTAAACTGGAGAACAAAACTTC 

GGCGATATGTGGCCAAGAAGCTGAAAGAACAAGCAAAGGCACTGGACATCAGCCTCAGCT 

CTGGAGAGGAAGATGAAGGTGATGAGGAGGACTCAACAGCTGGAACCACAAAGCAGCCCT 

CTAAGGAGGAGGAGGGAAGAGGAGGAGGGAGGAACAACTGAACCAGACCTTGGCAGAAAT 

GAAGGCCCAGGAGGTGGCGGAATTGAAGAGGAAAGAAAAAGAAG 

Sequence 1070 

GCGTCCGGTGCTGGAGGAAAATGTTTCTGGGGAAGATGACTCAGTCATTTTGTGGCGAGA 
CACCCTTTGGTAACTCCCACTGACCAGTCTTGGGAGCCTTCCTGGAATGATCGTGGGCTG 
AGCGGAGATGTTTTTTGCAAAATGAAACTGAAGCTGAAAGAAAGGAGAATTCGAGTGAAC 
CAAGAGAAATCCAAAGACCTGGGGAAGGAGGACTTAAGATGAAAGTGAAGCAAGAGAGGG 

AAGGGGAAATGAAGTGAAAATGGCGTGAGGGTGTGAGAGAGGTTTGGGTTAGGAAACATG 

TTTTTAGTGCTATTTNCAACCAGGG 

Sequence 1071 

CACGCGTCCGGGACTGATCTCNAGGACCAGCACTCTTCTCCCAGCCCTTAGGGTCCTGCT 

CGGCCAAGGCCTTCCCTGCCATGCGACCTGTCAGTGTCTGGCAGTGGAGCCCCTGGGGGC 

TGCTGCTGTGCCTGCTGTGCAGTTCGTGCTTGGGGTCTCCGTCCCCTTCCACGGGCCCTG 

AGAAGAAGGCCGGGAGCCAGGGGCTTCGGTTCCGGCTGGCTGGCTTCCCAGGAAGCCCTA 

CGAGGGCCGCGTGGAGATACAGCGAGCTGGTGAATGGGGCACCATCTGCGATGATGACTT 
CACGCTGCAGGCTGCCCACATCCTCTGCCGGGAGCTGGGCTTCACAGAGGCCACAGGCTG 
GACCCACAGTGCCAAATATGGCCCTGGAACAGGCCGCATCTGGCTGGACAACTTGAGCTG 
CAGTGGGACCGAGCAGAGTGTGACTGAATGTGCCTCCCGGGGC 
Sequence 1072 

CCCGCGTCCGCGCGACGGCCGCGGCGCGGGACCTTAGGACCCGCGGGCTCCAGGGCTACT 

GTCCGTCCGCCACTGCGCGCCAGCAGGTCCTGGTCTCCGCTCTCCAACAGCTGAAAGGCC 

GGCGCAGTGAACACAGAAACGAAAACCAAGAAATGCCTTATTCCACAAACAAAGAGTTGA 

TACTTGGCATCATGGTGGGCACTGCTGGAATCAGCTTGCTGCTCTTGTGGTACCACAAGG 

TCCGTAAACCAGGGATAGCMTGMGTTACCTGAATTTCTTTCTCTGGGTAATACATTTA 

ATTCMTAACTTTGCAAGATGAMTACATGATGACCAAGGAACAACAGTAATCTTTCAAG 

AAAGGCAACTTCAGATACTGGAGAAGTTAAACGAATTCTGACAAATATGGAAGAACTCAA 

AGAGGAMTCAGATTTCTTAAAGAAGCTATTCCAAAGCTGGAGGAATATTTACAAGGATG 

AACTTGGGAGG 

Sequence 1073 

CGCGTCCGCTGAGTTCNAGGATGGTTTTTTCTTGGGACCAGACATGAACAAAAGTTGACC 

TCATGAGCACTTCAACCTCTCCAGCTGCCATGCTCCTCCGGAGGCTGCGGCGACTCTCCT 

GGGGCAGCACTGCTGTCCAGCTCTTCATCCTAACAGTGGTGACAGTTTGGCCTGCTGGCC 

CCCCTGGCCTGTCACCGACTTCTACACTCTTACTTCTATCTGCGCCATTGGCATCTGAAC 

CAAATGAGCCAAGAGTTCCTGCAGCAAAGCTTGAAAGAGGGTGAGGCTGCCCTCCACTAT 

TTTGAGGAGCTTCCCTCTGCCAATGGCTCAGTGCCCATTGTCTGGCAGGCCACCCCCCGG 

CCCTGGCTGGTGATCACCATCATCACTGTGGACAGGCAGCCTG 

Sequence 1074 

CGTCCGTGAAAATCCAAAGATGTATCATTTTTATTTGAATCCATCATGCAGTGTACATTT 

CAGATAATTTCCTTCAGT CTCCAGATA GGAGTGTATCCAMCATCTAATTTTATGTGCAC 

TGTGTATCTTATATGMTGTTTTATTTTATATACCACATGCAAAAATGNCCATATGCACT 

A TTTAA ATGTTTTAMTAATATATTCCTTCTTTATMTGCTAMTCT ATATGAGTACCAT 

ATTITTATMGTCAGTGGTCTGACNGGrsnTCATTmNMNTMCNNN 

GGTAT TCAANGNGAAAAGGGTGGNTGTGAGGAGAANATGTGAAAGNGGNNTGGGNGNNCT 

CTTTTGCCTTTGGGCCAGGAATTNGGGGGGCNAAMTNNACCCANAACCTGGNNNAAAAN 

TAGGNCCANTTGGGGGNGANAGGTTTCACTTTGGGGCNCAAAMNAA 

TTTNTNTTTNCCAAMTANATTNTTTTTGGGA'I I I I I I I I I GTNCCCCCCCCGNATTAAA 

TGGGGANTTGGCTGGNGTCTTGGCNCCTNTCATTTGTGCCAGACC I I I I H I I ATTAAAA 

MGMCCTTGGGAAAGGTCTTAAGTNCATTTGGGAAAAAAAAAAAAAAAAAA 

Sequence 1075 

GAGCCGNCCCACGCGTCCNCGGNCGCGTGGGCTACCTTGGAAGCAGTCATCTCTCAGTCT 

TACATTTGGAGAATGTGGATGGCATGACATCAGAATTCCT7TATATMTTTAACTTCAGA 

ATAGTCTGAGATCATCGAAGCACGATGGTCMGGGMTTCCGTTTTTGTTTTAGAGCAAA 

TATGTTTGCTGTTTGTCTTTCATCACAAACATCAGTGGAGTTTCAGCACCTTACAGAGCT 

CAGTGAACCCCCTGGTCACCATCAAAGTTAGCACACAACAAAGCCAACCACGTGTCCCCC 

TCACAGATGACAATGGCTGAACTCTGAGTGAAACCACCTGTATGGCCGGGCACAGTGGCT 

CA 

Sequence 1076 

GCCCCGCGTCCGCTTTTTGAGAATCTCTGCTCTGTTCCTAGGTTCAGTGCTGGGTCCTGG 

GAATACAGCAGGACAGACCTCAGCTTATCTCTTCATAGAAATTATACAAAGAGAATTGGG 

GAGACAGCTAAGAAGAAMCAAAGAMTAMGCAGTTACAMTTGTGATAAAGTGCTTTT 

GAAGGAAAGAAGGGGTCTGAGACAACAACAGGGAAGGGGCCTCTCTTGAAACAGTAGTTG 

GGAAGGAGGCAGACATGCACCAGTGATGTGGTGACAGGTGCTCTGAAGGAGGTCACCAGG 

ACCTGACCTCTTTGAAGGATCAGAAMTACTTCCCTGAAGGACTGACATTTGAGCCTAGA 

CCTGAAGGGTGAGCCATCAAGCTAAGACAATTGGGGAAGAGCATTCCANGGAGAGGGAG 
Sequence 1077 

CGCGTCCGAtCTTTGTCTGCTTTCCTATAACTCAGTACTGTAACTCAGTACTCTGAAATA 
GTTTCCTTTGTTAATAGAGTCACTTTTATAGTACTGNGCTTGAGGNNATATACAGAGTAT 

TGTGTCCAAATTTATCATTGCACAAAGTGTTTTTGGAAATTCTTGGTTACTCCTTAGTAA 

ATTACCTGTAATTGGGTTAAATGCTGGTAGGGTTTAAAATCTGATTGCTAAAGTGAATTC 

TCTA TAM GTGAGTTT TGATAC ATAGAAACTTTNCATATAATTCTTAAACTCATGTGTCA 

TGTATTTTCATTTATAGTTTTTCATATTCATTAACATATGTTGTTCCTTACCATTTACAG 

CTCANMTTCTGCANATGCAGATTTTTGCAAACTTTGATGCATTTGGACAGTCTAGTGGT 

TCGAGTAATTTTGGAGGTTT 

Sequence 1078 

TNCGGGCGGCTGCGGCGGGCGGGCAGGCGGGCAGGCCGGCAGGCGGGTGCGCGGAGGGCT 

GGTACCCCTNAGCAGGTGGGCGGGGTGCGGTTGGNGGCGGCGGCTGGGCCGGGGGCTGCC 

CGGCTGCGCTCGGGCCGTGCGCGGNGGCCGTGCGGGCACGCCATGGACTTCAACATGAAG 

AAGCTGGCCGTCGGACNGCGGGCATCTNTNTTCACCCGGGCCGGTGCANTTCACGGAGGA 

GAAATTTGGCCAGGCTGAGAAGACTTGAGCTTGATGCCCACTTTGAAAACCCTTCTGGCC 

CGGGCAGACAGCACCAAGAACTGGACAAGAAGAAGATCTTGAGGCAAGACAAGAGGTGCC 

TGCTGCAGCCCCAACCCCAGTGCCCCGAGTGGAGGGAATTANCTGTATGAAGAAGCTTG 
Sequence 1079 

CACGCGTCCGTGCTGAGCGGTCCCGAGGGGAGGGGCCTGAGGCCGAGTTCCAGTCGCTGA 

CCCCTAGCCAGATCAAGTCCATGGAGAAGGGGGAAAAGGTCTTGCCTCCCTGCTACCGGC 

AGGAACCTGCCCCGAAGGACAGGGAGGCCAAGGTGGAAAGGCCCAGCACCCTCCGTCAGG 

AGCAGCGTCCTCTTCCCAACGTGAGCACCGAACGTGAGAGACCCCAGCCTGTCCAGGCCT 

TCAGCAGTGCACTGCACGAGGCTGCCCCCTCCCAGCTCGAGGGGAAGCTGCCATCTCCTG 

ATGTCAGGCAGGACGATGGGGAAGACACCCTGTTCTCGGAACCCAAGTTTGCACAGGTCA 

AGCTCMGTAATGTCGTCTTGAAGACGGGATTTGATTTTCTGGACAATTGGTAAAATGTA 

1TAGAAAAATACMT GAAAG AACCCTAAAATGTTTTCCAAAATGGTGTGGTGGAGGAGGA 

TAAAAMGGG(XACCTTTTCCTATGTATTTTACTGGTTTCTTGACACTCTTTTCTTAATC 

ATTTGGAAACTGGTCAATACTGNCAGATTTTTT 

Sequence 1080 

GTAAGCCAGGTGCTCCCCCTTCACTTCCTGTGTGCGGAGCACGCTCGCCCTGGGAGTTTC 

ACTAGAAAGAAGGTTGCCATGGGCCAGTGGGACAGCTTGGATCTCAAGTGCACGCGGATG 

CCCCAGAATCCCAGGATCTCAGCTGAGCTGTTTGTGGATTATTAGATCTGACTTAAAAGA 

ATATTATCCAGCAATGCAAATGAACAAACTATAACTACACACAGCTGCATGGATAAATGT 

CAGAAACATGACGTTGAAGTGTGAGAAGCCAGATGCAAACCGAGGACTCACTGTGCAATT 

CTGTGCATGTACAGTGGCCAGGAGMGGGAGCACTGGCTTTTGCTTTCATCAGGCCAAAG 

ATGCCTTTCTTTGGGAATACGTTCAAGTCCCAAGAAAGACACCTCCTCGGAAGGTCGGCA 

TCTTTCTCCMCCTGCATTCTTTGGATCGATCAACCCGGGAGGTGGAGCTGGGCTTTGAA 

TACCGATCCCCCGACTATGAACCTGGCAGGGCAAAAGCCTGAAAGTTTGAAAAT 

Sequence 1081 

TGCTCTCTCACCTGTTAGCTGTGTAGCATTGGGCAGGTTACTTAACCTGTCTGTGCTTCA 

TCTGTGAAACAGGMTMCAGCATTATTMGGATTGTTTTAGGATTGGATGGGTTAATAG 

ATGTAAAGTCTTAGAACTATATTGAGCATCCCATCAAGGCATTGTATTATATTGAAACAA 

TGGGGTTTNTTTCTCTTTATNCTTTTTTAMCTATATMTGMCACTTTTGATCTTAAGT 
ATTNCTAA 

Sequence 1082 

CCCGCGTCCGGTGAATGTTAGTATTGGGTGTGGGATGCATCAGGGACACAGGTTTGTAAA 

CCATGACAATTCAATTGTAAAACTAAAGCCNAGTGCCCCCTGTAGTCCCAACTGCTGGGG 

AGGGTCACTTGANCCCTAGGGGGGGAAAATGCAGTGGACTGGATTTTGNGCCCTTTGCAC 

TNCAAACTGGGTGACATAGTGAGGCCCTTGTCTTCTACCANAAAAAAAAAANNANNANNN 

NNANGGTGCCGGGCCGCTTAGAACTAGGTCTTAGAAGAAAAAAACCCTCCCACAACCTTC 

C CCCT GAACCCTGGAAAACATNAAAATGGAATGCCAATTNGTTTGGTTGGTTAACCTTTG 

GTTTTATT TGCM GCTTTATAAATNGGGTTTACCAAAATTAAAGGCCAATTANGCCATTC 

ACCAAAAATTTTCACAAAAATAAAAGGCC AI I I 1 1 1 I I 1 1 CACCTGGCATTrCTTAGTTT 
NNTGGGIvrrrTTGGTCCCAAAACCTCATCNAATGGGTANTCTTAATCATGTCCTNGGGATC 
CCCCCCGGGTTACCCGGAGCCTTCGGAMTTAMTTTCC^CriTTCCGCTTTTCCTTCGC 
TTCACTTGACCTCCGCTTNGNCTCGGGGTCCGTTTCCGGGCTTG 
Sequence 1083 

TCGACCNCGCGTCCGTGGAGGGCCCATCTGCCAGAGCCTGGAGTCTGCGAAGGCCGGGAC 

CCGGTTCCCCGGCCCACAGTGGGGGTGTGCAAACCCGAGAGAACTGGGAAGTGCCGTCAG 

AAGCGATAACTGACGACGTCTAATGTCTATCTGACCGCAGTCGCTGAAACCTCTACAACT 

TAGTTGACCGTAACTGCCAGAGCCCTGCCCTGAATTCCTGTCCTTACTCCCTCTTTAAGA 

TTGCGTACCCACTGCAGAGTGCTGAAGACGGGGTAGCCACCGAGGTTGCAAATTCGTGAA 

GAATCAGCATCATGTTTGGCAGCTGAGTATTGGAGCCAGGAGCCTGCCATGAGGTTTTGA 

GAACAGAGTGCTGTTTTAGAGCTGGCAGCAGCATCTCAGCCCAAGAGAAGGTTATATTCC 

CAGAGGATGTCAGTCCCAAGGACCAAGTAGCTGCCATCAGTTTGGATTCTGAAAACTAAC 

TGGCATCAAACTGGGTGTAGAAACATG 

Sequence 1084 

CGCGTCCGGACTGTCGCTCTAAAAGAATGAAGGAAATAATAAAGTGATAGACAGGGAAGG 

ATAGAAAAGACTTAACAATATACATATGTTCCGTCTTTGCTGTTTTGGAGAATGATGGAT 

AAGTANGTGTTTCCTGATTCTGMGCATAGCTGMCMTTTAATTGTGGTTTACCATCTT 

TTTGGTTCCCTCTTCAGTAATTAACCTATCGAAAATCTGTCCTAAATGTTTGGACTGGGG 

CACAGTTCCCTCCATCGCTTTGGGAGAAAATCATTAATATGGCATACTGCAGATTGGAGG 

GCAGGACCACTGAGGGTGTCATAGACATTAGCTCTATGGAATTCTGCTAGCAATTTCCAA 

GTGACAGTGAGGAATTATGGATATATGTTTGAGGTCATTCAAGCTTCCTGAGTACCACAT 

TCCCCAGCTACTTAGACACCGGGTTAAAATATTAAAGATGTCCTAGTTCAACAGCTTGAA 
TTCCATTGATTGGAT 

Sequence 1085 

GACCCCGCGTCCGGCTTCCTGGGTTCGAAAGMCCCAGTTCAGGAGTTTCTGTTTTAGTT 

TGAGATCTTATAGGCCTGTCTCATCAGGTTGGTGTCAGCCCAGCTAGGATTAGGCAGAAT 

TGGGTGGGGGCTGTAGTGCATCTTTGGCACAGCATGTACCTGTCTGACTAATTCTCTGTC 

TmcmCCTGTTGCMTTCATGGGTCTTAGCATCTTCTGAATGGTGTTTAGTAGGTCA 

TCCTGTTGATTTCCTGCTAGGGAGTAGCATACTCTGGCTCTGTACCATTGGCCAAGGGAC 

TTAAGGATAGGTGAAGGGCTGCAGTTTTGTTAAATGGAACAATATGAAGAGATGGCATTG 
TAAAAAAACTTNTGNCAACTNAA 

Sequence 1086 

T GTCGC CCCGCGTCCGATCAMTCTTGATGMGGATTGTAGATTTTTGCTTTTTCTTTTT 

GTTTTTAAMCTTATTCCMTTGCTAMTTGGTAGTTTTTCAGTCTTTATAAATACAGGA 

TTAAAAATATATATACAGTTATATGAAATGTTTATTTTCTATGTGTGTGCATATAGTTCA 

ATATTATGCAATAMTTTGGTGTTTTAACTTAAAACTATTTCTTATTGTACTTGCAGAAT 

GGATAGCTTGCTTTTAGTAGAAGCATTAGGTCGTATACTCAGATAATCTAATAGAAGGTC 

AGATTTGATTCCTGCATAAGAAAGTAGAGCCCAAGTGCTGCAGAAATGGAGAAGAAAGCA 

GGGGCAAGGGAGCAGATGGCATTAAGGAAGAATGAAGTTTTTGAAGGTTGGGGATGGACG 

AAAAGGGTGTTTCTCATGGAGAGGGGATGCTTTAGCAAAGGCTCAAACATTGGGGCATAT 

TTAGGCAAGAGCCAAGAAACAG7TTGAAGGGGAACATCAGAGGAAATAGGCCAAATTAAT 
AGTAAATN 

Sequence 1087 

GGNGTCGACCNCGCGTCCGGAAATACTCAAATAATGCAACATTACTTCCCAGAAATGAAA 

ATACATTGCATCTCTTATTGAAAGAGCCAACAGAGGCAATGAAAAATAAATGTACACATA 

TGTGCACCCTTAATTCTTTATAATGAACCTTGAGAAATCAAGGCTTTAGAAAAATGCTGA 

GAGACAAGAAGCTTCTGAGCAGCAACACTGGATGCAGGAAAAAAAATGGAGTAATAAGTG 

TAAAGTTCTGAGGGGAAAAAAAATATCAGAACCTAGAATTCTAAATTTAACCAAACTGTC 

ATTAAAGTAAATGAGAAAAGTAAATTCTTTTGAGAAACGCAGAGCCTAGAAAGGATTACT 

AATCGGTAGGATCATCTTTTTrrGTTTGTTTGTTTGGNTTGGGGACAGACTCTCACCTCT 

GTCACCCAGGCCTAGGTTTGCAAGTGAGCCCAGGATTGTGCCACTGCCCTCCAGCCTTGG 



GGGTGACAGGAGCAAGGACTTCATCTCAAAAAAAAAAAA 
Sequence 1088 

TCCCCCTGCCCCCCCAAGCTGAGGGAATAAACTCTGCCTGTCGGGTGCCCCACCAAGGGA 

AATAAACATGGCCTAGCTGCCAAGTCATGCCTGTAGGGTGCTCTCTACTGGCAGTTTCTG 

GGTGGTGCATACATCTAGTCTCCCTAGAAGAGCACAGTCCAGAATTGAGGAGGTACAGCA 

GAAACAGACTGCAGGCAGAGAGAGGTCTCATAGAGCTTGGACAGGGCTAGGCACAGAAAG 

AACAGGCAGCGTATCCAGAAGGGGGCAGGGAATGGGTGAAGAGGTTGTGCTCTAGGGCAG 

AGCTGAGCTCTGATCTAGAAAGGACAGCAAAGATACCTGGAAGGCCTCCCGATTCTTGCG 

TTGTTGGCGTCGCTCCCGAAGCCGGG 

Sequence 1089 

CGNTCCTTGGGTAAAAGCGTCCCAGAGACGGGAAGAAATATGGTATAAGCGAGAAGGCCT 

CATAAATCTGGGCTGTTAAAAATCTAAGTTAAAAATATGTTTTAAGTCAGAAAAAAAAAA 
AAAAAAAAAAA 

Sequence 1090 

CGCGCCTGGTAAAATTATATAAGCTTAAAAAAACAAAAACAAAAACACTTGCTTTGAAAA 

GAGTCTCTCAGCAGCAATTTTGTCCTTGCCCCTACTTCCACAGTTCCTTTTCTTACCATT 

TCACATCTGGATTACTACATTGGCCTCTTTGCTTAGACTCCCMTATTCATTTGCTTTCC' 

TTCA(XCCATTTTATGGAGGACTGTCAGATCMTCTTTTAMGATAMTTTTATAATGTT 

ACTACTGTTGCCTATTGGATTAGAGCCCTAGGGGTGCTTTTTGTAGTCTCACTGACAGCT 

GACATTAGTGATTTTTCACCCTCTTCTTATTGCTACCCTGTGTTGATGGCCAGTTTCCAG 

GTGGGCACCTGCTCCACTTGCTTTCAT 

Sequence 1091 

GGGGTATGTGTGGTTCTTCCAGGAAAGTGCTGAAAATATCACCCAGGCCTCTGCGCCACG 

CCCTGGGAGAGTACACTCCTGGGCTCACGCCTCTGCATTCCAAGGCTGACAGCTAGAAAT 

ATACTTTGTAAAATACCAACAACTTATTCACAAATATTCCAACTATCTACCAGCTCCAAT 

GAGCTTGCTGAGGATGGGTATGACCCCAGTCTAAGGGGAAAGAATCTAAAACACAAGTAA 

ACCTGTTTAAAGGCCAGATCTCCAGATGGAGATCCAAGCAGATGGCGCCTAAGGTTTGCC 

CTTGAAAACTACCAAGGAAGCCACAGAGAGGGATCTTTGGACCTTCTGGAAAATGGTAAG 

GCCCCAGGTAGATTATGGCTCCTCTGCCCTGGAGGCTGAGCCGCCCTCTGGTTACCTCAC 

ATCTTCTGGTTTCTTCTGAGTGGGACTTGATCTCATTTCTGCATTCACAGCAAGGNGGAA 

CTGTCTGGCMGAGCTTAAAATTAGGACCTGNTGGTGGGGACCTTTAATAGCAGGTGGAG 

GG7TTGAGATCCCNTGAGATGCCCNAGATTAATTCAATAGGGGGANGAAAGATTTGGCCC 

AATTCAAAAGNGCTTAAAAAAAAATTTT 

Sequence 1092 

CGCGTCCGGTTCTTTGGTTGAGCTTCTTTGTATCAGTAACAAAAGGAAGCATCATTCAGT 

CTTTCTTTG TMCT TAATGTAAGTCTCTTTTGTACATCCTATTACTTCAAATCATTGAAG 

TGAACTCCATTTTCACATCTGTTGGGAACAATCATCTAGCTTCTTAAATGACTCATCTTA 

AAATATGAATTTTAGACTGCCTAAACATTCTGAGGGAGTACAGTGTGATATAGCAGAAAC 

AACCGGGGCTTAAGAAGGATCAAAATGAAAGGTTTTGTGAAGGATGTGCCAGAGACTGCT 

CTCTGTTCACCAAAACCACATTCTTCTCTTCTTCCTGGCATAGAGCCAGACTAGATTTCC 

MGCTTTCCTTGCAATTGAGCTCTCAGAGTTCTCGTGATTAGAATGTGAATGATAGGCCG 

GGCGCAGTGGCTCATGCCTGAGATCCCAACACTTTNAGAGGCCAAGGTGGGTGGATCACT 

TGAGGTCCAGAGTTCNAGACCAGTCTGGCCAACGTGGTGAA 

Sequence 1093 

CGCCCC GCGTCC GATAAMCTGGATTTGATTTCTTTTTATGAAANGTTTCATATGAATGT 

MCTTGATTTTTTACTATTATAATCTAGATAATATGATATAAGAGGGCTAAGAATTTTTA 

MTTGAATCATATATATGATATAATTTGATCCTTCTTGTATCTTGAAGTTTTGTACTTGG 

GATTTCTGGACTGATAAATGAATCATCACATTCTTCTGGTAAATATTTTCTTGGAGCTCT 

GTGTCAACTTTGATCCTTTGTCTCCCAGGAAGGTGTGACCTCTCCTTTGCCTGCATACCT 

CAAGGCCAGGGGAATATGCCTCAGTGATGCATTTATCTTTGTATATCAGGCCGCATGATT 

CCCAACTTTCTGCCACACTTAAATTACGTTCCTCCATTTCAGTTTTGTCTTTTCTGTCTA 
AAGTTCAGTCAAAGAGTATCAAAAAATTATGTTTCAGCTAGACTGGTGTAATGTATAAGT 

TT7TGTATCTTGTATTAGAGGATTTCGTAGCTTTTATTAGAGG 

Sequence 1094 

GCCCCGCGTCCGATCCCTAGATGACATAACAGCCTTACAAAAGGACAGGGAGGAGTGTCT 

GTTCCTACTCTCACATAGCGGAGGAAAGTTAGAGCCTCTCAGTCTCTGTTTATGAGGACT 

CATTMTCTCAMTMTTGATGCATTTTTCATACATTAGGGTCTCTGTCCATGTGTCTTC 

CTGATATTGTTATAGAAATGGCTTCAGGCTGCTGGTAACAGATGCTGCGGAAAAAGAATG 

CCTTAAACAAAGCCAGGCGCGGTGACTCACGCCTGTGATCCCAGCACTTTGGGAGGCTGG 

GGTGGGAAGGATCACTTGAGCCTAGGAGTTAGACACCTACCCAGCCTGGGCAACACGGTG 

AGACCTCGTCTCTACAAGAAACAAATAATTGGCTAGATGTCGTGGCGCACAAGCTTGTGG 

TCTCGGCTACTTAGGGGGCTGAGGCGGGAGGATTATTTGAGCCTGGGGAGGTCAAAACTG 

CGGTGGGCTGGGATTGCGCTACTGCACTCCGGGCTGGGAGACCGAGTANGACCCTGCCTT 

AAAAAAAAAAAAAAAAAAAA 

Sequence 1095 

AGTCGCCCCGCGTCCGCTCATACCCAGTGAATCTTCAACAGAATCTCTTAAAGATCTCCA 

GGMGTATMGCTCTCATTMTGTTTGAGTTAGAAGAACTTATTCTGGGCCT7TAATTTG 

TTGCATGTGCTGTACTTAMGCATCCCAGATAATTTTAGCTTATATTTTCATAGTGTTTA 

TACAGAGCTTGAATTGGMTGGTCCTTTCCTTCCTTGCCTCAGTACTTCCTTCCATAATC 

TTTCCTGCCATMCCAT TTATTTTG CCACCATTTCTTAAACACTTATGTGGCAGGCATTA 

TGCTAGACTGTMTATGTTTTTTTTAATCCCAGTTGAAGTGGATGTGGGAAGGTATTAGA 

AAGTAGAAGAAAGTATAGTCTAAAATAGAGAGGAAAGAAAGGAAGAGAAAAGTGGGATAT 

TTCAAAACCATTTGCGCAGAGGTAGAATGAAATTCGCCAGAATGGGAATCTCCGTATTTC 
TTTACAAT 

Sequence 1096 

GTNGCCCCGCGTCCGAGTNAACAGTGGTAGTNAAATTCAGGGTTGGTAAGTTTTTCCATA 

GAAGGCCAGATGGTAAATATTGTAGGCTTGCAGACCATGTGGGCTCCACGACTCAACTCT 

GCCACAGTAGTTTGAAAGCAGCCACAAACAGCCTTGGTGTGACTTTGTTCCAGTAAACTT 

TCTTTATAGAATGGGAGAAAATATTTGCAAACAATGCATTCAACAATGGCCTGATGTCCA 

GAATTCATGAGGAACTTAAAAAACTCAACAACAAAAATCACCAAATAACATTTAAAAGTG 

GGCAAAAGATATGAATAGTCATTTTTCAAAAGAAGACATACCGAATGGCCAACAAGCATA 

TGAAAAAATACTCAACATCTCTAGGCTTTCAGAGGCATGCNAANTAAAACCNCCATTNGA 

TATTATNTTACNNGANCCCNAAATGGG I 1 1 1 I I 1 1 A AAAGGCCAAA 

Sequence 1097 

CCCCNCGCGTCCGTTNAAGTGTGCTTTGGAAAAGGGAAAAAGTCCTAAGTAGATATAAAA 

CCCTAACTAAGGAAGAAAGCAGGTAGCAGTGGTGGTCCAAGAGACCGTGTAGTGGATGCA 

AGGACCGCTCGTATTTTACACGCTATATTTCAGCAAAGGGTGGCCCATCTGGCAGGAAGA 

TGGGGACATATGTCACATATAGAGCAGTTAAGGAACTAGGGAAAGTGGAAGACTCAGAAG 

ACCTGTCTTTGACCTGGTATGTTCTATCTCTACAGAACCtAATATGGCTTATACATACTG 

CCACAGAAAGGACTGAGGTAGACAGTGGCAAAGAGTTCCTAGGAGTTGAACCCCTGAAAT 

TACATAAGGAGTAGGACCCCACCAGAATTCTGTCTTTGTAGGCTGCTGACTGCAGAAGAA 

ACGGTGTAGCGGAGGCAGGGGGAAGAGGAGTCAGNANAGTACACTGGGAAGGAAGAAACG 
GGTTCTTTTCTCTT 

Sequence 1098 

TCGCCCCTGGGCCCTCCTAACCAACCAGGGGAGGGGAGAAGGACCCAATTCTTTTCCTTT 

GGTGCACGTAGCCTGGACCCGTTATGGACAGAGGCCAAAGGAAGATAACAGTGTGGTGTC 

CAGAGATGAAACCAAGTGGTTGATGGGCAGTTCTTTGAGCAACCTTGTTTATGAGCCTAT 

TGATATGCAGATATAGAGGCATCCAATACTATTGACTAATTTAAAATCTTATTCAGTGAG 

TCAACACTCTAMTMGCAA TGGAG ATGGTTCCATTCATTTTTTTGCMGTATCATTTTT 

ATAAACATAMTTTCCTGAGATTTTTGTTTTCATCTTAGCCTCTGTGGAGCTGCTTCGTG 

GTTATGATAAGTGCTGTGTGATGCTCACCTTGGGAGGTCTGCGACATATA1TGAAGTCAT 

CTCTAACCTGAAGTACTGACAGACTTTCTGGAAGAAAAGGCTTGTAGGAGGAAACTTCAG 



AATTCTATTAAATGGTGTAAATGATGAAATTATAGTTGATATATGCTAGAGCATCAGTGC 

TGGGTATTTTAGAAGGGATGGA 

Sequence 1099 

CGGGCCTGTTCCTAGAGCCTCATTGGAGACATTGACAATGCCATGAGGACCTTCCTCAAC 

TACTACACTGTATGGAAGCAGTTTGGGGGGCTCCCGGAATTCTACAACATTCCTCAGGGA 

TACACAGTGGAGAAGCGAGAGGGCTACCCACTTCGGCCAGAACTTATTGAAAGCGCAATG 

TACCTCTACCGTGCCACGGGGGATCCCACCCTCCTAGAACTCGGAAGAGATGCTGTGGAA 

TCCATTGAAAAAATCAGCAAGGTGGAGTGCGGATTTGCAACAAAAGATCGCTTTGGCTGC 

TTTGTGAAGAATAGATTGAAAGGGTCAAAGGTGAGAGCCATCTCACATCCATGCAGGAAC 

CAAGCAGGCAAGATATAAATATGAAAGTAGAAGAAMTAGTCTGGAAGAAAATCCC 

Sequence 1100 

CGCGTCCGGGAGTGCACCCCAAGATCTAACAGCTGTTTCAGAGCTGCTCATTTTAGAGTG 

ATTGGTAGGGAGTTGGTGGCTCAGAGGTCCTAATCAGAATGTGTCCTGGGTTCTGAATGA 

CTAGCAGACTATCATTAACCAAATAAATTATGGGATTTTGTCTTAATTATATACATATAC 

ATATACACACACATACACATACACATACATGTGTATATATTCCCTAAAACTTAATAAAGC 

TCAAATAA TAAAA TCAGATTTCTTAAGTATTCCMTTCCCTTTAAAATGTAAATCAGATT 

TTATAATTCTTTTGTTCAAAACTGTCCATTGGCTCCCATTTCACTTAAATCAAAAGCTAG 

ACAATAAGCTAAGATAGCAAACATTATTATCTATTTACTTATGAGTTACTTATGT 

AACTCAAGCATCCAATAACACTGTAGGGTGCTCAATAAAATAGTTGCTGAATGGATAACT 
TTC 

Sequence 1101 

TGTTTTACGACAGAGCTTAGTGCAGCCNGTTCTTGATGGCTGTGCMTGCTTTCCTTTTA 

AGAGTGGAGTTAGCCTCGTCATAAAGCGTGTTTTTGAGTCTGTTCGAACGGGTCAACAAC 

GMGGGAAGTTTCAGGCAGATCTTGTATGCCTGGCCCTGGTGGCTGCTTTCATTTCCTTC 

CAGTATCAGTGCTAAACAGGAATGAACATGTTCAAGCCCCGTCTCACCCACCTCTGGCAT 

CTTCGCGCTAACTCTGCCCTAGAAGACCTTTCCTTCCGTATCGTCAAGAAAACTGAAGTT 

GCTGTTTCACTCCTTCTCCCACCCAGAAACTTCGCTGCATCTTCCTGGATCCCTAGCTCC 

TTGCACCCATGATCCTGTCTCCTTCCTCAGCCCGGCTTCTGGCTGAGCAGCCTGCACTTG 

CTGTCTTCACTCCTACACGCTGCCCCCACTCCTACACGCTGCCCCTGCGTGCTTNTCACT 

TCTCTACCCTTCC 

Sequence 1102 

GTCCGTATCCTATCnTCAMTTTTTTAAATATGTTCAAAATATCTGGAGGGTGAGAAGTT 

ACCMGTTTGT ATGTT TTGTTTGACTCACCATCTTTATTTTCTGTATATGTAGTAGCTGG 

CMTTGCATATATTTTCTTGATTAACATATTAGAGACTGCTTCCATCATCTTATGTAAAC 

CTGGAAACAAGCTGAAACTAGTCTTTTCTGAAGAACCGTGTATCAGTGTTAGATGTGCAT 

CCCGTTTTGTCATTCCCTCAGACTTTGAATACAGTCATTACTCTCTGGAAGAGAAATGTA 

AGTATAi 1 1 I l IGTTATCTGCAGTATGGTTTAACATGTATTAATAATACACATATGCAGA 

CTCACTAAAGTATCCCCAGTAATTAGTAAATTCCAAAT 

Sequence 1103 

ANTTGGGTACCCCCGGCCNGGCCAGNTGCGCGCGGCGGGGCATGCTGCTCGTCCCCCCGC 

GCCCCCGGCCCGGACACTTGGCGGGTGCCACGAGGACCCGCAGCAGCACGTGCGGTCCCC 

CGGCGTTCCTGGGCGTGTTCGGCCGCCGTGCGCGGACCTNNGCGGGAGTTGGGGCNTGGG 

GGGCGGCNGCCGTTGGTNCGGACAGNCNGGGTGCGCACTTGGGCCCCCNTGNCCATGGCN 

GCAAAGGTGGACCTGAGCACCTCCACCGACTGGAAGGAGGCGAAATCCTTTCTGAAGGGC 

CTGAGTGACAAGCAGCGGGAGGAACATTACTTNTGCAAGGACTTTNTCAGGCTGAANAAG 

ATCCCNACATGGAAGGANATGGCGAAAGGGGTGGCTGT 

Sequence 1104 

TGCCNCGCGTCCGAGCATCTCAGGTAACAATTTGAGCATAACTTTAACCATAACTTATGA 
TAGCATAATAACATTCATTAGTAATTCAGTAGCCGTATGTGCCAGGCTGTGTTAGGTGCT 
TTATATATTGTTTAATTTTTAAAAACTTGTGGAGTGTACAGATTGGTAAGGTGACATTGT 
ATCACAMGCTAGTCTTTGAGTCCAMGTTTTGTGGTTTTATGTTATGATATACTTTTAT 
CATGGMTTGTCTTATTAMTGTTTTGCCAGTGGTTCTTAMGTGTGTTTCTGACACCAG 

TAGCATTGACTTCACTTAGAMCCTGTTAGAAATACAAATTATTTGGCCCCACCCAACAC 

TTGAGTCACAMCTT TGCAG ATGGGGCTCAATCTGTTTTMCAAGCGCTTCATGTAATTT 

TGATGCAGGCCTAAGTTTTTGAGCCCTGCAGTATGCATTTCTATTTTTAAGCAAAGATCT 

TGGTCTTTCTTmGGACATO 

NTTTGG7TTGGT 

Sequence 1105 

ACGCGTCCGCTCTGGTCAAGCAGGCGGTACTTCTCCTTGGATGTCTCAGCCACAGTGCCT 

ATCAGGGTACTGAGGGAGAGCACACATGGCCCGAGGCCCTNGGAGCCCTCGGAGGCTGAG 

TCAAMGAGTCTCCCTCGMTTGGTGGGCCTTTAGAAGACTTGGCTTCTTCACTGGAGAG 

CTATAMGTAMCACCACACTGAGGGCCCTCGTCCCAGGMGGCCTTCAGAGCATTTTCA 

TTTCTGAACACGTCCCCTCATCTTTCMGATTTTCTGGTCCTCTAAAGCTGAGAAACTAC 

MGCACTGAMTGAGATGAGTTTTGATAAGGATGGTAATGAAGCACAAAAGCGTTATTCA 

CATTACTCACTGACTTTMTATAATTTTGAMTATTTCATACTm 

CCTGGGCGACAAGANTGAGACTCCATCTCAAAGGTAAAANAAATTTAANCTGGGTGCCNG 

CCGCTTGACTATGTCTAGAGAAAAAAACTTCCACA 

Sequence 1106 

GACCCANAGAAMGGNGCCA MGG GCATGTCMGCAATTGAAGTTAAGCTCATGTTTTTA 

AAGATCCGTTTATTGAGATGATTTTGAMTGCTCCTTTACCATTCATAATTTAAAAATAA 

AGTTTMCAATGGTTTAAATTCANAATGGATTAAAATGGAGTTGGGGGTGGAAAGTAGAG 

CCATTCTTAGTAAATATAAATAACTGAAAAGTTCTTCTGAGGAGACTATGTACCGAAGTT 

ATCATTGCATCTTTCAGTATAGGCAGATCTCTCCCTCATATMCCGGATGTTTCTTGGCX3 

CTTGGMTATCAGATAMGGTAMGTTTMGAAACTTCTCTAGCGGGGGATTTAGGGAAC 

TTCTTAAAACCTAGAGTTAAMGCTGTTGCGTGTTTGTTGTGTTATTTTAGACCAATCAA 

CTTCATAGGCTAGACTAGTCTAGA 

Sequence 1107 

ACGCGTCCGAAAATTCACAGGGTGTGTTGGCACACGCCTGTAATGCCAGCTACTCAGGTG 

GCTGAGGCATAAGAATTGCTTGAGCCTGGGAGGCAGAGATTGCACTGAGCCGAGATCGCG 

CCACTATACTCCAGCCTGGGCAACANACATCCTGTCTCAAATAAATTAAATTACATTAAA 

TGTTTAAGAAGAAGTCTAAATAAGATTCATATGCTGCCCTCCCTCAGATAATGAGGGAAC 

CTGGGGTACTTCTGGGCTACTCTGGGGGACAAAGTATAACTATTCAAATGGCAAGTTGAA 

TTAGTACAGTCTAGGAGCCTTGGAGATGGCTTC7TGAAAGAGGTAGAACCTGAAATTCTC 

CTTCCTTGAGGGACGGNCAGGATTTGGCCAGATGGAAAGGCAAGTGGAAGGCTTTGCAGG 

GACAAGCAATGTAANCAGANCCTAGAAATGG 

Sequence 1108 

TCGACCCCGCGTCCGGNGTMTTCTAGGGGAMTNATATTTCTGAACAACAATGTTGGTT 

TGTGCAGGAAMTCACCAMGMCATGACTAGAMGTGTATAGCTACAGTTTCCCTCTTT 

TAAATGGGAATAGCAAAACATATAAAGAATATTGATAGGCCGGGTGCGGTGGCTCACGCC 

TGTMTCCCAGCACTTTGGGAGGCCGAGGCGGGCAGATCAAGAGGTCAGGAGATCGAGAC 

CATCATGGCTAACACGGTGAAACCCCGTCTCTACTAAAAAATACAAAAAATTAGCCGGGC 

GTGGTGGTGGGCGCCTGTAGTCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATGGCGTGA 

ATCCGGGAGGTGGAGCTTGCAGTGAGCCGAGATCGCGCCACTGCACTCCAGCCTGGGTGA 

CAGAGCGAGAGACTCTGCCTCAAACAAAAAAAAAAGNNAAAANAAAAAA 

Sequence 1109 

CCGTCTCTGGCTTGGCCAGGTTTAAATTAATAAAMTGAAGATGAAAATAAGTTGTCAGA 

TTTAGGATGTATTTTAGAAACCCMCTGATAATTTGCCAACTAATTGGATGCAGAGAGTA 

AGAGGGAGACTCMGMCACCTCTMGATTTTTACCCTGATCAATGGGATAGGTGAAAGT 

ACATTAATGGAGATTGAGAATCCTGGTGGAGGTACAAGTTTAGGGGTACTGAAGAGTGCT 

TTTGGACATGTGAATTCTTAGAAGCCTACTAGATTCTCCAAATGGAGACATAAAACATAA 

TTGMTACAAAAGTCAGGAGTTCAGGAGAGGGCTGAGCTAAAGATACAAATTTGATAGAC 

ATGAGCATTTAAAAAAAACTGCATGAAAATACTAAAGATAGGCTGTCCTGCCTATGGAAT 
AGCCATTCTTTGATCCCTTTACTTTCTTMTAMCTTGGTTTCACCTTACTCTATGGACT 

T CCCC CAMTTCTTTCTTGTGTGAGGTCCAAAAACTCTCTGTTGGGGTCTAGATCAGACC 

CTTTTCAAGTACATCTTNCTGATGAACCACAAANGGATTATACTAAAGAGACCCCCCACC 

Sequence 1110 

CCGGNAATTTGCTTATTCTAATTGAGACACANTGGTGGGGAGTGGGGGTCGGGGACTACA 

CAGGTGCATTTTCTGAACATTTATAAAATGAAAAAGATGGAGGCTTGGCTAGAATGGTTA 

ATCCCCTTTTCATTCCTCTMTTCTATGACMTTTTTTTAAAAMCCAAACACMCCA^ 

ATMGAGTGGACAGTTGAGAATTACCTTTAGGTTCCCATGACCCTGAAGACTGTATTTGG 

CCTTGGATCCATTAAAAAAAAAAAAAAAAAAAAAAA 

Sequence 1111 

CCCACGCGTCCGCX3GCCATTTCTGTATCCCCCTGCCTGGGTTTGCTGCCCTTTATGCTCC 

TACCTCACCAGGTACAAGGAACATGAAGATGGCTATATGCGGCTGCAGCTGGTTCGCTAC 

GAGAGTGTAGAGCTGACACAGCAACTGCTGCGGCAACCACAAGAGGGATCGGGCCTGGGA 

ACGTCGCTGAACGAGAGCAGCCTGCAGGGCATTATTCTAGAAACAGTGCCAGGGGAGCCA 

GGACGTAAGGAAGAGGAAGAGGAGGGCAAGGGTAGCGAAGGGACAGCCCTCTCAGCCTCT 

CAGGACAACCCCAGTTCTGTCATCCACGTGGTGAATCAGACCAATGCCCAAGGCCAGCAA 

GAGATTGTCTACTATGTGCTGTCTGAAGCCCCAGGGGAGCCTCCCCCAGCCCCTGAGCCA 

CCTTCAGGGGGCATCATGGAAAAGCTTCAAGGAATAGCTGAGGAGCCAGAGATCCAGATG 

GTTTGMGGCCGCAGAGCCAGACCATTTCTTCCCAGGTCTGAAAGTTTGAGCCAGGCAAG 

TGGCAGTGCCCCTAGTGGGCAGCCGTTGCCAATGGATGCC 

Sequence 1112 

CCCCCGCGTC CGTAA TTTTAMGMCCTTGTTATTAGAAAATCTCAGCCTAATACAATCT 

GMGTTMGAGTTTTAGCAGCATTGTTTTTCTMGTAGATTTAGCTATAGATT^ 

GGCCAMCAAGGMGAGTATATGCCCTTGTAMTGAGTCTTGTTTTGTTATTTAAATAGT 

CAGTCAAAACGTAGAAATCAGTATACGTAAAATAAMTGCATGAGACTATTAMTCTTTT 

CATATACTCTACAAATAAAATGAAATCTGTGTGTGGTCCTGGTTGACTGGGCATCTAAAG 

GGMTC AGA AAAG AGATTGTGAAMGTTATATATATATCCTCTTCXTTATTTTAGTTTTG 

CTTTTTCCTATTTTCCATAATTAAGTGCCGT7TACAMGTGGCATCAAAAMTTGAAGCA 

GGCCAGGCATGGTGGCTCATGCCTGTGGTCCCAGCAGTTTGGGAGGCTGAGGGCAGGTGG 

ATCACTTGAGATCGGGGGTTCGTGACCAGCCTGGCCAACATGGTGAAAGCCCATCTCTAC 

TGGGAATATAAAAATTAGCCCGGCGTGGTGGCATGTGCCTGTGGTCGCAGCTACTTGGGA 

GGCTAAGACAGGAGAATTGCTTGGGCCCTGGGAGGGGGGAGTTCANNNGNCCTGGANCGN 

CCCCTGNNCTCNANCCCTGGCAACCANTGNNGACACCNNCTTAAAAAAAAAA 

Sequence 1113 

TCGACCCCGCGTCCGGTTTTTTGTCCCAGCAGTGGCATTTAMTTACTGTACTTTAAGAC 

ATGGMTTGCTGGAGGCTTGGAAACTTGAGTGCAATTTCCCTAGTACGACCTCCAAGGAG 

MTAGAGCAAMCAG TGGTA GGAAAMCTCTCAMATTTTACCCMTTGTATGTTTTCTA 

CATTGTCAGTATCTAGTTTTATATAGTTMTATGTACTTCTAAMTTTCTGACAGTG^^IT 

GGTGTATAAAACAGACCAAGCTCAAGATGTAAAGAAGATTGAGAAATTCCACANTCAACT 

AATGCGACTTATGGTAGCCAAGGAAGCCCGCAATGTTACCATGGAAACTGAGTGAATGGT 

TTGAAATGMGACTTTGTCGTGTACTTAGGMGTAMTATCTTTTGAATTAGAGAAAGTG 

TTGGG ACAG A MGTA CTTTATGTMCTMGTGGGCTGTTCAGMGCTTAGAGGTCATTTT 

TTGTAATTTTNTTTTTMTTAC^ 

MGCCTTTATTTACTTTTTGGGAAA 

Sequence 1114 

TCGACCCCGCGTCCGATTCCTTCTTCATATATTATGTCAGAAGAGTTTGAGAAGAAATGG 

TATTMTTCTTCTTTMATGTTAGGTTGACTCACCAGTTMTGCAGCTATTTGGTCATAA 

ATG TTTC TTTGTTAATCACTTTCGATTACTMTTCMTCTGCTAGGTTATAGGTCTATTC 

AGATTTTCTCTTTCTTCTTGAGCCACTTTGGTAGTTTGTGTCTTTCTAGTGATTCGTCCA 

TTTCATCCAGGACAGCTMTTTGTTGTTAGACAGTTGTTCACAGTATACTCCTGTAATCC 
TTTTGTATTTC TGTAM GTTGGTAGTMTGGCTC 

AGTCTTCCATCTTTTGCTCAGTCMTATAGTGAMGGCTTGATCTTTCAAATAATCTATG 
TTTATTCATTCTACTGCTCTCCAMCTTCTATTTTATTGATTTA 
TCTATTATTTCTTTCATACTGCTAGCTTTGGATTAGGCTTATTTTGNC^ 
Sequence 1115 

GCCCCGCGTCCGGGATGACTAATGAAAGCAATNAGCTTGAACATTTAGAAAAAATTCATA 

TATGATCTAAATTTTTATATTATCATTTCTGTGCCTTCTMTTCCTGCATCCTGTTCAAA 

ACATCTTTCCAGACATTMCTTACACATTGTATAAAACCGACCAAAATGATTTCCTAAAG 

TTCATG CAAAAAAAAAAAAAAACMCCTMTTTTCTGTTAATATAAAAG AAACTTCAGTT 

TACTGACCGTGAAACAGACTATGTACTGACATCCAGGGTAAAGTAAAAGACTTTTAAATA 

TTGGTCATTAMGGACAGGAGCTMGCTAGCAMGCAAMCATCTTTAGCACTTTGCAGA 

TCTC MGCAGTTMCCAGGCTCTGATTCCCTTCCACTGTTTTATGAATTAATTCCAGTTC 

TTTTCA TGTATC TTTGMCCTMGATTATGMGTMTTTCCCTATTAGGGACTAGAATGA 

CTTCAGTTTTTTCATTTGATAAAMTCAGMCTGCTACCT 

AMTGTAGATGAGTGCATTMGGGTTGTAAGGATCTTTATCATTTTATGNCATTCATTGA 

AAATTGAAATGTTCATTCTTTTTAATGGTT 

Sequence 1116 

CNGCTTTCGTCTCTTCCTTTNMGTTG 

TGGGACCTCCAGM TMTG TTTCATGMGTAGCATGTATCCATATCTGGTTCTTGACTTT 

TTCATCATTATMTTGTTTTCTATGGGTTACTTATCAGTTTMGMTGCTTMTTCCTAG 

ATGMC TAAGAG TGTTTATTACATGTTGAGATTTATGGTATGCTTTTTCTTCCTCAAGAT 

AATGCATTTTTTGTA TTATCT GTTMTGTGATAGGTTATCCATTTGTGTATTTTCAATCA 

TTGMCAACCC TTGAT TTTTTTGGATAMCTCTATTTGGTCATTATGCATCATTCTATAA 

ACCCTGCTGMTTTTTCATTTGCCAACATCTTATTT 

CMTGAGATTTGTTTTCTTTTGCMTTTGTTTTGMTTTTTGGTAT 

CCTTATAATGGAAAATACATATTTCTCAMCTTTTACACTGATATATTCATAGTATTTrT 

TTATMTTTGAAAMTCTTGTCAGTATCTGTATTMGGCCTNCATTTCAGTTCTGCTATT 

TCATATTGCCTTMGGTGGCTATTTGGCTCTTTMGGACCCCGATTTTGATTTTGTCATT 

TTTAAAATAAACCCCATTTATGCTATAAAAAAAAA 

Sequence 1117 

GCCTTTTATGGTGATGGAATATGTCTCAGGAGGAGAGCTATTTGATTATATCTGTAAGAA 

TGGMGGGTMGCTGTTCTGCTTTMTTCTGTATGTATTTTGTNNCTNGNCCTTTATCCT 

TTACTAGCATCAAMTGTCAGCMCCAATTTTMGAGGTCTATTTAATAACCAGTTCCCT 

TAGTCATATATTTGTTTAGAATCATAAACTATGTAGAACTAGAAAGGATCTTAAAGATTA 

TCTCCTTAGCCTGTTTATACAGATGTGGATACTGAGCCTCGCGGCTTATATGATTGCTCA 

CAGTAACGTGATTTATTAATGACGGAATTGGCTTGAGCCCCCAGAACTCATAATCCTCAG 

ACTTATGCTTCCAGGGTATACAAATACTTTGAATATGTATCTTAATGTAATTAATCGTAC 

CAAATATATTATTATTACT 

Sequence 1118 

GCGTCCGTTGTCATCTATTTACTTTACATATGTCATAAACCTAACACTACATGGTCATTT 

TTGTTTAAACAGTCMTTACCTTTTAMGGGATTTGAATAATAAGTCAAAATCTAATACA 

TTMCTGTGT AGTT AGCATTTCTGGTGCTCTTCTTTCTTTTCTGTAGATCCATACTTCCA 

TCTGGCATTATTTTCCTACTGCCAGMGGACTTCCTTTAACATTTCTTGTAGTGTAGATC 

TGCTGGTGATGMTTCTTTCAGCTTTTGTMTTCT 

GAGTATAGGTTMTAGCTTTTTCCTTTCAGTACTCTAAAGATGTTGCTCCAGGCCAGGCG 

CGGTGGCTCACTCCTGTMTCCCAGCACTTTGGGAGTCTTGAGGTGGGCAGAACACTTGA 

GGTCAGGAGTTTGAGACCAGCCTGGCCAACATGGTGAAACCCCCGTGCTTCTAAACATAT 

TAAAGAAAAAAAAGA 

Sequence 1119 

NCGTGACATGCTGGCTGCTAGTNAGCTCCCCCATGATTGTCAGCTTCCGAGCCCTCACTA 
GAAGCAGATACCACCCCACCACCATGTTTCCTTTAAAGCCTGCAGAACTGNGAACCAATT 
AAMCTCTTTTCTTTATAAATTATCCAGCCTCAAGTATTTATAGCAACACAATAATGGCC 

TAACACAACCTACAACTCTCTATATGTATTTGTGTGTATTTAAAACATGCAGGAAATAAC 

ACAGAATCCAAGGCACCCAAAACTCATTAAAAAATGGAATCAAGAATTCATATGCCATTA 

TGAAATTAGCCAGTCCTAAMTCTGACCTCTCTGCATTTTCACATTATTCTCCTCTCTCT 

ATCCCTGCCTTCCTCCCTCCCTTCCTCCAACTGTCAGAATTGTCCTGTAATCAAACATGT 

TCACATCACAGCTTTrCATTTTCTATTTCCAATCAATTGACCAGTCTAGCCAAGTAGCAT 

CCTGGATCCCGTATTACATATTCCTAGGACAGGAAGCCAGATTTTT 

Sequence 1120 

AGCTCTTTAGCAGGAGACAATTCTTAAACTTAAATTAAACTGAAAAAGCCACAGAAAAAA 

GGGTTTGACACCCTAAAGCCAGTGTCCAAATGAACGCTACGGTTGNCCTCATAGGTGTGT 

TATGAATGTTTACCCTGACCTCCTAGAAGAAAAGGAAAAAGAAAGGAAAGAGAGAGGGAG 

AAGGTGAGGGAAGGGGGGAAGAAAAGAAAGGAGAGAAAAGAAGACAGAGAAAAAACAGGG 

AGGGAGGMGCTGGGGMGGAAAAAGACCATlTGCTGACTCCGTTGTTTTATTTCCCAGA 

ATGATTCAATACCTCMGMGATTTCACTCCAGAAGTGTACAGAGTTTTCCTCAACAACC 

TTTGCC CTCGAC CTGAMTTGATAACATCTTTTCAGAATTGTAAGAGTACACATTTTAAG 

CCATATCTTTTTCAGCTTGCATTGATTCTCAGGTGGCTAGAGCAGGACTTGGAGTGGTAA 

TTGGAGATGGAAGAACATCATACACTGTGTCTAAA 

Sequence 1121 

CGGCCGAGGTACTATAATGGTCCCCATCTTMTTTGAMGGGTTTGAGAATCTTTTAGGA 

CANGCACTGACGAAGGCACTNGAAGACTCCAGCTTCCTGAAAAGAAGTGGCAGGGACAGT 

GGCTACGGTGACATCTGGTGTCCTGAACGTGGAGAATTTCTTGCTCCTCCAAGGCACCAT 

AAGAGAGAAGATTCCTTTGAAAGCTTGGACTCTTTGGGCTCGAGGTCATTGACAAGCTGC 

TCCTCTGATATCACGTTGAGAGGGGGGCGTGAAGGTTTTGAAAGTGACACAGATTCGGAA 

TTTACATTCAAGATGCAGGATTATAATAAAGATGATATGTCCGTATCGAAGGATTTCGGC 

TGTTGAGCCAMGACTGCGTTACCCTTCMTCGTTTTTTACCCAACAAAAGTAGACAGCC 

ATCCTATGTACCTGCCCG 

Sequence 1122 

CCCTTTCGAGCGGCCNTNCGGGCTTNTACGCGGGGGCAGCGGGAAGCTCGCAGCAGCTGG 

GGAGGAGCCAAAGCCTCGGCGCTCACCTAAGCCGCAGGGAGATACACCCAACTGGGAGAT 

GAGGAAACAGCAACCCAGAGAGGAGAACTAACCCACACAGGATCATTTCGCGAAGGAGCA 

AGGCTGAAGAACCAGACCTGGACTTTCTTAGGACAAACTTACTGCAGCTTGAAGGAGCCA 

ACCATGGATTTGAGGCGTGTGAAGGAATATTTCTCCTGGCTCTACTATCAATACCAAATC 

ATTAGCTGCTGTGCTGTTTTAGAGCCCTGGGAGCGATCTATGTTTAACACCATCTTACTA 

ACCATTATTGCTATGGTGGTATACACTGCCTATGTCTTTATTCCAATCCACATTCGCCTG 

GCTTGGGAATTTTTCTCAAAAATAT 

Sequence 1123 

CCCTTTCGAGCGGCCGTNCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACC 

ACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACA 

ACTGCAAAAMTTGCCMMTGCGACTTTCTAAAMTGGAGCAGATTCTGAGGCTTTGCA 

TGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGA 

GAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAG 

AATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAG 

TTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCAT 

ACCACTGGACAGGGTTATGTTAACACCTGAATTGCTGGGTCTTGAGAGAGCCCAAGGAGT 

TCTGGGAGAGGGACCAGATTGGGG 

Sequence 1124 

CCCTTATTTTNGGCNTTNGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTAT 

AGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCA 

AAAAATTGGCAAAATGCNACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCATGTCTT 

GGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGG 

TTTGGAGTGTGGAAGCCTNATCCCTTCANCATCAAGCTGGAATGGGGAATGAAGAATGGA 
TTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTA 

Sequence 1125 

CCCTTANCGTGNTCNCGGCCGAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCAC 

TNTATAGAGGGTGGGAAAATAAACCANAAATCAAGGGAGAAAGAAAAGATGAAAGACAAA 

CTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCAT 

GTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAG 

AGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCGTCAAGCTGGAATGGGGAATGAAGA 

ATAGAGATGTGGTGCCCACTANGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGT 

TGGTATTCAAAATATGTAATGGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCAT 

ACCACTGGACAGGGTTATTGTTAACACCTGAATTG 

Sequence 1126 

CCCTTTCGAG CGGCC GCCCGGGCAGGTA G I I I 1 1 1 1 1 1 1 1 I N I 1 1 1 1 I I NNGGACTTGCG 

ACAGTTCCACCTTTTACTCTCATTGGTAAAATCTCCTTTTMTTATTAAAAATATTGATA 

AATTTATTAAATTMGTCTTTNATTCTTTTGTAATCAGAAGAGGACATTAATGTTGCGTG 

TCTTG ACTGT C I I 1 1 1 1 GCTTTGTAGATTTATTTTGTGCTAAATGAGAACGATATGCATG 

TTTG TGNTTGATTTTTCCAGMGCAGTTACTTTAATTCTTTTTTTAAGNGCTGATTTTGT 

TTTTGC TTTAG GCATTAGTTTCTTCTTCCTTTATAGNTTTTTCAAAATCAATTAATTCCT 

TTATTTGTTTTGAMGAAGTAMmGGGGTMTTTTTTCTTATCACGCCCAATATGAAG 

AGTTAAAAAATTACCMCTGATTGCATTTCTTTACTTMTTTTGCAMTCGATTTTACTT 

CATCAAAAAAAAGMTTT MNAAT TMTTTACCTTGTTCAGGTCTTGGAATTTNCACGCC 

CTCTMGACGAAGAGCCACTTTTACTCTGCGTATCTMTAAAMTTCTTTGGCTTTTTTT 
GCTT 

Sequence 1127 

CCCTTAGCGTGGTCGCGGCCGAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCAC 

TTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAA 

CTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCAT 

GTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAG 

AGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGA 

ATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGT 

TGGTATTCAAAATATGTAATGACTGGTATGGCAA 

Sequence 1128 

CCCTT AGCGTGGT CGCGGCCGAGGTACATGCTANAAAACATTANCACAGATACGACAGAG 

TGTGGi 1 1 1 1 1 1 l l AG AAAT GGGTAATTTCTCTCTCCAGTATCCTTTCACTTGTATGAGA 

TATTTCTCCTCTCCTGTTTTCACAAACCAAGAAATCCCCAGGTAGGCCAATCCCAGAGGT 

GCCATTTAGCAGTATGCAGCAGCCCAGTTTCAGCATAACAAAACATGCCTTGGTAGTGGC 

TCTCTCATGCAAATAAAAGAAAGCTTAAGAAATTCTTGTTGTAGGTGGATTAGGCAAGGC 

TGCCATTCAGCTGGTATAAGCTAAAAGTAAAAAATCAAAACGCTCAAGAAAACGGACACA 

ATTTTGGMTGATTAMGATGTCTTTATAMGTTTTTTTCAAGACTTCATTCTAAATACA 

CAGAATAAAAAATGGGTGTCAGCTCACTTGTAAGACACCAACCAGATTTTCCTTATACTG 

TCTCAAAATTTAAAGATCAATTTCCCCAGAAGGTGTNCAATGCATCATAAAATGGCCCTT 

TTTTGAGGATGGGAGAGGAAGGGTTGGGCAGGATGGAATATTAAATTGTACATGGATAAA 

CATGCCMGACTGTTATCCAATCTAGATAAT7TATATACATTTTGATGACTTAAGGAAAA 

CAAAGCAATCATTTGGTGACAGCCTAAAAAGCNTGACCNTATTTAACATACTTAGGAACT 
TTTTTNGG 

Sequence 1129 

CGTTCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAA 

AAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCA 

AMTGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTC

AGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTG 

GAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGAGATGTGGTGCC 
cactaggctactgctgaaagggagctgaaattcctccaccaagttggtattcaaaatatg

taatgactggtatggcaaaagattggactaagacactggccataccactggacagggtta 

tgttaacacctgaattgctgggtcttgaagagagcccaaggagttctgggaagagggacc 

agattggggggtagggtcacgggcttgggtgataGaattatttctcgaatgactttcttg 

agtgccmtttgmctgtaacatttgcttantcacctttagnggagtaatctcctgggct 

tggttctatatttatataaaag 

Sequence 1130 

CGGGCAGGTACAGCCTCTCGGCCCGGCTAAACATCATCGTCTTGGTAGGCCA7TACCCTA 

CCAACTAACTAATGTTCCGCACCCCCATTTTTAAGTGAAGCTGTGAAGCTCCTTTCTATT 

ACTCATCATGCGATAAATAACTATATCCGGTATTAGCTATTGGTTCCAATAAGTTATCCC 

CAGTCTTAAANGTAGGTTAAGTACCTCNGGCCGGCCACCGGGNTGGAGCTCCAAATTNGC 

CCTATMGTGAGGTCGGATTTACGCCCCGCCTCACTTGGOJCGNNGTTTTACAACCGTCC 

GTNGACTNGGGAAMCCCCTGGGCGTTTACCCCMNCTTTAATCCGCCTTGGCAGCACAA

TCCCCCTTTTCGNCCAGGTTGGCGTNAATAANCGAAAAAGGCCCGGAACCGAATCGGCCC 

NTTTCCNMCANGTTNGCGCCAGTNCTGGAATNGGCNAAATGGGGACCCCNCCCCTTGTT 

AACCGGGGNGCATTTAAACCNCCGGCCGGNTGNTGGTTGGNTTACCCCCCAANNGGTGAC 

CCGNNTTCMCTTTGGCMGGGCCCCTMNGGCCGNTTTCTTTTGNTTTTT

Sequence 1131 

CGANGTACGCGGGGCTTTCCAAACAGCATAAGAAGTGTTTGAGCCNTNAGTATACTGAAG 

GAAGGGCTCCCTCGAGTTCTGGTGTGAAGAGATAAATCACCAGAAAAATGATGGCATCAA 

AACAAAAAAAGAACTCATTGTGAATAAGAAAAAACATCTAGGCCCAGTCGAAGAATATCA 

GCTGCTGCTTCAGGTGACCTATAGAGATTCCAAGGAGAAMGAGATTTGAGAMTTTTCT 

GAAGCTCTTGAAGCCTCCATTATTATGGTCACATGGGCTAATTAGAATTATCAAGAGCAA 

AGGCCACCACAGACTGCAACANGCCTGAATGGAGTCCTGCAGTGTACCTGCCCGGGCGGC 

CGCTCTTAGAACTAGTGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTGAT 

ACCCGGCNGACCTNGAGGGGGGGGCCCCGGTACCCAACCTTTTGT 

Sequence 1132 

AGCTCACTCAAAGGGCGGTAATACGGGTTATCCACAAGAATCAAGGGGATAACCGCAGGA 

AAGAAACATGTGAGCAAAAGGGCCAAGCAAAAGGGCCAGGAACCGTAAAAAGGGCCCGCG

TTGCTGGCNGTTTTTCCATAGGGCTCCCGCCCCCCTGGACCGAAGCATCACAAAAAATCG 

ACGCTCAAAGTTCAGAGGTGGGCGAAAACCCCGGACAAGG 

Sequence 1133 

CCCGGGAACAAAAGCNGCAACCGNGCCCCCCCTCCAGGTCNACGGNNTCGANAAGCNCGA

AAACCGMTTTTTGNAGNTTTNGGGGACCCACTANTTNGNGAGCGGGGGCCGANNNAGNG 
CGGC 

Sequence 1134 

CCCCCCCNCGAGGNCGACGGNANCGANMGCAGATNNTNMTTTTTGCAGCCCGGGGGAT 

CCNGGAGNGNGAGAGCCCACCGNGGGGGNGCGCCAACACCCCCGANAGAGAGNCGAANNA 

CGCGCGCNCACNGGCCGNCGANAAACAACGGCGAGACNGNGAAAACCCGGGCGANACCCA 

ACANAAGCGCCAGGCAGCACANACCCCAANCGCCAGCNGGCGAAACANNnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnGANACCGCGCAGCGNGACCGNAACACNAGC 

CAGCCGCCCNAGCGCCCGCNCCMGCGCNNANCNNCCCANACCAACANCGCCACGNACGG 

CCGGGCAANCCCCGGCAAGCACAAAAAGCGGGGGGGCNCCCTGGAAGGGGNNCNCGAANA 

AAAGGCAAGACGGGNACCNCNGAANCCCCAAAAAACC 

Sequence 1135 

GGGAGTNNGGGAGCGGCCGAGGGCCAAANGANCCGCGAAGACAAGGCCATCCACCACTNN 

ANAGAGGGGGNAAAAACAAACCAGAAACCAAGGGAGAAAGAAAAGANGAAAGACAAACNG 

CAAAAAANNGCCAAAACGCGACNANCAAAAAANGGAGCAGANGCNGAGGCANNGCAAGNC 

ACGGCANACCGNCAGGAGCCGAANGAAAAAAAGCAACAAGCAAGANGAAGACNCTGAGAG 

GGGCNCGGAGGCCNGGAAACCNCATCCCTNCAACAACAAAGCNGGAANGGGGGAAANGAA 
GAANCAGAAGAAGGGGGCGCCCACNAGGCTACNGCNGAAAGGGAACCGAAAAACCCCNCA

CCAANNAGGNATNCAAAAAANNGGAAACGGACCGGGNCANGGCAAAAAAAANGGAACCAA 

GACACCGGGCCCANACCACNGGACCANGGGGNAANGGAAACACCCCGGAAATGGCAGGGN 

CCCCGAANAGAACCCCAAGGGAGCCCAGGGAAGAGGGACCCAAGATGGGGGGGGAAAGGC 

CCACCGGGCCNGGGGNGAAANAACAAAGCCACCGAGGGCCAACCGNAAGNGGCCAANNGA 

ACCCGGAACNANAAGGCNNAANCCACCCTGAAGNGGGAGGAAAACCCCCAGGGCNGGGGG 

CCAAANANNAAAAAAAAGCNGCCCAAAACCCCCAA 

Sequence 1136 

hrTTTAATTTTTTGCAGCCCGGGGGANCCAGGGGNAGGGNGAGCCCACCGCGGGGGAGCGC 

CAANCGCCCNACAGCGAGNCGNAANACGCGCNGCCCACNGGCCCGCCGNANAAACAACGIM 

CGAGACGGGGAAAACCCCGGCGNCACCCAACNGAAACNGCCANGCAGCACAANCCNCAAN 

CGCCAGCGGGCGGAANAGCGAAGAGGCCNCGCACCGAACGCCCANCCCAACAGNGGCGCA 
ACAGAAAGGGCGAAA 

Sequence 1137 

TTTGCANTTTNGGGGACCCACNAANTGGNGAGCGGGCGANGAACCACACNNACGGCCCGA 

GNCCNNGNCNACNGGNCNAGAGNGANAANCACAGGGACACCGANCANCCCTNNCGGCANG 

GACCCACAGCAGCAGGNGGANNGCCATACCGAGANGAAGGAAGACNCAGACANCGCCAAC 

CATNCCCGAGNGGACCNGCCCGGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnACGNNAAACAACCGGCGAGACNGNGAAAA 

CCCNGGCGNNACCCAACNNAANCGCCGNGCAGCACAACCCCCNACGCCAGCNGGCGNAAA 

AAGCGAAGAGGCCCCGCACCGAGCGCCCNTCCCAACAGGGGCGCAGCCCGAANGGCGAAA 

NGGGGACGCGCCCTGGAAGCGGCGCAANAAAGCGCCGGCGGGAGGGGGGGGANACNCCCA 

NNCGGGACCGCNACAACNANGCCAAGCGCCCNAANCGCCCGGCACCCANACGCTANCNAC 

CCNNCCNANCCACGCCCACGGGANCGCCGGCNANCCCCGGCAAGCCNNAAAAAACGGGGG 

GCNCCCCNNAAGGGGNCCCGACANANGAGGCCNNACCGGCACCCCGACCCCCANAAAACC 

AGGAAAAAGGGNGGAAGGGNCNACGGAAGGGGGCCC 

Sequence 1138 

CACGCGTCCGGAAAGGAGGGGAGGGGTGGGGCGGGCCGGATCTGCTCGCAGGCCGCACCC 

GCCTCCGGCTGGATGCTCAGAGTCTCGCCCTGATGGCCAGACTGGAGTGCAGNGGCGTGA 

TCTCGGCTCACTGCATCCTCCACCTCCTAGGTTCAAGTGATTCTTCCACCTCAGCCTCCC 

CTCCCAAGGAGCTGGCATCACAGGACAGGCACGGAGACTCACGCCTGAAATCCTAATACT 

TTGGGAGGCCGCGGCAGGAGGATCACCTGAACCCAGGAGTTTGAGACCAGCCTGCCAACA 

TGATGTTGTTATTCATGAGGACCAATGGGTTGGCGAGACAGTACTACAATCAACATTTAG 

CAGTCAGTTATTAAATCTTGGGAGTTATTCATCTATTCAGCCTGAAGAATATTCCAGTGT 

AGTTAGTGATGTTGTACTTCAAGACTTACTGGCATATGTGTNCTCAAAACATTCC 

Sequence 1139 

TTACCCCGCGTCCGGGTAAACAAAACAAAGATCGTTTGTTCTGGAAACAGGTAAAATGGT 

AATCAAATAGATTGTGTTCCAGGAGTGCAAAGGTGGCTTAATATTCACAAATCAGTTGCT 

ATTGTACACCACCTGTAGAAAAGTAATCTGGCATGCAGAACATTCTTATGGTAAAGTTAA 

TGTTCATTTATGATCTTAGCAAATGATGGATTGAAAGGGACTTCCTTAATTGCATAAACA 

GACTTCAACAAACAGTATGATGAAATAGTGAACATTTCTCCTAAGATTATAAAAATAAGA 

CAAGGATATCTGCTGTCAATGATTTTATTCAGCATTGTTCAGAAGGACCTAACCAGAAAA 

CTAATGCAAGAAACAGAAACAAAAGGCATAAAGATTANAAAAGAAGTAAAATTTTAAAAA 

AGAAAAAGAATATAAATCTCTATTTGCATATGCCATGAGTAAATTTGGTAAGTTCCCTGC

ATAAMGTTATGCAAAAAGCATTTTTTTGATATACCAGCCAAAAATCAAGGGAAATGGAA 
AA 

Sequence 1140 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTG 
AAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGA 
AAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAG 
CAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAAC 
AAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAA

GCTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAG 

CTGAMTTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATT 

GGACTAAGACACTGGCCATACCACTGGACAGGGTTATGTTAACACCTGAATTGCTGGGTC 

TTGAGAGAGCCCAAGGAGTTCTGGGGAGAGGGACCAGATTGGGGG 

Sequence 1141 

ATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAG 

GCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAG 

ATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCT 

GAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGAT 

GAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAAT 

GGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATT 

CCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAA 

Sequence 1142 

ATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAG 

GCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAG 

ATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCT 

GAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGAT 

GAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAAT

GGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATT 

CCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAG 

Sequence 1143

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCACTCTATCCATCGTGGATA 

GAGAAACTGAAGCTCTCTAAAGACCCTGCAGCTGGGAGGTGGCAGAGTCAATGGCAGCCC 

TCAGCCCTATCTGCCCTGACATGGCATTCCTCCCATTTCTCACCACCGAACCCTCTAAAA 

TAACAATGTGTGGGGTCCTTGGCTGAGAGACTTNCCTTTTGGGAATCAATCTGAATGTAT

GATGACAAAGAAAACMCTTTTGCTTTATACAACCTTCTGGTTAGATTCAGGCACCAAGC 

AGGACACTTCTTTGTGGCGCTCCAAGAATCTTTCAAATTCTTCATCACCAATAACAAATC 

TTTCTGCTTCTCTTAGAGCATCTTCTCCACAATTCTCACCCTCAATTAAGAGGCACTGGA 

ACACTTTCCAGCGGACAGGGTTTAGTGCTTTGATCTGTTCCGTCATGTCCTCTTCCACGT 

TGAMCGATTAATGACAGAATTTTTTTTGGAGGCGACTCTATTAATCCCTACACCACCTN 

CTCAGCTTTTGAAGGGTTTNCACATGGGTTCTTTT 

Sequence 1144

GNAGCTCCCCGCGGTGGCGGCCGAGGTACGCCACCATGCCTGGCTMTTTTTGTATTTTT 

AGTGGAGACGGGGTTTCACCATGTTGGCCAGGCTGTTCTCGAACTCCTGACCTCGTGATC 

CACCCACATTGTCCTCCCAAGTGCTGGGATTACAGGCGTGAGCCACTGTGCCATGAGGAT 

TAGTAMGTGCACTCATGGTAAGTAAAAAMTTTGTTTTATGTTNATGCTGATTATATGA 

AGGTCATCATAGCTTAGACACAATCMAACCCATGGGGMCATCTTTAGAATTCATTTTT 

CTCTTTTCTTACAAAAAAAGTAAATAGGTAAAATGGAAAATAGAAGACAACCTATCCTAT 

CCTGGATGAGACACACACATATTTAAATTGAATTATAGACTTAAAATTTAAGTAGGGANIT

' NTGNGNAACAAAAGTTTNCAAAAAACCCAAAACTTTTNAGAATCACCCAGTT 

^GGAAMTATGATTATGAAAGCAGACTTTTTGGATGGGNGCTTAATGACATTTAGGCG 

ACATTTAAAATGCCCCTAGGNGNGGGAACACTTGAAATTGCCANCTAAATTAAAATGACC 

CTmMTTTGCCTGGACMCAAAAMNTTTCCATGATTTTGGCTTTTrTTGGAACAANN 
GANNAAGAAA A I I I 1 1 1 1 1 1 I A G 

Sequence 1145 

CNATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACA 

AGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAA 

AGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATT 

CTGAGGCTTTGCATGTCTTGGCATTCCTTCANGAGCTGAATGAAAAAATGCAACAAGCAG 

ATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGA 
ATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGC
Sequence 1146 

TTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCACGGTACTGTGTCAACAT 

GCAAGGCCAGGTGGCCGTGAGGAATGTCCCAATGACAGGCTCTATCAGTCATGTCCTGGT 

GCAGCTCCCACATGTCTTCCAGAGGATGGAAGCTGAAAACCTAGCTTCAGTGATTGATGC

CAGGTTTAACTTTTTTGTGAACAAGATTTTGCCACAGTATCGTGATGCAGTCATGTCTCA

CACGCTCATCTATATCCCCTCCTACTTTGACTTCGTGCGTCTTCGAAATTACTTCAAGAA

GGAGGAATTGAATTTTACCCACATCTGCGAGTACCT 

Sequence 1147 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCA 

CTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAA 

ACTGCAMAAATTGCCAMATGCGACTTTCTAAAMTGGAGCAGATTCTGAGGCTTTGCA 

TGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGA 

GAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAG 

AATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAG 

TTGGTNTTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCAT 

ACCACTGGACAGGGTTATGTTAACACCTGAATTGCTGGGTCTTGAGGGAGCCCAAGGAGT 

TCTGGGAGAGGGACCAGATTGGGGGGTAGGTCC 

Sequence 1148 

TTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAGTTCT 

GCGCAGCTTCCCGAGGCTCCGCACCAGCCGCGCTTCTGTCCGCCTGCAGGGCATTCCAGA 

MGATGAGGATATTTGCTGTCTTTATATTCATGACCTACTGGCATTTGCTGMCGCATTT 

ACTGTCACGGTTCCCAAGGACCTATATGTGGTAGAGTATGGTAGCAATATGACAATTGAA 

TGCAAATTCCCAGTAGAAAAACAATTAGACCTGGCTGCACTAATTGTCTATTGGGAAATG 

GAGGATAAGMCATTATTCMTTTGTGCATGGAGAGGAAGACCTGAAGGTTCAGCATAGT 

AGCTACAGACAGAGGGCCCGGCTGTTGAAGGACCAGCTCTCCCTGGGAAATGCTGCACTT 

CAGATCACAAGATGTGAAATTGCAGGATGCAGGGGGTGTACCTTGGCCCGCTCTAGAACT 
AGTG 

Sequence 1149 

TGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGA 

AGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAA 

AGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGC 

AGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACA 

AGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAG

CTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGC 

TGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTG 

GACTAAGACACTGGCCATACCACTGGACAGGGTTATGTTAACACCTGAATTGCTGGGTCT 

TGAGGGGAGCCCAAGGAGTTCTGGGAGAGGGACCAGATTGGGGG 

Sequence 1150 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTG 
TGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGA 
NAAAGAAMGATGMNGACAMCTGCAAAAAATTGCCAAMTGCNACTTTCTAAAAATGG 
AGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATTGAAAAAA 
Sequence 1151 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTG 

TGAGGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGA 

GAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCTAAAATGCGACTTTCTAAAAATGG 

AGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCA 

ACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATC 

AAGCTGGAATGGGGAACGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGG 

AGCTGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGA 
Sequence 1152

CCGCGGTGGCGGCCGCAGTTACCTGACGTATGACAACCCAGATATCTTGAAGAGGGTGTC 

AGGATCAAGATCAMTGTGTTCATGAGGTTTAAMCMCTGCCMGGATGGCCTTTTGCT 

GTGGAGGGGAGACAGCCCCATGAGACCCAACAGCGACTTCATTTCCTTGGGCCTTCGGGA 

TGGAGCCCTCGTGTTCAGCTATAACCTGGGCAGTGGTGTGGCATCCATCATGGTGAATGG 

CTCCTTCAACGATGGTCGGTGGCACCGAGTTAAGGCCGTTAGGGATGGCCAGTCAGGAAA 

GATAACCGTGGATGACTACGGAGCCAGAACAGGCAAATCCCCAGGCATGATGCGGCAGCT 

TAACATCAATGGAGCTCTGTATGTGGGTGGAAT 

Sequence 1153 

GCNMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACGMCTAMTTTTTTAAACTTTA 

TTTGCTGTTAMTTCTGTGAAGTTTCAGTTATCTAAAATAAATATACACAAATATGAAAT 

ATMTGTTTCAGATTGCAAGGTAATATGTAATAGTAGTGTTTGTAAGATACTCTTGTCTA

ATATTAACTAGTAGTATTTTGATTTGTACAATGTCACCCTCCCAGCAACAAGAAGAACAA 

GCTACTGMTCAGTGTCCCTTTATTACTATGGCATCAMGATTTGGCTACTGTTTTCTTC 

TACATGCTAGTGGCGATAATTATTCATGCCGTAA7TCAAGAGTATATGTTGGATAAAATT 

MCAGGCGMTGCACTTCTCCAAAACAAAACACAGCAAGTTTAATGAATCTGGTCAGCTT 

AGTGCGTTCTACCTTTTTGCCTGTGTTTGGGGCACA 

Sequence 1154 

GAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCA 

CTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAA 

ACTGCAAAAAATTGCCAAMTGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCA 

TGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGA 

GAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAG

AATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAG 

TTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCAT 

ACCACTGGACAGGGTTATGTTAACACCTGAATTGCTGGGTCTTGAGAGAGCCCAAGGAGT 

TCTGGGAGAGGGACCAGATTGGGGGGT 

Sequence 1155 

TGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAATGTGTGAAGACAAGGCCATCCA 

CCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGA 

CAMCTGCMAAMTTGCCAAMTGCGACTTTCTAAAMTGGAGCAGATTCTGAGGCTTT 

GCATGTCTTGGCATTTCCTTCAGGAGCTGAATGAAAAAATGCANCAAGCANATGAAGACT 

CTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAAT 

GAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCAC 

CAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTMGACACTGG 

CCATACCACTGGACAGGGTTATGTTMCACCTGAATTGCTGGGG 

Sequence 1156 

CCGCGGTGGCGGCCGCCCGGGCAGGTACATTGGCACGTCACGATGTCTTGAGTTTCATTC 

ACTAGGTGGCAGCCTGCATCGTTCCACTGCAAATGACTGAAATCCCAAAACACACAATGA 

GGCTGGCTCAGGTTTGACTCTATCTTGGAAAAAAATAGGAAAACTTCATTTATGGAATAG

TTTTGAATMCCGTGGATATCACAGGTCCATTGACCTGAGCATTTCCATTTTTGGAAACG 

GGTAGAATGTTCCCCAGAGTCAACGAGGCCATGCTGATAATAGTTTCTGGAAGGGATCTC 

TGGMTTGGTCTGACCCMTTAACACACGGCCTCTGATGGGMTAGATGTATTTTGGGGA

CACATTTTAATCTGATAGCTGTAACCCCTTTTGAGTTGGCTTTTGTTCACTGGAATCCCT 

TTCCAGTCAATGAATTTCCGAGAAAAATTCAGAGGAAGAGCTGTCGGAGGCACCAGAGTG 

CTGATGTTTTCT 

Sequence 1157 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGCAGTGG 

GAAGCTCGCAGCAGCTGGGGAGGAGCCAAAGCCTCGGCGCTCACCTAAGCCGCAGGGAGA 

TACACCCAACTGGGAGATGAGGAAACAGCAACCCAGAGAGGAGAACTAACCCACACAGGA 
TCATTTCGTGAAGGAGCAAGGCTGMGMCCAGACCTGGACTTTCTTAGGACAAACTTAC

TGCAGCTTGAAGGAGCCAACCATGGATTTGAGGCGTGTGAAGGAATATTTCTCCTGGCTC 

TACTATCAATACCAAATCATTAGCTGCTGTGCTGTTTTAGAGCCCTGGGAGCGATCTATG 

TTTMCACCATCTTACTAACCATTATTGCTATGGGTGGGTATACACTGCCTATGTCTTTA 

TTCCMTCCACATTCGCCTGGGCTTGGGMTTTTTCTTCAAAAATATGTGGATATCACAG 

GNCCTCGGCCCGCTCTAGAACTAGTGGGATCCCCCCGGGCTTGCAGGGNAT 

Sequence 1158 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGCAGTGGGA 

AGCTCGCAGCAGCTGGGGAGGAGCCAAAGCCTCGGCGCTCACCTAAGCCGCAGGGAGATA 

CACCCAACTGGGAGATGAGGAAACAGCAACCCAGAGAGGAGAACTAACCCACACAGGATC 

ATTTCGTGMGGAGCMGGCTGAAGMCCAGACCTGGACTTTCTTAGGACAAACTTACTG 

CAGCTTGAAGGAGCCAACCATGGATTTGAGGCGTGTGAAGGAATATTTCTCCTGGCTCTA 

CTATCAATACCAAATCATTAGCTGCTGTGCTGTTTTAGAGCCCTGGGAGCXJGATCTATGT 

TTAACACCATCTTACTAACCATTATTGCT 

Sequence 1159 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGCAGTGGGAA 

GCTCGCAGCAGCTGGGGAGGAGCCAAAGCCTCGGCGCTCACCTAAGCCGCAGGGAGATAC 

ACCCAACTGGGAGATGAGGAAACAGCAACCCAGAGAGGAGAACTAACCCACACAGGATCA 

TTTCGTGAAGGAGCAAGGCTGAAGAACCAGACCTGGACTTTCTTAGGACAAACTTACTGC 

AGCTTGAAGGAGCCMCCATGGATTTGAGGCGTGTGAAGGMTATTTCTCCTGGCTCTAC 

TATCMTACCAMTCATTAGCTGCTGTGCTGTTTTAGAGCCCTGGGAGCGATCTATGTTT 

MCACCATCTTACTMCCATTATTGCTATGGGTGGTATACACTGCCTATGTCTTTATTCC 

MTCCACATTCGCCTGGCTTGGGAATTTTTCTCA 

Sequence 1160 

CCGCGGTGGCGGCCGAGGTACCAMTGAAGTGTGAAGACAAGGCCATCCACCACTTTATA 

GAGGGTGTAAAAATAAACCACAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAA 

AAMTTGCCAAAATGGGACTTTCTAAAAATGGAGCANATTCTGAGGCTTTGCATGTCTTG 

GCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGT 

TTGGAGTCTGGAAGCCTCATCCCTTNAGCATCAAGCTGGAATGGGGAATGAAGAATAGAN 

ATGTGGTGCCCA 

Sequence 1161 

ACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCGGAAGGTGGGTGACGTGCGGA 

TCTTCTTCTTTTTGTGGCTGTGGACACCTTTCAACACTGCCTTCTTGGCCTTTAMGCCT 

TCGCTTTGGCTTCAGCTTTAGGAGGGGCAGGAGCCCATCGCAAAACCACGCTGCGGAGAG 

AGGGGCGGGTAATGTAGCCCGGTTGAACATGAACCAGAAGGAAAATGGTTAAAGCTGAGG 

GCACTAATTCCTTACAGGCCCGGGGACATGGAGCTCCAACCAGTGGATGCATGTAGCTTC 

CCAGAACCGAATGTCTGCCCCGCGTACCT 

Sequence 1162

CCGCGGTGGCGGCCGAGGTACCACTCTATCCATCGNGGATAGAGAAACTGAAGCTCTCTA 

AAGACCCTGCANCTGGGAGGTGGCAGAGTCAATGGCAGCCCTCAGCCCTATCTGCCCTGA 

CATGGCATTCCTCCCATTTCTCACCACCGAACCCTCTAAAATAACAATGTGTGGGGTCCT 

TGGCTGAGAGACTTCCCTTTTGGGAATCAATCTGAATGTATGATGACAAAGAAMCAACT 

TTTGCTTTATACMCCTTNTGGTTAGATTCAGGCACCMGCAGGACACTTCTTTGTGGCG 

CTCCAAGMTCTTTCAAATTCTTCATCACCAATMCAMTCTTTCTGCTTCTCTTAGAGC 

ATCTTCTCCACAATTCTCACCCTCAATTAAGAGGCACTGGAACACTTTCCAGCGGACAGG 

GTTTAGT 

Sequence 1163 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGA 
CAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGA 
AAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGA 
TTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGC 
AGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTG

GAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGA 

AATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGAC 

TAAGACACTGGCCATACCACTGGACAGGGTTATGTTAACACCT 

Sequence 1164 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCNGGCACGGTNCCAAATGAAC 

GTGTGAAGACAAGGCCATNCACCACTTTATAGAGGGTGTAAAAATAAACCAGGAAATCAA 

GGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAA 

ATGGAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAA 

TGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAG 

CATCAANCTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAA 

AGGGAGCTGAAATTCCTCCACCAAGTT 

Sequence 1165 

TTAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATCATTTTCTTT^JTCCTGT 

CCATAAATCTTCTTCCACCACGTGGCTGTGTNCAAGACTCTCTGAACCTNCTCTGGCTCA 

GGAGGCTTNTAGATNTGTGAATTGTCTGCTCAGTNNAACTCCATTAAATTNAATNTGGCC 

AAGAANTTTCTTCTTAACAGNGGTATTGATGACCATTAATCCTTCAACCTAAACCTGCTC 
ATTAA 

Sequence 1166 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGAC 
AAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAA 

AAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGAT 

TCTGAGGCTTTGCATGTCTT 

Sequence 1167 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAACAAGCGTTTGT 

AATGTTTCCCAAATATTAGCTTTGAAATCCAAATGTCAAGCAGGTTAAAGTTCANAAAAC 

TATAMTGTMTTGTTAATAAMCMTGGATGAAAAAAGTCATTGAAATTTTTTTCTACT 

TGGATTAAGAACATAAATTAAAAGTGCAACTGCAAAAATAATATTAGTTTGATAAGTAAA 

ATAAAACAMCTAGATATATTTTGAAAATAAAAAACAAATGAAACAAAATAAAATTTAGG 

TAAAGAAMTTCAACGTMTTTGTTGTAGCTATATTTTTTGTAATAATTACAAAAGTAGA 

AATAATAGCTCATAMGCAAAMCMAATTTATTCTATGTTCTTTTTTTCAGTCAATTCA 

GAATTTTAGCTTCATATTTGMGCATTTTTTTCTMTTTTGTTTGNGMTTTGMTTTGT 

TTGCGGATTTTGATTTGCCATAAAMTATTATATTNTATTTAATTATATTAATCTTCGTC 

AGCTTTAATTGCTCTTCTTTAAAATTTGATCTGAAAT 

Sequence 1168 

CCGCGGTGGCGGCCGAGGTACCCTTGTCCTCTTCTTCAGTGACTTAAAACAATTCCAGGA 
TCAGAAGAGAAGCCAACGTGACATCCTCGATAAAACTGGGGATAAGCTGAAGTTCTGTCT 
GTTCACGAAGTGGTTGAAAAACAATTTCGAGATCCAGAAGTCCCTTGATGGGTTCACCAT 
CCAGGTGTTCACAAAAAATCAGAGAATCTTTTTNAGGGGGTGGNCGCNTTAACCCTTTTN 
NGGTTNANTGAAAATCCCCCCCCTGN I 1 1 1 1 II I G AGAGGTTNAAAAA I nTTTTTTNAA 
AAAAAAA ACCCCCCCCCCNGGGNATTNTNAN 1 1 1 1 1 I I I I I NNNNNCCCCNNCAAAAATT 

nrm iaacccccccccnnaaaaaaaaaagggi i i i i i i i i i i i iaaaaaacccncngn 

TCNCCCNNNCCAN AAAAAAAAAAANGGGGGTCCCCN 1 1 I I I I 1 1 I I NTTNNCCCCNGGGN 
I I I I II I I I I 1 1 1 I I NTTCTNNNNNNGGGGGGGGGGAAAAAAAAANAA AI 1 1 1 1 1 1 1 1 1 I 

TGGNGGGGGGGGNNGC NCCC NCCCCCCNCGGGGGGGGNGGNTTAAAAAAAAANGNCCCCC 

CCNGGGGGGGAAANNANTTTTANTNNNNTCCCCCCCCCCCNGGGGGGGGGG 

Sequence 1169 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCA 
CTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAA 
ACTGCAMAMTTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCA 
TGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGA 
GAGGGGTTTGGAGTCTGGAAGCCTCATCNCTTCAGCATCAACTGGAATGG
Sequence 1170 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCA 

CTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAA 

ACTGCAAAAMTTGCCAAMTGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCA 

TGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCANATGAAGACTCTGA 

GAGGGGTTTG 

Sequence 1171 

GNGGCGGCCCGAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATAGAGG 

GTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAA 

TTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCATGTCTTGGCAT 

TCCTTCAGGAGCTGAATGAAAAAATGCAACA 

Sequence 1172 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCATCTAGGTCAG 

TTTAAGAAGAGTCAGCTCAGAGAAAGCAAGCATAAGGGAAAATGTCACGTAAACTAGATC 

AGGGAACAAAATCCTCTCCTTGTGGAAATATCCCATGCAGTTTGTTGATACAACTTAGTA 

TCTTATTGCCTAAAAAAAATTTCTTATCATTGTTTCAAAAAAGCAAAATCATGGAAAATT 

TTTGTTGTCCAGGCAAATAAAAGGTCATTTTMTTTAGCTGCAATTTCAGTGTTCCTCAC 

TAGGTGGCATTTAAATGTCGCCTGATGTCATTAAGCACCATCCAAAAAGTCTGCTTCATA

ATCTATTTTCAAGACTTGGTGATTCTGAMGTTTTGGTTTTTGTGACTTTGTTTCTCAGG

AAAAAAAATATTCCTACTTAAATTTTAAGTCTATAATTCAATTTAAATATGTGTGTGTCT 

CATCCAGGATAGGATAGGTTGTCTTCTATTTTCCATTTTACCTATTTACTTTTTTTGTAA 

GAAAAGGAGAAAAATGAATTTCTAAAGATGGTCCCCATG 

Sequence 1173 

AGGTACCGCTGTGTCCGGGTGGGTGGTCAGAATGCCGTGCTCCAGGTGTTCACAGCTGCT 

TCGTGGAAGACCATGTGCTCCGATGACTGGAAGGGTCACTACGCAAATGTTGCCTGTGCC 

CAACTGGGTTTCCCAAGCTATGTGAGTTCAGATAACCTCAGAGTGAGCTCGCTGGAGGGG 

CAGTTCCGGGAGGAGTTTGTGTCCATCGATCACCTCTTGCCAGATGACAAGGTGACTGCA 

TTACACCACTCAGTATATGTGAGGGAGGGATGTGCCTCTGGCCACGTGGTTACCTTGCAG 

TGCACAGCCTGTGGTCATAGAAGGGGCTACAGCTCACGCATCGTGGGTGGAAACATGTCC 

TTGCTCTCGCAGTGGCCCTGGCAGGCCAGCCTTCAGTTCCAGGGCTACCACCTGTGCGGG 

GGCTCTGTCATCACGCCCCTGTGGATCGTCACTGCTGCACACTGTGTTTATGACTTGTAC 
CTGCCCG 

Sequence 1174 

AGGTACCGCTGTGTCCGGGTGGGTGGTCAGAATGCCGTGCTCNAGGTGTTCACAGCTGCT 

TCGTGGAAGACCATGTGCTCCGATGACTGGAAGGGTCACTACGCAAATGTTGCCTGTGCC 

CAACTGGGTTTCCCAAGCTATGTGAGTTCAGATAACCTCAGAGTGAGCTCGCTGGAGGGG 

CAGTTCCGGGAGGAGTTTGTGTCCATCGATCACCTCTTGCCAGATGACAAGGTGACTGCA 

TTACACCACTCAGTATATGTGAGGGAGGGATGTGCCTCTGGCCACGTGGTTACCTTGCAG 

TGCACAGCCTGTGGTCATAGAAGGGGCTACAGCTCACGCATCGTGGGTGGAAACATGTCC 

TTGCTCTCGCAGTGGCCCTGGCAGGCCAGCCTTCAGTTCCAGGGCTACCACCTGTGCGGG 

GGCTCTGTCATCACGCCCCTGTGGATCGTCACTGChJTGCACACTGTGTTTATGACTTGTA 
CCTGCCCG 

Sequence 1175 

AGGTACCGCTGTGTCCGGGTGGGTGGTCAGAATGCCGTGCTNNAGGTGTTCACAGCTGCT 

TCGTGGAAGACCATGTGCTCCGATGACTGGAAGGGTCACTACGCAAATGTTGCCTGTGCC 

CAACTGGGTTTCCCAAGCTATGTGAGTTCAGATAACCTCAGAGTGAGCTCGCTGGAGGGG 

CAGTTCCGGGAGGAGTTTGTGTCCATCGATCACCTCTTGCCAGATGACAAGGTGACTGCA 

TTACACCACTCAGTATATGTGAGGGAGGGATGTGCCTCTGGCCACGTGGTTACCTTGCAG 

TGCACAGCCTGTGGTCATAGAAGGGGCTACAGCTCACGCATCGTGGGTGGAAACATGTCC 

TTGCTCTCGCAGTGGCCCTGGCAGGCCAGCCTTCAGTTCCAGGGCTACCACCTGTGCGGG 
GGCTCTGTCATCACGCCCCTGTGGATCGTCACTGNTGCACACTGTGTTTATGACTTGTAC

CTGCCCG 

Sequence 1176

CCGGGCAGGTACMCAAGCGTTTGTMTGTTTCCCAAANATTAGCTTTGAAATCCAAATG 

TCMGCAATTAAAGTTCAAAAAACTATAAATGTAATTGTTAATAAAACAATGGATGAAAA 

MGTCATTGAAATTTTTTTCTACTTGGATTMGMCATAAATTAAAAGTGCAACTGCAAA 

AATMTATTAGTTTGATMGTAAMTAAAACAAACTAGATATATTTTGAAAATAAAAAAC 

MATGAAACAAMTAAMTTTAGGTAMGAAAATTCMCGTAATTTGTTGTAGCTATATT

TTTTGTMTMTTACAAMGTAGAAATAATAGCTCATAAAGCAAAAACAAAATTTATTCT

ATGTTCTTTTTTTCAGTCAATTCAGMTTTTAGCTTCATATTTGMGCATTTTTTTCTAA

TTTTGTTTGTGAATTTGAATTTGTTTGCGGATTTNGATTTGC

TATTTMTTATATTAATCTTCGTCAGCTTTAATTGCTCTTTCTTTMCMTTTGATCTGA
AATTTGTTTTGGTGTTATTTCATAGTGATCAAATTGCATTrGATMGTTCCACGACCTGA 
TGTCATAGACCTTAATTGTGTTGAGTATCCAAACATTTC 
Sequence 1177 

TAGGGCGAATTGGACTCCACCGCGGTGGCGGCCGCCCGGGCAGGTACCTACGGAAATCCT 

AACTACCACTGGCAGGAAACTGCATATNTTCTGGTTTACATGAAGANGGAGGGCTAANGG 

MATGCCCAAAACCTTCAGAGATTGACACCGCTGTCATTNTCCATNTCNGTTCCTGGAAT 

CTACCGGGAGTNTTNATAAGAAGANTTTTGCAMTNGAGGGMGAAGCMTTGTTTTCAA 

ACTATATMCTGGAGNCCTTMTTTATMTTAGGGGATATTTAATCAAAAATATNGTAAA 

CCATGGAGGGCCCCCTCAGNGTNCTGGATCAGGTTCAAGAAATNGAAATGNTTTTCACCC 

AAGNCANGACCCCGGCCATGTGGGCATGNTCCGGGTNCCTGGGGGTGGCNTCGNCTGGCT 

TGTGGCGMNGAACAATTMGCCCCTTTTMGTTTATTGAAGCCCTGNGGGGAAACTTTA 

AGGGGGTTTCCANAGTTGGGGGANGAAGCANTNGGNNAGTTGGNGAAGGGCATTTTGGGG 
GGG 

Sequence 1178 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAAGGC 

CATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGAT 

GMAGACAMCTGCAAAAMTTGCCAMATGCGACTTTCTAAAAATGGAGCAGATTCTGA 

GGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGA 

AGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGG 

GGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCC 

TCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAGAC 

ACTGGCCATACCACTGGACAGGGTTATGTTAACACCTGAATTGCT 

Sequence 1179 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCA 

CTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAA 

ACTGCAAAAMTTGCCAAMTGCGACTTTNTAAAAATGGAGCAGATTCTGAGGCTTTGCA 

TGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGA 

GAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAG 

AATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAG 

TTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCAT 

ACCACTGGACAGG 

Sequence 1180 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCTATTGCCAGGMGATAGGCAGCTCATCTGTG 

TCCTGTGTCCAGTCATTGGGGCTCACCAGGGCNNCCAACTCTCCACCCTAGACGAAGCCT 

TTGAAGAATTAAGAAGCAAAGACTCAGGTGGACTGAAGGCCGCTATGATCGAATTGGTGG 

AMGGTTGAAGTTCAAGAGCTCAGACCCTAAAGTAACTCGGGACCAAATGAAGATGTTTA 

TACAGCAGGAATTTAAGAAAGTTCAGAAAGTGATTGCTGATGAGGAGCAGAAGGCCCTTC 

ATCTAGTGGACATCCAAGAGGCAATGACCACAGCTCATGTGACTGAGATACTGGCAGACA 

TCCAATCCCACATGGATAGGTTGATGACTCAAATGGCCCAAGCCAAG 
Sequence 1181

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAAGG 

CCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGA 

TGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCTG 

AGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATG 

AAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATG 

GGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTC

CTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAGA 

CACTGGCCATACCACTGGACAGGGTTTATGTTAACACCTGAATTGCTGGGTCTTGAGAGA 

GCCCAAGGAGTTCTGGGAGAGGGACCCAGATTGGGGGGGTA 

Sequence 1182 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAA 

GACMGGCCATCCACCACTTTATAGAGGGTGTAAAAATAMCCAGAAATCAAGGGAGAAA

GAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCA 

GATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAA 

GCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGC 

TGGAATGGGGAATGAAGAATAGNAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGC 

TGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTG 

GACTAAGACACTGGCCATACCACTGGACAGGGTTATGTTAACACCTGAA 

Sequence 1183 

TCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAA 

GACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAA 

GAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCA 

GATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAA 

GCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGC 

TGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCT 

GAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGG 

ACTAAGACACTGGCCATACCACTGGACAGGGTTTATGTTAACACCTGAATTGCTGGGTCT 

TGAGAGAGCCCAAGGAGTTCTGGGAGAGGGACCAGATTGGGGGGTAGGTCCCGGGCTTGG 

TGATAGAAATATTTCTCGATGACTTTCTTGAGTGCAATTTGNACTGTMCATTTGCTTAA 

TCACCTT 

Sequence 1184 

ATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAGATAGTCATCTCAGTAAAAGGTCTAT 

TATCTMCTTGCCAAACTTGTTTACTGAGAGCCCTAAGGAACTAAAACNGCCATAATGCC 

GTGCACAGNTTGAAMGCMTTAGAGTMGCAAGATTAGTTTTTCCTCCCTTCCAGTTTN 

CTCAAGCAGGCCTGGCTGAAGGCCCAGGAGGGAAGGAAATATAAGAACCAACAATAAAAA 

TAGCAATAGCAATAAGAAGAATGCCATCCCATGGAGCACACCATAATTCTGGAACCACCT 

NTCCCGGATCAGGCTTCCATTGCTCACGATGCTCACGCTGGGCAGCCGCAACTNTACTTT 

GCAGAACCTCACCMCTTGCCCAGGTIvrrTCTCCCCGGTCTTGAAGAAATGGCTCTCCACC 

TGAAMGTNNGATCTTCTCCATACCAGCTTCTTAAGCAAAAGCAATCCTCTCTTTGCTTC 

CTCAAGGGGCA 

Sequence 1185 

TAGGGCGANTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAA 

GGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAA 

GATGMAGACAMCTGCAAAMATTGCCAAMTGCGACTTTCTAAAAATGGAGCAGATTC 

TGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGA 

TGMGACTCTGAGAGGGGTTTGGAGTCTGGMGCCTCATCCNTTCAGCATCAAGCTGGAA 

TGGGGAATGAANAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAAT 

TCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATG 

Sequence 1186

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAAGG 
CCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGA

TGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCTG 

AGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATG 

AAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATG 

GGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTC 

CTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAGA 

CACTGGCCATACCACTGGACAGGGTTATGTTAACACCTGAATTGCTGGGTCTTGAGAGAG 

CCCAAGGGAGTTCTGGGAGAGGGACCAGATTGGGGGGTA 

Sequence 1187 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCA 

CTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAA 

ACTGCAAAAMTTGCCAMATGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCA 

TGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGA 

GAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAG 

AATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAG 

TTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCAT 

ACCACTGGACAGGGTTATGTTAACACCTGAATTGCTGGGTCTTGAGAGAGCCCAAGGAGT 

TCTGGGAGAGGGACCAGATTGGGG 

Sequence 1188 

CCGCGGTGGCGGCCGAGGTACAAGATANTCATCTCAGTAAAAGGTCTATTATCTAACTTG 

CCAAACTTGTTTACTGAGAGCCCTAAGGAACTAAAACTGCCATAATGCCGTGCACAGCTT 

GAAMGCAATTAGAGTMGCMGATTAGTTTTTCCTCCCTTTCNAAGTCCTCAGCAGGCC 

TGGCTGAAGGCCCAGGAGGGAAGGAAATATAAGAACCAACAATAAAAATAGCAATAGCAA 

TAAGAAGAATGCCATCCCATGGAGCACACCATAATTCTGGAACCACCTCTCCCGGATCAG 

GCTTCCATTGCTCACGATGCTCACGCTGGGCAGNCGCAACTCTACTTTGCAGAACCTCAC 

CAACTTGCCCAGGTATTCTCCCCGGT 

Sequence 1189 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAG 

TGTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAMTCAAGG 

GAGGAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAAT 

GGAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATG 

CAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCA 

TCAAGCTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAG 

GGAGCTGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAA 

AGAATTGGACTAAAACACTGGCCATACCCACTGGACAG

Sequence 1190 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATGGCGAAGCTAGA 

GACTGTAACCTGAAGATTGGGACAAATTAAAGAAAAAAATGTGATTTAACACAATTACAA 

AAACTGTTACGTTAGGGTCAMCMGMCCATTTTATGAMCTGAATTACAACAAATGAC 

ATTATATCTMCTCTTCCGGGTCTCCACAACACTTATACTTACTTAAGCAGCTTAAACAC 

TTCCGAGTCTCCACAGCACTCTGATACTTACTTAAACAGCTCTTTTAACCTGCCCTAGTA 

TTCTTAAGTGCAGCATATCTAATTTTTTTTTTCCTCAAGTAGTTTGAA

Sequence 1191 

TAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGCCCGGNCAGGTACCAAATGAACGTGT 

GAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAG 

AAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGA 

GCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAA 

CAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCA 

AGCTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGA 

GCTGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGAT 

TGGACTAAGACACTGGCCATACCACTGGACAGGGTTATGTTAACACCTGAA 
Sequence 1192

CGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGAC

AAGGCCNTCCACCACTTTATAGAGGGTGNAAAAATAAACCAGAAATNAAGGGAGAAAGAA 

AAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCANAT 

TCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCC 

GATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGG 

AATGGGGAATGAAGAATAGAGA 

Sequence 1193 

TGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAAG 

GCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAG 

ATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCT 

GAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGGAAAAATGCAACAAGCAGAT 

GAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAAT 

GGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATT 

CCTCCACX3AAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAG 

ACACTGGCCATACCACTGGACAG 

Sequence 1194 

NGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACATAAATCACCTGGAACCTTG 

TTAAAATGCAGATCCTGACTCAGGAGGTCTGAGTTAGAGCCCAGGATTNCATATTTCTAG 

CCAGCTCCATGATGAGCTGCTGGTCCGCAGATCATGCTTGCNGGTTTTGACCAGAGTCAG 

TGTTGGTTANAGTAAGAGGATGAGGCANACATNTGGGAAAAGTCCAGCTGGGGCAAGCAT 

TTGAAGTCTGCCTTCCTACCANGTCAAAATCAAGGCAACGACCTTCCATAGATAACTATC 

AAAGCTTGAGGGGGNGCCTTGAACCCAACTCCTAAATCCCTAAGACCTGCCCACCTCTTG 

TGTCTCCTGTNTNAGCAAACATTCCCACACTCTTGCATATTGTTAAAAGTAACCTCTGCT 

TACCAGGCTTTTG 

Sequence 1195 

GCGAATTGGAGCTCCCCGCGGTGGCGGCCGGCCCGGGGGGTGTCXJGAACAAGGCAGGTTG 

GTGGGTTAAGGTCTTAATCTTGACTCGAGATCTCTCTCCGGAGTTCACAGNGTNGGCGAC 

GAAGCCGAAGCAGCTGGAGCGCGACCCGGAGGAGTCTGACTTCTCGTTGTCTTCATAATT 

TTCATTCGTTGCTTTCTTCGTGGACTTGCGGCTGGGGGAGGATCCCCGCTGGTCGCCGAG 

CAGGCGGGCGGGTAAAGGTAGGCCGCCGAGAGCGAGGTTAGGAGAGGAGAGGAGGCCGCA 
GTACCT 

Sequence 1196 

ATTGGAGCTCCCCGCGGTGGCGGCCGCCXJGGGCAGGTACGCGGGGAGGAGGCGGAAGCGC 

AGCGGGGGCGGGAAGGTTGTAGTGCCGCGAGTTGAGCTCCTCTTGCCTMGTGGTCGCGC 

CCCCTTTAAGAGCAGCGATTGTAAGGAGAGGCGGTCCCGGTGTCCTCGGGTCCCAGGTGA 

TTGTGAAGTGCTGACCAATTGCCACTGGACATACTTGAAACAAAATAGGAAAATGGCAGC 

AMCCCTGTCTCTAAATCAATCAATCAAGCGAGCCAGAATGCAGTAGTGGCCTGAGAGAG 

GCATCCTGGAACGCAGTGCGGTCTGGCTAGGCTTAGAAGTATTCATGTGATTTTTACCTG 

ACAAGGG 

Sequence 1197 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGA 

AGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAA 

AGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGC 

AGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACA 

AGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAG 

CTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGG

Sequence 1198

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAGAGCG 

AGCTTCGGAGAAGCAGTGGTGGGTTCCATGTGGTGGTGGAGTAGGAGGCAGGTCTCCGCG 

GTGGTTTCCACAAGAAAAATGGCACAATGTTTCTCAGAAGACAATTACATAAGAATCAGC

ATACTTTAAATTCACAGCAAATAATCAGACAATTGATGAAAATACTTACCCAAACACTAA

TTGTAGACTGTGCCTTCTGAATATGTTTTGTCATAAACTTGGAGTAAGGAATCCTCACAG 

GCACTGGACMTTCAAAAMCGTAMGTTTGTTTGTTAGAATACCTGGGTGCTTTTGGAT 

AGAAACCCTCATCCATATCCTGGTAAGGCTTGAAGTTGCACAGGAGTTTTCATTTGTCAA 

AACCCAGAAAACCATAAGCTTTAGATTTGGG 

Sequence 1199 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCA 

CTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAA 

ACTGCAAAAAATTGCCAAAATGCGACTTTNTAAAAATGGAGCAGATTCTGAGGCTTTGCA 

TGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTTTGA 

GAGGGGTTTGGAGTCTGGAACCNTNATCCTTTCACNTTCAACTTGAAATGGGGAATGAA 

Sequence 1200 

AGGGCGAATTGGAGCTCCCXJGCGGTGGCGGCCGAGGTACCAANTGAAGTGTGAAGACAAG 

GCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCATAAATCAAGGGAGAANGAAAAG 

ATGAAAGACAAACTGCAAAAAATTGCCAAAATGCNGACTTTCTAAAAATGGAGCAGANTC 

TGAGGCTTTGCATGTCTTGGCATTCTTCAAGAGCTGAATGAAAAAATGCAACANGCAGAT 

GAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCNTTCAGCATCAAGCTGGAAT 

GGGGAATGAAGAATATAGATGTGGTGCCX5ACTAGGNTNCTGCTGAAAGGGAGNTGAAATT 

CCTCCACCAAGTNGGTATTCAAAATA 

Sequence 1201 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCACGGTACGCGGGGGTAAC 

TGAAAATCCACAAGACAGAATAGCCAGATCTCAGAGGAGCCTGGCTAAGCAAAACCCTGC 

AGAACGGCTGCCTAATTTACAGCAACCATGAGGCCACTTAAGGATGCAGCAAGAAGGAGC 

CATCTGCAATCCAGGAAGAAATTCCTTGCCAGGAACCAAATTGGTTGTCACCTTCATCTA 

GGACTTCTAGCCTCGAGAACTTACAAATGGTGATGATCAT 

Sequence 1202 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGAC 

AAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCATAAANNAAGGGAGAAAGAA 

AAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCANAT 

TCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCA 

NATGAAGACTCTGAGAGGGG7TTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGG 

AATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAA 

ATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAAG 

Sequence 1203 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAATATAAAATAATA 

CATATGAAGCCACAATATTCAATTACAGTTCAAACTAGAAATTACTAACTACATGTAGCT

GCTTCAi 1 1 I iCGTT TTGCA TTTTGGCCCCTAATTCATATTTACAACTGATACCTGCTGA 

GAAAAAGATCCAMCTTTTAMCTTTGTATGTTTTGTGGAGGGTGCACAATTTCTTCTAA 

TATATCTTCAGGTGTTTTAAAATTTMTTTGTTTTTAATCATAAGATATCATCATGGCCA 

AGAGACTGGGAAMTMCMTTTTATTCTTTCTCCTMGATTGNGATTTTATTATTCCAA 

GATCTTATGCTTGAATTACTTAGCAAGAAGGCATGATTATGCANAAGACAGGGAAATGAA 

GAGAAAAGAGCGGGAATATACGAAAATGAAGCTTCCTTAACAGAGTTCATGGTGGAGATG 

GTAGACACTGGTGGAGTTTTTTTCCAGACTTAA 

Sequence 1204 

TCGAGCGGCCGCCCGGGCAGGTACACTCTAAAGAAAGCCATGAGGATGATAATCCACTTT 

GATACTTCCAATCTGCTGGTCTTGCTGAACTCTTTGGATCATGGATATCATAAGTTCGAC

AAAATAI I I 1 1 1 1 IGTAGAAGCACAAATGTGAAGNGTCACTCGTTCTGAGACTTCCTCCT 

CTGTGAMTTCCACAATCTCTTTCTATTTATAGACTTTTCCACAGCAAACATTAGTCTAC

GCAGAGCATTTTGAAAATCATTTGCCAGTTCTAAAGTAGTAATAATAAATACTCCAAGAA 

CTAAAAGTCCCCCTGGTAGCATTCTGGATACCTGGCAGGCATGTTCTGTGGCCCATTC

Sequence 1205

NNGNCCTTTCGAGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCC

ACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAG 

ACAAACTGTAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTT 

TGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACT 

CTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAAT 

GAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAAT 

Sequence 1206 

AGCGTGGTCGCGGCCGAGGTACATAAMCATTATTCCTTCCTTGGCCTAAAAACTCATCG 

CCACCTACATTAAAGCTMTATGCCTGATTACTGTTTTTAGAGMCTTATTTTATTAGGG 

CAGTTCCAAGCTCAAAAATACGCTAACTGGCACCTTGNTAGCTACATAAAAATGCACCCT 

AGACCCGAAACTTACTAGACTCATTATAAAATTTTCTTTAAGGTGTCCACGCAGTCCCTG 

GTCACACTTGAAGCAGTCCGGAGAAATATCAGCCCTACCCCAGTAATCCCCAGAAGGAAC

TTACACI 1 1 1 1 1 1 1 AATCTTTTC CTACAACTTCATATTTTATAAATA 

Sequence 1207 

CCCTTAGCGTGGTCGCGGCCGAGGTACCATCTAGGTCAGTTTAAGAAGAGTCAGCTCAGA 

GAAAGCAAGCATAAGGGAAAATGTCACGTAAACTAGATCAGGGAACAAAATCCTCTCCTT 

GTGGAAATATCCCATGCAGTTTGTTGATACAACTTAGTATCTTATTGCCTAAAAAAAAAA 

1 1 IGI IATCATTGTTTCAAAAAAGCAAAATCATGGAAAATTTTTGTTGTCCAGGCAAATA 

AAAGGTCATTTTMTTTAGCTGCMTTTCAGTGTTCCTCACTAGGTGGCATTTAAATGTC 

GCCTGATGTCATTMGCACCATCCAAAMGTCTGCTTCATAATCTATTTTCAAGACTTGG

TGATTCTGAMGTrrTGGTTTTrGTGACTTTGTTTCTCAGGAAAAAAATATTCCTACTTA

AATnTAAGTCTATAATTCAATTTAAATATGTGTGTGTCTCATCCAGGATAGGATAGGGT 

TGTCTTCTATTTTCCATTTTACCTAT 

Sequence 1208 

CCCTTTCGAGCGGCCCGCCCX3GGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCAC 

CACTTTATAGAGGGTGTAAAAATAAACCAGANATCAAGGGAGAAAGAAAAGATGAAAGAC 

AAACTGCAAAAMTTGCCAAAATGCGACTTTCTAAAAATGGAGCANATTCTGAGGCTTTG 

CATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCANATGAAGACTCT 

GAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGA 

AGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCA 

AGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGG 

Sequence 1209 

AGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATA 

AACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGNCAAAATG 

CGACTTTNTAAAAATGGAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAG 

CTGAATGAAAAAATGCAACAAGCAGATGAAGACTNTGAGAGGGGTTTGGAGTCTGGAAGC 

CTCATCCCTTNAGCATCAAGCTGGAATGGGGAANGAAGAATAGAGATGTGGTGCCCACTA 

GGCTACTGCTGAAAGGGAGCTGAAATNTCCTTCCACCCAAGTTGGTATTTCAAAATATGT 

NATTGACTGGATANGGGCAAAAAGGATTTGGACTAAGACACTGGGC 

Sequence 1210 

GCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAACGGCCATCCACCACTTTATAG 

ACGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAA 

AAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCATAATTCTGAGGCTTTGCATGTCTT 

GGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCNGATGAAGACTCTGAGAGGGG 

TTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGA 

GATGTGGNGCCCACTAG 

Sequence 1211 

CGGAAANTTGGGGGGCCCCCTTNCTTAAGAAAAGGCCATTGGCTTNCCGAAGGCGGGGGC 
CCCGCCCCAAGTTGGTTGGMNTGGGGGATTATTNCTTTGGCCMGAAAANTTTCCGGGG 
GGNGTTTTNAAGGGNGGGGGGGGGGGCCCCCCCGGGCCCCCCAAAAGGGGTNAACCCCCC 
TGGGGGGNAANAGGGGGGGAAANNTTTNNNAAACCGGNNAAAAAACCCCXJCCAGGGGGGG
CCCCCCGGGGGGGGAAAAAANCCCCCNGGGGGGGAAACCCCCCCGG

Sequence 1212

GACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGT 

GAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAG 

MAGAAAAGATGAMGACAMCTGCMAAMTTGCCAAAATGCGACTTTCTAAAAATGGA 

GCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAA 

CAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCA 

AGCTGGAATGGGGAATGAAGAATAGAAGATGGTGGTGCCCACTAGGCTACTGCTGAAAGG 

GAGCTGAAATTCCTCCCCAAGGTTGGGTATTCAAAATATGTAATGACTGGGTATGGCAAA 

AGATTGGACTAAGACACTGGCCATACCACTGGACAGGGTTATGTTAACACCTGAATTGCT 

GGGTCTTGAGAGAGCCCAAAGGAGTTCTGGGAGAGGGACCAGATGGGGGGGTA 

Sequence 1213 

CCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATA 

GAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAA 

AAMTTGCCAAMTGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCATGTCTTG 

GCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGT 

TTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGAG 

ATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTAT 

TCAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAGACACTGGGCATACCACTG 
G 

Sequence 1214 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAG 

TGTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGG 

GAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGTCAAAATGCGACTTTCTAAAAAT 

GGAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATG 

CAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCA 

TCAAGCTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAG 

GGAGCTGAAATTCCTCCACCMGTTGGTATTCfAAATATGTAATGACTGGTATGGCAAAA 

GATTGGACTAAGACACTGGCCATACCACTGGACAGGGTTATGTTAACACCTGAAT 

Sequence 1215 

TACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGA 

CAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGA 

AAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGA 

TTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGC 

AGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTG 

GAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGA 

AATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAA 

Sequence 1216 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTG 

AAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGA 

AAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACCTTCTAAAAATGGAG 

CAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAAC 

AAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAA 

GCTGGAATGGGGAATGAAGAATAGAAGATTGTGGTGCCCACTAGGCTACTGCTGAAAGGG 

AGCTGAAATTCCTCCCCAAGGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGA 

TTGGACTAAGACACTGGCCATACCACTGNCAGGGTTATGTTAACACCTGAATTGCTGGGT 

CTTGAGAGAGCCNAAGGAGTTCTGGGAGAGGGACCAGATGGGGG 

Sequence 1217 

CCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATA 

GAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAA 

AAMTTGCCAMATGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCATGTCTTG

GCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGT

TTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGAG 

ATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAMTTCCTCCCCAAGTTGGTATT 

CAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCATACCACTGG 

CAGGGTTATGTTA 

Sequence 1218 

CCGCdGTGGCGGCCGAGGTACTTCTTACAGTCTTCAGGAAATTCATTAAATCAGTGCCTC 

CAGTTCCTTTGGCTTCCAGTTTTGMGGGTCTTCAGAGGTCTTATTCTCCTTTGGCTG 

GGCTTGCAGGAATCAGGATGTACTGTTCCTGTTGGCCGAGTGGAGACTGGNGTTCTCAAA 

CCCGGNATGGTGGTCACCTTTGCTCCAGTCAACGTTACAACGGAAGTAAAATCTGTCGAA 

ATGCACCATGAAGCTTTGAGTGAAGCTCTTCCTGGGGACAATGTGGGCTTCAATGTCAAG 

AATGTGTCTGTCAAGGATGTTCGTCGGGGCAACNTTGCTGGTGACAGCAAAAATGACCCA 

CCAATGGAAGCAGCTGGCTTCACTGCTCAGGGTGATTATC 

Sequence 1219 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCCTGATGCTACAGACGAGGACATCACCTCACA 

CATGGAAAGCGAGGAGTTGAATGGTGCATACAAGGCCATCCCCGTTGCCCAGGACCTGAA 

CGCGCCTTNTGATTGGGACAGCCGTGGGAAGGACAGTTATGAAACGAGTCAGCTGGATGA 

CCAGAGTGCTGAAACCCACAGCCACAAGCAGTNCAGATTATATAAGCNGGAAAGCTTATT 

GATTANAAGCMTGNGC^^^TTCCGATNTGATTGATNNGTNAAGNAACTTTTTNAAANGTN 

ANCCCTGMTTNCCNNNACCCAATTAMTTTTTNCNANCCCCTTTAAAAMTTTTNCTNG 

GGNTGGGGCCCCCCCNAAAANTTAGGGNANAAAAAATATNAAANCCCCNNAAAATATTTT 

NNNTTNTTTTCTAMAAAAATAAAAMCCCNCCCNTTTTTTGGGGGGGGCAT^ 

GGGAAAAAAMTTCCAATTTTTNCCCTTTTCNT 

Sequence 1220 

CCGCGGTGGCGGCCGAGGTACATTGGCACGTCACGATGTCTTGAGTTTCATTCACTAGGT 

GGCAGCCTGCATCGTTCCACTGCAAATGACTGAAATCCCAAAACACACAATGAGGCTGGC 

TCAGGTTTGACTCTATCTTGGAAAAAAATAGGAAMCTTCATTTATGGAATAGTTTTGAA 

TMCCGTGGATATCACAGGTCCATTGACCTGAGCATTTCCATTTTTGGAAACGGGTAGAA 

TGTTCCCCAGAGTCAACGAGGCCATGCTGATAATAGTTTCTGGAAGGGATCTCTGGAATT 

GGTCTGACCCMTTMCACACGGCCTCTGATGGGMTAGATGTATTTTGGGGACACATTT

TMTCTGATAGCTGTMCCCCTTTTGAGTTGGCTTTTGTTCACTGGAATCCCTTTCCAGT 

CA 

Sequence 1221 

ATAGGGCNAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACA 

AGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAA 

AGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATT 

CTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAG 

ATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGA 

ATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAA 

TTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGGTATGGCAAAAGATTGGACT 

AAGACACTGGCCATACC 

Sequence 1222 

CCGCGGTGGCGGCCGCCCGGGCAGGTAGCAMTGAAGTGTGAAGACAAGGCCATCCACCA 

CTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAA 

ACTGCAMAMTTGCCAAMTGCGACTTTCTAAAMTGGAGCAGATTCTGAGGCTTTGCA 

TGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAGGCAGATGAAGACTCTGA 

GAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAG 

AATGNAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCCCAAGG 

TTGGGTATTCAAAATATGTAATGACTGGGTATGGCAAAAGATTGGGACTAAGACAC 

Sequence 1223 

CGACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAA

TGAANGTGTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAA

TCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCT 

AAAAATGGAGCAGATTCTGAGGCTTTGCATGTCTTGGCNTTCCTTCAGGAGCTGAATGAA 

AAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCT 

TCAGCATCAAGCTGGGAATGGGGG 

Sequence 1224 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCA 

CTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAA 

ACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCA 

TGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGA 

GAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAG 

AATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAG 

TTGGTATTCAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCATAC 

CACTGGACAGGGTTATGTTAACACCTGGAATTGCTGGGTCTTGAGAGAGCCCAANGGAGT 

TCTGGGGAGAGGGACCAGATTGGGG 

Sequence 1225 

CCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATA 

GAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAA 

MMTTGCCAMATGCGACTTTCTAAAMTGGAGCAGATTCTGAGGCTTTGCATGTCTTG 

GCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGT 

TTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAACAGAA 

GATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGGTTGGG 

TATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGGACTAAAGACACTGGCCATAC 

CACTGGACAGGGTTTATGTTAACACCTGAANTGCTGGGGTCTTGAGAGAGCCCAANGAGT 

TTNGGAGAGGGCCAGATGGGGGGGTAG 

Sequence 1226 

AGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATA 

AACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATG 

CGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAG 

CTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGC 

CTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGAAGATGTGGTGCCCACT 

AGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAAT 

GACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCATACCACTGGACAGGGTTATGTT 

AACACCTGAATTGCTGG 

Sequence 1227 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTCCAGGAAGTGAAGTAAAA 

CCTGGTCTTGGTTGATAGGCCCCAGGTTGGCTTGGAGCCATTCCAGGTTGAGAGGCAGGA 

GCCACAGTATAATTAGTAGGCTGAGAAGTTTGGGCAGTGTAAGTTTGTGCAGGATAATTG 

CTCGCCTGGTACTCTTGGAAGTCCACCTCGTTGTCCCTGTTGCTGTCCAAGTTGCTCATC 

AGCTTCTGGAMGCAGCTTCACCTGTCCTTTTCCCCAAGAAGCTGGGCAGCTCCCGGGTC 

AGCAGCTCCTTTAGTTCTGACTTGTTGAGCTTGAACTTGTCACCCTCTTTGCCCGAGTAC 

CTGCCCG 

Sequence 1228 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGA 

AGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAA 

AGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGC 

AGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACA 

AGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCAtCAAG 

CTGGAATGGGGAATGAAGAATAGAAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAG 

CTGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGGTATGGCAAAAGAT 

TGGACTAAGACACTGGCCATACCACTGNCAGGGTTATGTTAACACCTGAATTGCTGGGTC

TTGAGAGAGCCCAAGGAGTTCTGGGAGAGGGACCAAATGGGGGG

Sequence 1229

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTG 

AAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATNAACCAGAAATCAAGGGNGA 

AAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAG 

CAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAAC 

AAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAA 

GCTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAG 

CTGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATT 

GGACT 

Sequence 1230 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGGTCTTC 

TAGTCCGGTAAACAGAGGGCCTGCCCCCGACAGCTTCTGCTTCCGGGTCACGCCTTGACA 

GCGGCTTTCAACCCCCACCTCAGCCCAGCAATTCGTTTGGAGCATGTGAACACCTTGAGC 

CTTGATGAGTTCCAGTIsrTGTGGTATATTATGCAGNGCNTTCAGNGAAMTNCTTTTTNTN 

CGGGNNTTNAANNAAAAANANTNGGGTGCCATGNTNTTNCCCCCNNNNNNTNGGGGGGGG 

GCCCCCCTCAAANGGGGGGGG GGNACT ATNANNNNCCCThfrrTTTTGGGGNNCNANNTNN 

AC NCCNT T TNNTT NGGGCCCChrrTTTTTGGGGGNAAAAAAACCCCCCCCCTNNGQGGGGG 

GTATTTTCNTTTTNNGAAAAAAAAAGGCCCCGGGGNNGACCCCCCCCCNGGTGGGNTTAN 

ANAAAAAAAAAAAANTCNCCCNNTTN I 1 1 1 1 1 1 1 1 1 1 AAAA 

Sequence 1231 

AGGTACGCGGGGCTTTTCCGTGCTACCTGCAGAGGGGTCCATACGGCGTTGTTCTGGATT

CCCGTCGTAACTTAAAGGGAAATTTTCACAATGTCCGGAGCCCTTGATGTCCTGCAAATG 

AAGGAGGAGGATGTCCTTAAGTTCCTTGCAGCAGGAACCCACTTAGGTGGCACCAATCTT 

GACTTCCAGATGGAACAGTACTCTTGGAAGTCCACCTCGTTGTCCCTGTTGCTGTCCAAG 

TTGCTCATCAGCTTCTGGAAAGCAGCTTCATCTGTCCTTTTCCCCAAGAAGCTGGGCAGC 

TCCMDGGGTCAGCAGCTCXTTTAGTTCTGACTTGTTGAGCTTGAACTTGTCACCCTCTTTG 
CCCGAGTACCTGCCCX3 

Sequence 1232 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAAA 

CAAAAAGGAACCAGAGGCCACTTGTATATATAGGTCTCTTCAGCATTTATTGGTGGCAGA 

AGAGGAAGATTTCTGAAGAGTGCAGCTGCCTGAACCGAGCCCTGCCGAACAGCTGAGAAT 

TGCACTGCAACCATGAGTGAGAACAATAAGAATTCCTTGGAGAGCAGCCTACGGCAACTA 
AAATGCCATTTCACC 

Sequence 1233 

GCNATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAANTGAAGTGTGAAGACAAGGCC 

ATCCACCACTTTATAGAGGGNGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATG 

AAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCTGAG 

GCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAA 

GACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTNATCCC7TCAGCATCAAGCTGGAATGGG 

GAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTNCT 

CCCCAAGNTTGGTATTCAAAATATGTAATGACTGNTATTGGCAAAAA 

Sequence 1234 

GCNATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGAC 

AAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAA 

AAGATGAAAGACAAACTGCAAAAAATTGCCAAAAATGCGACnTCTAAAAATGGAGCAGA 

TTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGC 

AGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTG 

GAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGA 

AATTCCTNCCCAAGTTNGGTATTCAAATATGTAATTGCTGGTATGGCAAAAGATTGGACT 

AAGACACTGGCCCTACCACTGGACAGGGGTTATTNTTTAACCCCCTGAATTTGCTTGGGT

CTTTGAGAGAGCCCCMGGGGGTTTTGGGAGAGGGGACCCANAATTGGGGGGTAGGTC

Sequence 1235

TGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAAGGGCCATCCA 

CCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGA 

CAAACTGCAAAAAATTGCCAAAATGCCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTT 

TGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACT 

CTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAAT 

GAAGAATAGAAGATGTGGTGCCCACTAGGCTACTGCTTGAAAGG 

Sequence 1236 

TGGACTCCACCGCGGTGGCGGCCGAGGNACAATACACTAGAAACCAACATAATGTATTTT

TTTTAAAACCTGTGNGAAAAAATAAATGTTCCACNAGTAGGGATAGGGGAAAAGNAACCA 

AAAGAGAGAAAGAGAAAGGAATGCTGGTTNATCTTTGTANGTNGTAATCGAATGGAGAAA

TTTGCAGTATTTTANCCACTATTAGNGAAA I I I I I 1 1 I 1 1 I 1 1 1 GTCAAAATGAN AG ACT 

GGAACCTCTGTTCMNATGCmNATTGNMCTCTGGTTTTGAAGACCGGGNNNGGNAAA 

GCAANNAAAAACGTNGGGAAACCCTNNGATGGACNTAAAGGGGCNNTGGNNGCCAAAGGG 

ACCTTGGGGGAAMNGGTCCACTTTGMTANANAAGCATGGGGNGGGGNGNATTTTTCCC

CCCCCTTTTAAAAAANATGGTNTGGMTAATTTTAAANNGGNATATTAAMACCACCTTT 

NTT 

Sequence 1237 

GGCNAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGTGATATCCACATAT 

TTTCGATAAAAATTCCAAAGCCAGGCGAATGTGGATTGGAATAAAGACATAGGCAGTGTA 

TACCACCATAGCAATAATGGTTAGTAAGATGGTGTTAAACATAGATCGCTCCCAGGGCTC 

TAAAACAGCACAGCAGCTAATGATTTGGTATTGATAGTAGAGCCAGGAGAAATATTCCTT 

CACACGCCTCAAATCCATGGTTGGCTCCTTCAAGCTGCAGTAAGTTTGTCCTAAGAAAGT 

CCAGGTCTGGTTCTTCAGCCTTGCTCCTTCGCGAMTGATCCTGTGTGGGTTAGTTCTCC 

TCTCTGGGTTGCTGTTTCCTCATCTCCCAGTTGGGTGTATCTCCCTGCGGCTTAGGTGAG 

CGCCGAGGCTTTGGC 

Sequence 1238 

AGGGCNAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGA 

AGACNNTGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGA 

AAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAG 

CAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAAC 

AAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAA 

GCTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAG 

CTGAAATTCCTCCACCAAGTTGGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGAT 
T 

Sequence 1239 

AGCTCCACCGCGGTGGCGGCCGAGGTACGCGGGGGGCAGAAGAGGAAGATTTTTGAAGAG 

TGCAGCTGCCTGAACCGAGCCCTGCCGAACAGCTGAGAATTGCACTGCAACCATGAGTGA 

GAACAATAAGAATTCCTTGGAGAGCAGCCTACGGGNAACTAAAATGCCATTTCACCTGGA 

ACTTGATGGANGGGAGAAAACTCCTTGGMTGATTTTGMGACAAMGTTATTTTTTACC 

CGGCACTGAAGATTTNCAGCAATCCGTTGGAATNTCAAAAGGCCCACCAAANGGTGCCAA 

CNCCTACATGNGCNCTTATCNTAAANAGGCACCCCTCCAANAGGGGGNCNAATAAACGNA 

GGGNCNAAGNNNCCCTGGGNAAATTGGCCTTTACNGGNTAAAAAAGGCCTTGGANNGGAG 

GTTTTAMNTNCCTACGGCCAAAGGNAAGCCATTGGCNTGNACCCCAAGGGCCAAGGAAA 

AATCCANNTAAAAAGNTTCTTGGGNGTCCACCCCNTGGGG 

Sequence 1240 

AGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAAGG 
CCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGA 
TGAMGACMACTGCMAAMTTGCCAAMTGCGACTTTCTAAAAATGGAGCAGATTCTG 
AGGCTTTGCATGTCTTGGCATTTCCTTTCAGGGAGCCTGAATTGAAAAAAA

Sequence 1241

AGCTCCCCGCGGTGGCGGCCGCCCGGGCNGGTACGCGGGGGAGACATTCCTCAATTGCTT 

AGACATATTCTGAGCCTACAGCAGAGGAACCTCCAGTCTCAGCACCATGAATCAAACTGC 

CATTCTGAT7TGCTGCCTTATCTTTCTGACTCTAAGTGGCATTCAAGGAGTACGGGAAGG 

CGMGAAMGMTAGAGAAGATAGGGAMTTAGAAGATAAAAACATACTTTTAGAAGAAA 

AAAGATAAATTTAAACCTGAAAAGTAGGAAGCAGAAGAAAAAAGACAAGCTAGGAAACAA 

AAAAGCTTAAGGGGCAAAAATTGTACCTTCGGCCCGCTCTAGAACTAGTGGGATCCCCCG 

GGCTGCAG 

Sequence 1242 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTG 

AAGACATGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGA 

AAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCCGACTTTCTAAAAATGGA 

GCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAA 

CAAG 

Sequence 1243 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGT 

GTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGG 

AGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATG 

GAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGC 

AACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCAT 

CAAGCTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTANGCTACTGCTGAAAGG 

Sequence 1244 

CCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATA 

GAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAA 

AAMTTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCATGTCCTG 

GCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGT 

TTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGAG 

ATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTAT 

TCAAAATATGTAATGACTGGTATGGCAAAA 

Sequence 1245 

TGGTACTGCTAMGTCATGACAGCCCAACAGGTGATGTTTTACTGGATGAAACTCTGAAA 

CACATCAAAGCAACTGAACCCACAGAAACTGTCCAAACATGGATAGAGCTACTCACTGGT 

GAGACCTGGAACCCCTTCAAATTACAGTACTGTTCCTGTTGGCCGAGTGGAGACTGGTGT 

TCTCAMCCCGGTATGGNGGTCACCTTTGCTCCAGTCAACGTTACAACGGAAGTAAAATC 

TGTCGAAATGCACCATGAAGCTTTGAGTGAAGCTCTTCCTGGGGACAATGTGGGCTTCAA 

TGACAAGAATGTGTCTGTCAAGGATGTTCCGTCGTGGCAACCGTTNCTGGTGACAGCAAA 

AAATGACCCCACCAA 

Sequence 1246 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGT 

GTGAAGACAAGGCCATCCACCAGTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGG 

AGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATG 

GAGCAGATTCTGAGGCTTTGCATGTCTTG 

Sequence 1247 

AGGGCNAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGA 

AGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAA 

AGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGC 

AGATTCTGAGGCTTTGQATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACA 

AGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAG 

CTGGAATGGGGAATGAAGAATAGNAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAG 

CTGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATT

GGACTAAGACACTGGCCATACCACTGGACAGGGTTTATTGTTAACACCTGAATTGCTGGG
GTC 

Sequence 1248 

GAGCTCCCCGCGGTGGCGGCCGNGGTACATAAMCATTATTCCTTCCTTGGGCTAAAAAC

TTTTTGCCACCTACATTAMGCTMTATGCCTGNTTACTGTTTTTAGAGMCTTATT^ 

TTAGGGCAGTTCCMGCTCAAAMTACGCTAACTGGCACCTTGTTAGCTACATAAAAATG 

CACCCTAGACCCGAMCTTACTAGACTCATTATAAMTTTTimTAAGGTGTCCACGCAG 

NCCCTGGTCACACTTGAAGCAGTCCGGAGAAATATNAGCCCTACCCCAGTAATCCCCAGA

AGGAACTTACACI I 1 1 I I I I MTCTTTTCCTACAACTNCATATTTTATAAATAAAAAGAC 

ANAAATGTCAGGCCTGTGAGCTGAAGCTTAGCCAT 

Sequence 1249 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAGTGG 

GAAGCTCGCAGCAGCTGGGGAGGAGCCAAAGCCTCGGCGCTCACCTAAGCCGCAGGGAGA 

TACACCCAACTGGGAGATGAGGAAACAGCAACCCAGAGAGGAGAACTAACCCACACAGGA 

TCATTTCGTGAAGGAGCAAGGCTGAAGAACCAGACCTGGACTTTCTTAGGACAAACTTAC 

TGCAGCTTGAAGGAGCCAACCATGGATTTGAGGCGTGTGMGGAATATTTCTCCTGGCTC 

TACTATCAATACCAMTCATTAGCTGCTGTGCTGTTTTAGAGCCCTGGGAGCCGATCTAT 

GTTTAACGCCATCTTACTMCCATTATTGCTATGGTGGTATACACTGCCTATGTNTTTAT 

TCCAATCCACATTCGCCTGGCTTGGGAATTTT 

Sequence 1250 

GGCNAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAAGGC 

CATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGAT 

GAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCTGA 

GGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGA 

AGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGG 

GGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCC 

TCCACCAAGTTGGTATTCAAAATNTGTAATGACTGGTATTGGCAAAA 

Sequence 1251 

CTNCTATAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAA 

GACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAA 

GMAAGATGAMGACAMCTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCA 

GATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAA 

GCAGATGAAGACTCTGAGAGGGGTTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAG 

CTGGAATGGGGAATGAAGAATGGAAGAATNGTGGTGCCCACTAGGCTACTGCTGAAAGGG 

GAGCTTGAAATTCCTCCACCAAGGTTTGGTATTC 

Sequence 1252 

TACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAATCTTGAGAAGGATTT 

GAAGGACAAGTTTGTGGCCCTGACCATAGATGATATCTGCTTCTCGCTCAACGACAACTC 

ACCAAACATCAGATATTCTGAGAACGCCGTGAGGATTGAGCCAAACTCCGTGAGTCTGGA 

AGACTGGTTGGACTTCTCCAGCACCAATGTGGAGAAGGCTGACAAGCAGCGGAACAACTC 

CCTGATGCTGAAAGCCCTGGTGGATCGAATCCTGTCCCAGACAGCCAATGGATCTGTGCA 

AGCCAGTGTGATTGTGGTGGACACCGGCATTCAAGAATGGGCCTGAAGGGATCAAAGGGA 

TGCCAGGGACAAGCTGGGCTTGATCATCTGGCCCAAGGTATTNGGAAAGAGATTGCTTCC 

CAGGGAAGAAAA 

Sequence 1253 

ACTNAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGAC 

AAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAA 

AAGATGAAAGACAAACTGCAAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGA 

TTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGC 

AGATGAAGACTCTGAGAGGGGTTTNGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCT 

GGAATGGGGAATGAAGAANGGNAGATTGGTGGTGCCCACTAGGCTACTGCTGAAAGGGAG

CTGAAATTCCTCCCCCAAGGTTGGTATTCAAAAATATTGTAATGAACTGGGTATTGGCM

Sequence 1254

CCGCGGTGGCGGCCGNGGTACAATGATTGTCATCTCAGTAAAAGGTCTATTATCTAACTT 

GCCAAACTTGTTTACTGAGAGCCCTAAGGAACTAAAACTGCCATMTGCCGTGCACAGCT 

TGAAMGCMTTAGAGTMGCAAGATTAGTTTTTCCTCCCTTCCAGTTCCTCAGCAGGCC 

TGGCTGAAGGCCCAGGAGGGAAGGAAATATAAGAACCAACAATAAAAATAGCAATAGCAA 

TAAGAAGAATGCCATCCCATGGAGCACACCATAATTCTGGAACCACCTCTCCCGGATCAG 

GCTTCCATTGCTCACGATGCTCACGCTGGGCAGCCGCAACTCTACTTTGCAGAACCTCAC 

CAACTTGCCCAGGTATTCTCCCCGGTCTTGA 

Sequence 1255 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCATGGTACGCGGGGG 

GGTGGAGAGAGGCCTCTAGACTTCAGTTTCAGTTTCCTGGCTCTGGGCAGCAGCAAGAAT 

TCCTCTGCCTCCCATCCTACCATTCACTGTCTTGCCGGCAGCCAGCTGAGAGCAATGGGA 

AATGGGGAGTCCCAGCTGTCCTCGGTGCCTGCTCANAAGCTGGGTTGGTTTATCCAGGAA 

TACCTGAAGCCCTACGAAGAATGTCAGACACTGATCGACGAGATGGTGAACACCATCTGG 

GACGTCCTGCAGGAACCCGAACAGTTCCCCCTGGNGCANGGAGTGGCCATAGGNGGCTCC 

TATGGACGGAAAAC 

Sequence 1256 

TGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAANGGCCATCCA 

CCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGA 

CAAACTGCAAAAAATTGCCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTT 

TGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACT 

CTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAAT 

GAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCT 

Sequence 1257 

CTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACA 

AGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAA/.GAAA 

AGATGAAAGACAAACTGCAAAAAATTGCCCAAAATGCGACTTTCTAAAAATGGAGCAGAT 

TCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCA 

GATGAAGACTCTGAGAGGGGTTTGGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTG 

GAATGGGGAATGAAGAATAGAAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTG 
AAA 

Sequence 1258 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGT 

GTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGG 

AGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATG 

GAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGC 

AACAAGCAGATGMGACTCTGAGAGGGGTTTTGGAGTCTGGAAGCCTCATCCCTTCAGCA 

TCAAGCTGGAATGGGGAATGAAGAATAGAAGATTGTGGTGCCCACTAGGCTACTGCTGAA 

AGGGAGCCTGAAATTCCTCCCCAAGGTTGGGTATTCAAAAATATGTAATGACTTGGTATG 

GCAAAAAGATTGGGACTAAAG 

Sequence 1259 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCTTTGTTT 

TGGCACACTTTTCCTGACAAACAGCCAGTGTTCTCAATACATAAATACTAGTCCACGTTA 

ACAACAATAGCATATGAGACCGCTCTCCGTAAAGATGCCAGATTGGATGCAAATGGACTG 

GAAATACCTTGGAGGGTTTCACAAAAATAAGACAAAGGGCAAAGGAACTTTGCCAAAGGA 

GATGGAGAGCAATTCTTTAAAGATAGTGGGAGGGAGGAAGCAAAGAGCTCATAAATACAA 

GCCTCTTAAAATGGGACGCATTTGCCTCGCGCCTCTGGGGTGTCTGCAGCTCAGCNTTGG 

TGCCCCACACGGGACACCCGACTTTT 

Sequence 1260

TAGGGCNAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCTTCAATGTCATT
AACCATAAATGAGCATTTACAATCTGGATTAAATGTCACATGGTATTAAGTCTACACTTA
GAGTAATGCTTTTACTGATTTTTAAAAATATATGCATATGTTTAGTGATCGAGAAAAGTG
AAATACTGGAGTAC 1 1 1 I I I 1 1 1 I 1 1 I I I I 1 1 1 1 I GGCTTGATGAGTAGGTGAGTTTATT 

GGGACTTACACACAGGTCAATCCTGGGCGGCGACAAGACAGCTCTAGAGATCTGAGCTTC 
CTCCCAATGCTAMCTGCTTTCATGCTAATTTTCTGACTGTTTACTTACCCGGGGTAAGA 
GCGATGGGGACTGTTTTCATTGG

Sequence 1261

TNCTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGT 

GTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGG 

AGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATG 

GAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGC 

AACAAGCAGATGAAGACTCTGAGAGGGGTTTTGGAGTCTGGAAGCCTCATCCCTTCAGCA 

TCAAGCTGGAATGGGGAATGAAGAATAGGAGATGTGGTGCCCACTAGGCTACTGCTGAAA 

GGGAGCTGAAATTCCTCCACCAAGGTTGGGTATTCAAAATATGTAATGACTGGTATGGCA 
AAAGATTGGG 

Sequence 1262 

TGGCGGCCGCCGGCAGGTACGCGGNGCAGAAGAGGAAGATTTCTGAAGAGTGCAGCTGCX) 

TGAACCGAGCCCTGCCGAACAGCTGAGAATTGCACTGCAACCATGAGNGAGAACAATAAG 

AATTCCTTGGAGAGCAGCTACGGCAACTAAAATGCTTTCACCTGGAACTTGATGGGAGGG

AGAAMCTCCTTGGATGATTTTGAAGACAMGTATTTTACCGGACTGAGTTTCAGAATCG 

TGAATTCAAAGCCACAATGTGCAACCTACTGCCTATCTAAAGCACCTCAAAGGGCAAAAC 

GAGGCAGCCCTGGAATGCTTACGTAAAGCTGAAGAGTTAATCCAGCANGAGCATGCTGAC 

AAGGCAGAAATCAAAAGTCTGGTCACCTGGGGAAA 

Sequence 1263 

CGAGGTACCCAGGCCTGCAAATCTCCTGGCAGGATGGTCAGGAACTGCTCTAGCACCAGC 

AGTTCCAGAATCTGTTCCTTGGTATGGATCTCTGGCTTCAGCCACTGACAGCAGAGCTCC 

CGGAGGCGGCTCAGTGCCTCGCGAGGTCCAGGAGAATCCTGGTAGCAGAACTGCCTAAAA 

AGCTGCCTGCAGAGCTCCTGCTTAAGGAGTTCACTTCTCTGTAAGCAAGTGTCCTGCCCA 

TGGATAAATTCTTCCTCTTCTATCTTCACTATCAGAAGTCCTTCATCCTCTGGAGAGTTC 

TGGGCTGCAGCTTTCTTTGGTTCTGTAGCCATCTCTCGGACAGGGCTGATTCCGATCGGA 

CACTTCCGGTGGAAGGACTGAGCGGCGCTACACTTCAAGAATTCCGTCCACAGGGACTTG 

TGAGTCTGCGCAGAAGGCGGGATGCCTTTGGACTACGATTCCAAGAATCCTTCXJTGGGTC 

TCTTCGGGCGCAGACTTTTCGCCAAAGTCCTGAAGATCTCAGGGCTTGAAGGAGGGGGCA 

TCCTTCTTCTTCATTGNAGTAGTGTGTCTTGCTAAATAACAGAAGGGACTCCTGAAAAGA 

AAATGACGTTGGCCCGGGCGCGGGGGCTTACGCCTGCAAGTCCACACTTTGGGAGGCCGA 

ACNGGCNGATCACGAAGGTCAGGAGATCGANGACTATCTGGGTTACGCGGNGACACCCTG 

NGTTTCTTAAAATCCANAAAAAAAAAAAAAAAA 

Sequence 1264 

GGCGAATTGGACTCCACCGCGGTGGCGGCCCGAGGTACAGAGATTTATAATGTGCTGCTC 

TAGGTCCTATCGGGTAAAGGGATCAGCAGATGTGAAGTCAAGAGTCTCCTGTAAGATTTG 

^HZZZ^Z"'"^ ^ AAAC ATATTTTAATCCT G GGCX^CCTNTTTCAAATCACCT ATTTCTTTTA 

Gum iGCAGTGATACTGTGTGTTGCTTCTAACAGAGGTTCAGTTTCACAGCCTTTCCC 

TCAAGTGTCTTATCCTAAAAGTAAAACCTAGATGATCTAAGGTGGTGGNTTTCAACAGGG 

TGCAAATTTGCCTCCTATACTCGCAACACCCAGNGACAGTTGGCTATGTCTNGGAGACAT 

TTTTGNGTTNTCACACCTGGANNNAGGGTGGGGGAGGTGGNGCTAATGACANCAAGNTGG 

CCNTAANCCAATNATGCTGATANAAATNCTACANTGCACAAGGATAGGNTCCCACANAAC 

ANAAGNCTTANCCAAACCCCAAATACTAACAAT 

Sequence 1265 

CCGCGGTGGCGGCCCGAGGTACAAGATAGTCATCTCAGTAAAAGGTCTATTATCTAACTT 
GCCAAACTTGTTTACTGAGAGCCCTAAGGAACTAAAACTGCCATAATGCCGTGCACAGCT 
TGAAAAGCMTTAGAGTMGCAAGATTAGTTTTTCCTCCTTCCAGTTCCTCAGCAGGCCT

GGCTGAAGGCCCAGGAGGGAAGGAAATATAAGAACCAACAATAAAAATAGCAATAGCAAT 

AAGAAGAATGCCATCCCATGGAGCACACCATAATTCTGGAACCACCTCTCCCGGATCAGG 

CTTCCATTGCTCACGATGCTCACGCTGGGCAGCCGCAACTCTACTTTGCAGAACCTCACC 

AACTTGCCCAGGTA7TCTCCCCGGTCTTGAAGAAATGGCTCTCCACCTGAAAAGTTGATC 

TTCTCCATAC CAGC TTCCTTAAGCAAAAGCAATCCTCTCTTTGCTTCCTCAAGGGGCAGC 

ACAAAGGATGTTTTGGCTGTGTGGAAACAGAAGCCCGCATTTGTAGTTGCACTGGCGAGT 

GAAGTGATAGTTGACGCTGGTTGGGGTGGT 

Sequence 1266 

CCGCGGTGGCGGCCGCCCGGGCAAGGTACTTGCTAACTTTGACGCCAGCATCTCTGAAAG 

ATCCCCATCGAAGGCCGGTCATTGCAAATACAGGCTGTTCCTTTTCACCCTTGATCTGCA 

AGACATCAAGTGGAACTGTCTCTCCTTTCACAATGGCAAGTGTGGCATCAGTAATATGTT 

GGACTTTGTTTCCACTTTCGGCAAAGAGGGTATGACTCAAACTACTGGTCTCTCCCAGTG 

GGATAAATCCAATGGGAATCTTACTGAAGGTAGCCTCATCTGTTCGTCGAAGAACACCAA 

GTMCAACCTNCTGCAGTGTCCCATCTCCTTCCTGNAACAATGGATCACATTCCGGNGTT 

TTCCATCAGTTNCAAGGAGGTTTNTTNGGCTTGGGCCCTTAANAAATCTGNGCTTAAACA 

AAAAGGCCCNATTCCTTCCCAAATAAAAANGGAAAAANTCGGGGGCCNCAAI 1 1 1 1 1 1 I 

Sequence 1267 

CACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGT 

GAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAG 

AAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGA 

GCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAA 

CAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCA 

AGCTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGA 

GCTGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGGTATGGCAAAAGA 

TTGGACTAAGACACTGGCCATACCACTGGACAGG 

Sequence 1268 

GGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAA 

ACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGC 

GACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGC 

TGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCC 

TCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAG 

GCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGA 

CTGGTATGGCAAAAGATTGGACTAAGACACTGGCCATACCACTGGACAGGGTTATGTTAA 
CACCTGAATT 

Sequence 1269 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTCTACAG 

AGATCCTGAAGAGATTGAAAAAGAAGAGCAGGCTGCTGCTGAGAAGGCAGTGACCAAGGA 

GGAATTTCAGGGTGAATGGACTGCTCCCGCTCCTGAGTTCACTGCTACTCAGCCTGAGGT 

TGCAGACTGGTCTGAAGGTGTACTCTTGGTTTATCAATGGGACGTTCCAGCAATCCACAC 

AAGAGCTCTTTATCCCCAACATCACTGTGAATAATAGCGGATCCTATATGTGCCAAGCCC 

ATAACTCAGCCACTGGCCTCAATAGGACCACAGTCACGATGATCACAGTCTCTGGAAGTG 

CTCCTGTCCTCTCMGCTGTGGCCACCGTCGGCATCACGATTG 

Sequence 1270 

AGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATA 

AACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATG 

CGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAG 

CTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGC 

CTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGGAGATGTGGTGCCCACT 

AGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAAT 
GACTGGTATGGCAAAAGA 
Sequence 1271

ACACTACTT AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCTCAT 

TTTTATTTTGAAMCCATTCAGCACATTTATCTTATGTAACATGCAGAGCATATATCTAT 

CTGTATTTTTAAMTTTTCCTGTTACTCATTGATACATAGTACTTCCTTGATGTTGTTGG 

AGTCCGTGAGAAACATGGCGACTCGATCAATGCCCATGCCCCAGCCAGCTGTGGGGGGCA 

GCCCATATTCCAGGGCAGTACTCAMGGTGATATTTGCTTTTTTCAATGCTTCAGGGGAA 

AAATCCTTTTCTTTA 

Sequence 1272 

TAAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAANGACA 

AGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAA 

AGATGAAAGACAAACTGCAAAAAATTGCGAAAATGCGACTTTCTAAAAATGGAGCAGATT 

CTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAG 

ATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGA 

ATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAA 

TTCCTCCACCAAGGTTGGTATTCAAAATATGTAATGACTGGGTATGGCAAAAGATT 

Sequence 1273 

AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCGTGGTGACGTGGTTCCCAAAGATGTC 

MTGCTGCCATTGCCACCATCAAAACCAAGCGCAGCATCCAGTTTGTGGATTGGTGCCCC 

ACTGGCTTCAAGGTTGGCATCAACTACCAGCCTCCCACTGTGGTGCCTGGTGGAGACCTG 

GCCAAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAA 

AATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAA 

AATGCGACTTTCTAAAMTGGAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCA 

GGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGG 

AAGCCTCATCCCTTCAGCATCAAGCTGGAAT 

Sequence 1274 

ATCCCCGGCGNGGCNNNNCGCCCGTTCAGGTACTCTrrGTTTTGGCACACTTTTCCTGAC 

AAACAGCCAGTGTTCTCAACACATAAATACTAGTCCACGTTAACAACAATAGCATATGAG 

ACCGCTCTCCGTAAAGATGCCAGATTGGATGCAAATGGACTGGAAATACCTTGGAGGGTT 

TCACAAAMTAAGACAMGGGCAMGGAACTTTGCCAAAGGAGATGGAGAGCAATTCTTT 

AAAGTTAGTGGGAGGGAGGAAGCAAAGAGCTCATAAATACAAGCCTCTTAAAATGGGACG 

CA7TTGCCTCGCGCCTACTGGGTGTCTGCAGCTCAGCTTGGTGCCCCACACAGGACACCG 

ACTTTAAGTGGCTGCCTTTGCAAGGCTGAGAGGCCATGGAGGGGTTGATGCCTGAAGTGT 

CAGCGCCATCTAGTGGAAACATGGGGCATGGCCC 

Sequence 1275 

GGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCC 

ATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATG 

MAGACAMCTGCAAAAAATTNCCAAMTGCCGACTTTCTAAAAATGGAGCAGATTCTGA 

GGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGA 

AGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGG 

GGAATGMGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTNC 

TCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAA 

Sequence 1276 

AGGTACAAAATTATCATCATTTAGAGTTGATTTTTTTCACCAGCCCTGAATTTTCAM 

TGTMTATGCTGTTTCACAATCTTTTTATTAMTTACTTAATGATTCCAGTCTGCAAAAT 

GAGCCATAAGACTTTGCTNCTGTTTGNTATANGATNCATNTGGANATTGGGGGNGGGNAA 

ACCATANG TAAGGTTA AACCTATCCGTCACCTGCTTCATGTAAAAGACTCCACCATTTGN 

TTGGATNTATT1TTTTCTCCAGGCNACTAGTMGAMAMGGTGAACAAAGGTGGATTTN 

C ATCCCTN N CAAANTGGG CCCTTNTGG CN C AATTCTTTTTTC ANTAATCCTATG GTAN AC 

CNNTTTTGGTAGATTACCTTGGTGGTNGMTTTTNAGCNGTrfTTGGNGGCNTMTTNNNA 

AAAAATTCTTTGGGGATTTAAMTTTAAANCAAAAAAACCAAACCAAATAAAANTTCTAA 

TCCACCCNTGNGGAAMTTATTTTGGAAAAGGAAAAATTTTCAGGTTAAAAAACAAAGGG 
ANTGGI^rTGGTTCCTTCCATTAGGTTTAMGGGGAGGAGGNANNATTAAAAAAAATTAAAA

AAATTGGTTCATTTTAAAACAAGGTTTNGAAATTTNAAGGGAA 

Sequence 1277 

AGGTACAAMTTATCATCATTTAGAGTTGATTTTTTTCACCAGCCCTGMTTTT 

TGTMTATGCTGTTTCACAATCTTTTTATTAMTTACTTMTGATTCCAGTNTGGCAAAT 

GAGCCATAGACTTTTGCTCTGCTTGTTATAAGATCANTTGGAATTGGGNGGGGGGGANAA 

CCANTAGTTAAGN CTAAATC TAATCCGGTCACTTGGCTTCATTGTAAAAGAACCCCACAA 

TTGGTCTGMTTMTTTTTTGCCCAGGCACCANGAMGNMNATNGNTGTACCAAAGhn"N 

GMNTACATTCCTTGGCAMNTGGGGCCCTCTTTGGCCCAMTCCTTTTTTCCAATTATC 

CTTATTGGTTAAAACCCTTTTTTTGGTTMGTNTNACCTTGGGTGGTTGGMTNTTTAAA 

GCCGGNCTTGGNNGGCCTTAATTTTGGTAAAAMTTTCTTTTGGGGGATTTTAAM 

AACC CMNMNAACCANAC CAAAN AMTANTTCNTNATTNCACCCCTTGGGGNAAATTNA 

TTTTGGGAAAAAGGAAAANATTTTCNAAGTTTAAAAMCCCAAAGGMNTGGGTNGTTCC 

TCCAATTANGGTNTTAAAAGGG 

Sequence 1278 

AGGTACATTTACACMTATTMCACTAAAMTCTGTGTTTTTTTAAAACACCATAGAAGT 

CAMCCACMAAACCCAGGATCTTGTTTTAMTGTGTTTATGAAGACTGCTGCTGAGCTC 

AAMGCATTGCAGGTAA TCATGA CCACCTAGATGAMGCTGGATGTTTGAAAACTCCTTC 

ATGTCCAATGMT GTAAATTTTTTC ACCTCATCCCCMGGTATTCTCCCATACTTTGTTC 

TACTmGACCTTCTTTT TTTTTT TG 

TGAGGGGGTACAGGTTCTTTTTNGTGGTlsn"AAAAGGAATTACTTTTCATTMT 
CTCCTTGGTTTCCTTTAATTTCCCTTTAAATTTTCCTTCAATM 
TTTTTCNTTTTTAAGNGGACC 
Sequence 1279 

NCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGA 

CAACGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAG 

AAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAG 

ATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAG 

CAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCT 

GGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTG 

AAATTCCTCCACCAAGGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATT 

Sequence 1280 

GTGGCGGCCGAGGTACCAANTGAAGTGTGAATGACAATGGCCATCCANTANTTTATAGAG 

GGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAA 

ATTGCCAAMTGCGACTTTCTAAAMTGGAGCAGATTCTGAGGCTTTTGCATGTCTTGGC 

ATTCCTTCAGGAGCTGA 

Sequence 1281 

CCGCGGTGGCGGCCGCTCGGGCAGGTACCATTCCTCTACATCCATTTGGTAGCAGAACCT 

CMGTGTMGCAGTCAGTGTAGCATGMTATGMCTGGCTCAGTTTATCACTTCCTGTTT 

NGACCTGAAGCACCACCCAGCTATGCAGAAGTGGTAACAGAGGAACAAAGGCGGAACAAT 

CTTGCACCAGTGAGTGCTTGTGATGACTTTGAGAGAGCCCTTCAAGGACCACTGTTTGCA 

TATATCCAGGAGTTTCGATTCTTGCCTCCACCTCTTTATTCAGAGATTGATCCAAATCCT 

GATCAGTCAGCAGATGATAGACCATCCTGCCCCTTTTGTTGAAGGAACACTTGGTTGA 

Sequence 1282 

GAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATAAAACATTATTCCTTCCTTGGCC 

TAAAMCTCATCGCCACCTACATTMAGCTMTATGCCTGATTACTGTTTTTAGAGAACT 

TATTTTATTAGGGCAGTTCCAAGCTCAAAAATACGCTAACTGGCACCTTGTTAGCTACAT 

AAAMTGCACCCTAGACCCGAMCTTACTAGACTCATTATAAMTTTTCTTTAAGGTGTC 

CACGCAGTCCCTGGTCAC ACTTGA AGCAGTCCGGAGAAATATCAGCCCTACCCCAGTAAT 

CCCCAGAAGGAACTTACAC I II I I I I IMTCTTTTCCTACAACTTCATATTTTATAAATA 

AAAAGACAAAAATGTCAGGCCTGTG 
Sequence 1283

GGTGGCGGCGCCCGNCAGGTACCAAATGAAGTGTGAAGACCNGGCCATCCACCACTTTAT 

AGAGGGTGTAAAAATAAACCAGAAATCATGGGAGAAAGAAAAGATNAAAGACAAACTGCA 

MAMTTG<XAAMTGCGACTTTCTAAAAATGAGCAGATTCTGAGGCTTTGCATGTCTTG 

GCATTCCTTCAGGAGCTGAATGAAAAAATGCAACANGCAGATGAAGACTCTGAGAGGGGT 

TTGGAGTCTGGAAGCCTCATCCCTTCANCATCAANCTGGAATGGGGAATGAAGAATAGAG 

ATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTAT 

TCAAAATATGNAATGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCATACCACTG 

GACAGGNTTATGTTAACACCTGAATTGCTGGGTCTTGAGAGAGCCCNAGGAGTTCTGGGA 

NAGGNACCACATTGGGG 

Sequence 1284 

CGCGGTGGCGGCCGCCCGGGCAGGTACCCCGGGAGAGCCCGCTTCCCCCTCCTCCCTGTG 

CTGTCTGCACCCGAGGAGAGCGGCCTGCCCGGAAGTGGGCCACCATATCTGGAAACTACA 

GTCTAT GCTT TGAAGCGCAAAAGGGAATAAACATTAAAGACTCCCCCGGGGACCTGGAGG 

ATGGACTTTTCCATGGTGGCCGGAGCAGCAGCTTACAATGAAAAATCAGAGACTGGTGCT 

CTTGGAGAAAACTATAGTTGGCAAATTCCCATTAACCACAATGACTTCAAAATTTTAAAA 

AATAATGAGCGTCAGCTGTGTGAAGTCCTCCAGAATAAGTTTGGCTGTATCTCTACCCTG 

GTCTCTCCAGTTCAGGAAGGCAACAAGCAAATCTCTGCAAGTGTTCAAAAAAATGCTGAC 

TCCTAGGATAGAGTTATCAAGTCTGGAAAGATGACCTCACCACACATGCTGTTGATGCTG 

TGGTGAATGCACCAATGAAGATCTTCTTGCATGGGGGAGGCCTGGCCCTGGCCCTGG 

Sequence 1285 

CGCCGGGCAGGTACCAAATGAAGTGTGAAGACANGGCCATCCACCACTTTATAGAGGGTG 

TAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGAlGAAAGACAAACTGCAAAAAATTG 

CCAAAATGCGACTTTCTAAAAATGNGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCT 

TCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTC 

TGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGAGATGTGGTG 

CCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTATTCAAAATA 

TGTAATGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCATACGACTGGACAGG 

Sequence 1286 

TCGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGGCAGGTACAGGTAAGATATA 

CTGGAGTCACAGAGCAATATGCA7TAACAGGATACAACAGTTCATAAAAACTGAGTAACT 

ATGCACACAAATTTCTTAAACAGCCACCTAAAGAGAAAATGCACAGATGTATGGTGGAAA 

CTGTATCTAACACTGAAACTACTACAGGACTCCATCAATGAGTCCAACTnTAGTGATAA 

AAAACTACTGTACACTACATGAAGAACCATATGTTTATAATTATCCAAATAAAAATGAAG 

TTATTAAAC7TCAAGATMTATGGTAATTTGCATTGMCCGATGATTTTACAAAATTCTG 

CAAAGGTCAAAATTTTAAAAGATGGCTGAACAGTAA7TGCAGCATCTAATAAAAACGCAG 

CTCATTACCGAGCAAACGGTTTTAATTAAAAATTCAAAAGGAATAATCCTGACAGGAGAA 

ATAAAAAAATAGATGTCAAAAGAAGATAAAATTATTTTCAAAGGAGTAGTAACTCAAGTT 

TTAACACC 

Sequence 1287 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCA 

CTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAA 

ACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGNGCAGATTCTGAGGCTTTGCAT 

GTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAG 

AGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGA 

ATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGT 

TGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCATA 

CCACTGGACAGGGTTATGTTAACACCTGAATTGCTGGGTCTTGAGAGAGCCCAAGGAGTT 

CTGGGAGAGGGACCAGATTGGGGGGTAGGT 

Sequence 1288 

CCGGGCAGGTACAAGATAGTCATCTCAGTAAAAGGTCTATTATCTAACTTGCCAAACTTG 
TTCACTGAGAGCCCTAA GGAAC TAAAACTQCCATAATGCCGTGCACAGCTTGAAAAGCAA

TTAGAGTMGCMGATAGTTTTTCCTCCCTTCCAGTTCCTCAGCAGGCCTGGCTGAAGGC 

CCAGGAGGGAAGGAAATATAAGAACCAACAATAAAAATAGCAATAGCAATAAGAAGAATG 

CCATCCCATGGAGCACACCATAATTCTGGAACCACCTCTCCCGGATCAGGCTTCCATTGC 

TCACGATGCTCACGCTGGGCAGCCGCAACTCTACTTTGCAGAACCTCACCAACTTGTCCA 

GGTATTCTCCCCGGTCTTGAAGAAATGGCTCTCCACCTGAAAAGTTGATCTTTCTCCATA 

CCA GCTTCC TTAAAGCAAAAGCAATCCTCTCTTTGCTTNCTCAAGGGGCAAGCACAAAGG 

GATGTTTTTGGCTGTGTGGAAACAGAAAGCCCGCATTTGTAGTTTGCACTGGCCAGTGAA 

GTGATAGTTGACCCTGGTTGGGGTGGGGG 

Sequence 1289 

CCGGGCAGGTACCAAAATTGTAAGAAGAAGCTTGGGAAGCTGCCACCTCAGTATGCCCTG 

GAGCTCCTGACGGTCTATGCTTGGGAGCGAGGGAGCATGAAAACACATTTCAACACAGCC 

CAGGGATTTCGGACGGCTTGGAATTAGTCATAAACTACCAGCAACTCTGCATCTACTGGA 

CAMGTATTATGACTTTAAAAACCCCATTATTGAAAAGTACACAGGAGGCAAAGTGTTTC 

ACATCATAGACTTCACTTCCAACTCCTTGGAATGTTCATTTCTTTGGCTTACAGGAGAGA 

CTAGACAGGAAGGCCAGGCAATGCTTAGGCAACTAAAATGAGGTTGGGGGTAATGCTAAC 

GTCACCCTCACAGGGATGGCCACGGGGACTGTTATTCGCAAGCTGGTTTTCTAGGCCTGT 

TAGCTGGAAGCATGGTGAGCACCATTTCTGGACGCTCAGGCCGTGTCGGGCTTNAAGTCA 

TCTTNCACCACACAGGTACCTTNGGGCCGGNTCTAGNAACTAGTGGGATCCCCCCGGGCT 

GGCAGGAAATTCGAATATCAAAGCTTTATCGATACCCGTCCGACCTTCGANGGGG 

Sequence 1290 

AGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATAGAGGTTGTAAAAATA 

AACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATG 

CGACTTTCTAAAMTGGACAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGC 

TGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCC 

TCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAG 

GCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGA 

CTGGTATGGCAAAAGATTGGACTAAGACACTGGCCATACCACTGGACAGGGTTATGTTAA 

CACCTGAATTGCTGGGTCTTGAGAGAGCCCAAGGAGTTCTGGGAGAGGGACCAGATTG 

Sequence 1291 

AGGTCATAAAACATTAT TCCTT CCTTGGCCTAAAAACTCATCGCCACCTACATTAAAGCT 

MTATGCCTGATTACTGTTTTTAGAGMCTTATTTTATTAGGGCAGTTCCAAGCTCAAAA 

ATCGCTAACT GGCA CCTTGTNGTACATAAAAATGCACCCTAGACCCGAAACTTACTAGAC 

TCATTATAAAATTTTCTTTAAGGTGTCCACGCAGTCCCTGGTCACACTTGAAGCAGTCCG 

GAGAAATATCAGCC CTAC CCCAGTAATCCCCAGAAGGAACTTACA C I I 1 1 1 1 1 1 AATCTT 

TTCTACAACTTCATATTTTATAAATAAAAAGACAAAAATGTCGGGCCTGTGAGCTGAAGC 

TTAGCCATTGTAACCCCTGTGACCTGCACATATCCGTCCAGGTGGCCTGCAGGAGCCAAG 

AAGTCTGGAGCAGNCGAAAAACCACAAAGAAGTGAAACAGCCAGGTTTCTGNCTTAACTA 

ATTAACCCAC 

Sequence 1292 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATAAAACATTATTCC 

TTCCTTGGCCT AAAAA CTCATCGCCACCTACATTAAAGCTAATATGCCTGATTACTGTTT 

TTAGAGMCTTATTTTATTAGGGCAGTTCCAAGCTCAAAAATACGCTACTGGCACCTTGT 

TAGCTACATAAAAATGCACCCTAGACCCGAAACTTACTAGACTCATTATAAAATTTTNTT 

TMGCTGTCCACGCAGTCCCTGGTCACACTTGAAGCAGTCCGGAGAAATATCAGCCCTAC 

CCCAGTAATCCCCAGAAGGAACTTACAC I 1 1 1 I 1 1 1 AATCTTTTCCTACAACTTCATATT 

TTATAAATAAAAAGACAAAAATGTCAGGCCTGTGAGCTGAAGCTTAGCCATTGTAACCCC 

TGTGACCTGCACATATCCGTCCAGGTGGCCTGCAGGAGCCAAGAAGTNTGGAGCAGCCGA 

AAAACCACAAAGAAGTGAAACAGCCAGTTTCTGCCTTAACTAATTAACCCACCTTACGAC 

ATTCCACCATTATGAC7TTGTCCACCATTATGACTTGTTCCTGGCXJTGCCCCAACTG 

Sequence 1293 
CCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTA

AAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCC 

AAMTGCGACTTTCTAAAAATGACAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCA 

GGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGG 

AAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCC 

ACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTATTCAAAATATGT 

AATGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCATACCAGTGGACAGGGTTAT 

GTTAACACCTGAATTGCTGGGTCTTGAGAGAGCCCAAGGAGTTCTGGGAGAGGGACCAGA 

TTGGGGGGTAG 

Sequence 1294 

CGAGGTACCGCTGTGTCCGGGTGGGTGGTCAGAATGCCGTGCTCCAGGTGTTCACAGCTG 

CTTCGTGGAAGACCATGTGCTCCGATGACTGGAAGGGTCACTACGCAAATGTTGCCTGTG 

CCCAACTGGGTTTCCCAAGCTATGTAGTTCAGATAACCTCAGAGTGAGCTCGCTGGAGGG 

GCAGTTCCGGGAGGAGTTTGTGTCCATCGATCACCTCTTGCCAGATGACAAGGTGACTGC 

ATTACACCACTCAGTATATGTGAGGGAGGGATGTGCCTCTGGCCACGTGGTTACCTTGCA 

GTGCACAGCCTGTGGTCATAGAAGGGGCTACAGCTCACGCATCGTGGGTGGAAACATGTC 

CTTGCTCTCGCAAGTGGCCCTGGCA 

Sequence 1295 

CGAGGTACCTGTGAAGACAGCTACACCTGGTTTCCTCCCTCATGCCTTGATCCCCAGAAC 

TGCTACCTTCACACGGCTGGAGCACTCCCAAGCTGTGAATGTCATCTCAACAACCTCAGC 

CAGAGTGTCAAT TTCTG TGAGAGAACAAAGATTTGGGGCACTTTCAAAATTAATGAAAGG 

TTTACAAATGACCTTTTGAATTCATCTTCTGCTATATACTCCAAATATGCAAATGGAATT 

GMATTCAACTTAAAAAAGCATATGAAAGAATTCAAGGTTTTGAGTCGGTTCAGGTCACC 

CAATTTCGAAATGGAAGCATCGTTGCTGGGTATGAAGTTGTTGGCTCCAGCAGTGCATCT 

GAACTGCTGTCAGCCATTGAACATGTTGCCGAGAAGGCTAAGACAGCCCTTCACAAGCTG 

TTTCCATTAGAAAGACGGCTCTTTCAGAGTGTTCGGAAAAGCCCAGTGTAATGACATTGT 

CTTTGGATTTGGGT 

Sequence 1296 

CGAGGTACAGGAGCAACCTTCTTTCCACCATTACTGGGAATTCCACCACTATTTGCTCCC 
CCAGCCCAGAATCATGATTCTTCTTCATTCCATTCAAGGACTTCGGGAAAAAGTAATCGA 
AATGGTCCCGAAAAAGGT GTAAATG GGTCAATAAATGGAAGTAATACATCATCTGTAATT 
GGTAT CAACAC ATCTGT AC I 1 1 I 1 1 I I 1 1 1 1 1 1 1 I 1 1 1 1 1 ATCT AAAAGCAACATAATTA 

TTTTTCTTGCGATTTTTTCMGAACTCTTTTTAATTGTCTAACACCTGATTCTAGTGTAT 

AGCTTCTGATT 

Sequence 1297 

CGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATAGAGGGT 

GTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATT 

GCCAAAATGCGACTTTCTAAAAATGANAGATTCTGAGGCTTTGCATGTCTTGGCATTCCT 

TCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTC 

TGGAAGCCTCATCCpTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGAGATGTGGTG 

CCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTATTCAAAATA 

TGTAATGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCATACCA 

Sequence 1298 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACC 

ACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACA 

AACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGC 

ATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTG 

AGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAA 

GAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAA 

GTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCA 

TACCACTGGACAGGGTTATGTTAACACCTGAATTGCTGGGTCTTGAGAGAGCCCAAGGAG 
TTCTGGGGAGAGGGACCAGATT
Sequence 1299 

CGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATAGAGGGT 

GTAAAAATAAACC AGAAA TCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATT 

GCCAAAATGCGACTTTTAAMATGGAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCC 

TTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGT 

CTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGAGATGTGGT 

GCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTATTCAAAAT 

ATGTAATGACTGGTATGGCAAAAGATT 

Sequence 1300 

CGNCCGCCAGGTACCAMTGAAGTGTGMGACAAGGCCATCCACCACTTTATAGAGGGTG 

TAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTG 

CCAMATGCGACTTTCTAAAMTGGACAGATTCTGAGGCTTTGCATGTCTTGGCATTCCT 

TCAGGAGCTGAATGAAAAAATGCAACAAGCANATGAAGACTCTGAGAGGGGTTTGGAGTC 

TGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATANAAGATGTGNT 

GCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGG 

Sequence 1301 

CNAATTGGAGCTCCCCGCGGNGGCGGCCGAGGTACAGTATGGCTTAAAAGGCTCTGCCTT 

AGATTCTAGAATCCAGMCATTTTCTCCAAAGACAATCAGGGTATGGGGGAGAAGTTAGT 

TCCA GAGAAGA GAGCGAGTCCAGGGTAGAAGGGATTCTTCTCTCCTGAGGGTCTATGGTC 

TCCCATTTTTTAAAGCAGCAGNGGTATCTATCCCACTCATGGCCTAGAGG7TGCACAGAG 

CTGTCTGGCACCCGCTTCTTTGGCTTTTCTCTCCTGACACCCAGCAATGC1TACTCAGAG 

CGTTGAAGGCGGCCAGCACCTCGAAAGAGATTCTCTGATTTTTTGTGAACACCTGGATGG 

TGMCCCATCMGGGACTTCTGGATCTCGAAATTGTTTTTCAACCCTTCGTGAACAGACA 

GAACTTCAGCTTATCCC 

Sequence 1302 

CCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTA 

AAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCC 

AAMTGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTT 

CAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCT 

GGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGAGATGTGGTGC 

CCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTATTCAAAATAT 

GTAATGACT 

Sequence 1303 

CCGGGCAGGTACTACCATGCCGGGCCAA I I I I I 1 1 I I I GTTTGTAGAAATGAGGTCTTGC 

TATGTTGCCCAGGCTGGTCTCAAACTCCTGGTCTCAAGCGATCCCCCCGCCTCAGCCTCC 

CGMG TGCTGGGATAAMGGCGTGAACCACCATACCCAGCCAGTATTATCTTTTCATTTC 

ATTTTCCAGTTGAGTATATTATTGGCTACATTTGCATACCGCACMTTGTTC 

AAMCCAATATTTTGTTTTGTTCTGTTGTCTACMTMGGAGMTTCAGATGATAMCTT 

ACAACCAATCATGGCCAAGTCCACTTGAGGAATTGTCTCTGTAGATTTATCTGTAGACTC 
CCTAATA 

Sequence 1304 

AGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATAGAGAGTGTAAAAATA 

AACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATG 

CCGACTTTCTAAAMTGGAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGA 

GCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAG 

CCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACT 

AGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAAT 

GACTGGTATG 

Sequence 1305 

AGGTATTCGACCCACGCGCCCGTAGTTTTTATCTTTGACCAACCGAACATGACCAAAAAC 




CAAAAGTGCATTCAACCTTACCAAAAAAAAAAAAAAAAAAAAGTnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnAnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnGCTGTTTCCTGNGTAGAANATTGTTATCnnnnnnnnnnnnnnnnn 

nnnnnnnnnNAGCCCGGGGAGNCATAAAAGTGTAAAAGCCTGGGGGTGCCTTAATGAAGT 

GGAGCTAACCTCCACATTTMTTTGCGTTTGCnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnCTGTCGGTGGCCAGCTGCATTAATGAAATCGGGCCAAACGCCGCGGGGGAAAGAAGG 

CCGGTTTGCGTTATTNGGGCCGCTCTTCCCGCTTTCCTCGCTCACTGGACTCCGCTGNCG 

CTCGGGTCNTTACCNGCTTGCGGGCCGAAGCCGGGTAATTCAAGCTTCAAC 

Sequence 1306 

AGGTGTTAGTTACCACTTCATTACTGGAGGGCACTGTCACAAACTTCTGACTATCCAGAC 

TTGAAGCTGGAAGCAAATACAAGTCTGAGGGGCTCTAAGCTGGGAGGTTCTGGCCTCTCC 

CTAGCTCTCTATGGCTCTACCTCTCTGCTTGAAGCTCCCTGCACTGCACTCCCATTACTC 

TGACTGGGGATAGGACCACTGCTGACAGGGCCCCACCTCAACTTCTTTCATTGCCCTCTT 

CCAGGAAATCCCACCCTGGGATACTTCAAAGACCTCATATGCTACAAAGATCAAGGCCAC 

CTAATGAGTGCTCTAGAGATCAGCACCAAAGATGCTTGCCAGAGTCTTCTCTATATGTCC 

TCCCTCCTGTATCAAATAAGTCACCAAGTATAGCTGACTATCTCCATAACATCTCTCCAG 

TTGTGGCCTTAGTTCCAGCCTTCATCACTCTGCATTCTGAGGAGCTGATATCTGTCTCCT 

GCTGCCAGCCCCTCACCCCCAGTCGGTCTCCAACATA C I 1 1 1 I I I I I I I I II I I CGGACN 

CGTGGGTCGAAAGCCTTGACCCTGCC 

Sequence 1307 

AC I 1 1 1 1 I I I I I I I I 1 1 I AAGCTGCTCCTTGAGGATAAGGGCTAACTCACAGGCAGTGCA 

CCAAGAGCCACTATAAAAAGATCCTTAATGAGCAAAATATATCCCCTATTATTTTCCTAC 

AAGTTGCTTTTTACTTGAGTAGGAACCCTTGATTGATTTTTGCGGACGCGTGGGTCGAAG 

CTTGACCT 

Sequence 1308 

ACTTAGGGCGAA TTGGAGCTCCCCGCGGTGGCGGCCGAGGTGGGCCGCAC I I 1 1 I I I I I I 

I M I 1 1 1 I I I I I I I I I GCCTTC AACAAAA AAGG AATTTATTGGCTCACATAAGNG AAAGC 

TGANAMTAGATCTCMTTCAGGTCCAATTTGATCCANAAGTTCCCAAGGGCTCCAATAG 

ACTCCCTNTCACCCTGGTACCTGCCCGGGCGGCCGGCCGCCCGGGCAGGTGTCAAGCTTC 

GACCCACGCGTCCGTCTTANCCTACAAGGNGGCTNTTATGTACATAGNTGGGTAATACAT 

CCAATTAATGATGTIvrrGACATGCTATTTTTGTAGGGAGAAMTATGTGCTMTGATATTT 

TGAGTTAAAANATCTTTTGGGGAGGATTTGCTGAAMGTTGCACTTTTGTTACMTGCTT 

ATGCTTGGTACMGCTTATGCTGTTTTAMTTATTTTAAAAAMTAAMTACTGTTTGTN 

AGAMCCANCTGGTTTAGAAMGTTTNNTATGTGACGATAAMCTMGAMTTTCCCTTN 

TTATTCCTAGTATTTTTCAGCACTCCATNAATTCTATTACCCTAA 

Sequence 1309 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGT 

CCGCAAGACCCAAGAGAGCTGATGGTGCAGTTCCAGTCTCACAGAAGTCTGCTGGAGAAT 

TTCCTCTGGATTGGAGMGGTGGTCTTTTCATTCTACTCAGGTCTTCTATTGATTGGATA 

AAGGTGCACATACATCATGGNGGGCMTNTTTTTTACTCANAGGACATCTGTGCCAATGT 

TATTCCCATNCAGMACAGCCTCCAAGTTGACACATNAMTTTMCCAATACACTTTTCC 

TTATCTTNACCCAAAANTT AAGGT TAAAGMGTTTTNGGATNNCCCTGAATAANNNATTG 

GCCATTNGGAATGCCCACTTTTTNCCTTTm 

TTTTNTNAAMNTAAAAAMACTTGTTGGGCTTANGGNCCACCTNGGTTTNCANCCTTAT 

TAATTCCCAGCCCNTTTTTNGGANGGNCCCNANNGNCAGGNNNGGATTCCCTTTGNGCCC 

CCCGG GAANTTTCMNNNNTTTNNGCCCNGGGGNTAANAAANAAGGNGNAAATCCTNTTT 

TTTTTCANAAAAAAAAAAAAA 

Sequence 1310 

GGGGGGGCAT TTCACGCTACTTAGG GCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGG 
GCAGGTCGCAC I I 1 1 I I I I I I I I I I I I 1 1 1 1 ACCTAGCCCATGTTACCAATCTAAATG AA 
CTGTCTATGCATAGAAAACTGCTTTATGCCTAAGATAATTACTGGGATTTAAGAAAGTGA
GAAAAAAGAATAGGTGGGATTGAGAAATTAGGTAAAAACAGAAGAGGCCAACTAAACCCA 
AGTGCTGCCCTTCAAGGGCTCTAGTAACCGGACGCGTGGGTCGAAGCTTGACCT 
Sequence 1311 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTAATCCCGTCTT 

ACAGAAGAGAAAACTGAGATTTAGCAACATAAAAGTATTTCCCGTAAGTAAACAGTAGAG 

CCAAGATCTTGACCTACGCCATCTGATACCTGAGCCCATGCTATAAAAGAGGAGCATTAG 

AAATATTTGAAAGATAGAMTGAGAACTAGTCAATATTTATTTTGCTTAGCACTGTATTC 

AGTATTATGGCATCTTAAAGTAGTTAAGACTCAATATTTCATCAAAAAAGTTTAAAATCT 

AATCAGAGAAT 

Sequence 1312 

TACTTAGGGC GAAT TGGAGCTCNC CGCGG TGGCGGCCGAGGTCAAGCTTCGACCCACGCG 

TCCGGCCACATTTTCMTTTAGCCATTTTTCTCTTATTCACCTTTTTCTGCTAATTACTC 

TGTAATTCCACTAAGAAAAGTCAATAGATAATTCCAATAATGACTTCACTCCTGAGAATT 

TTATTAGCTGCTAACGCTTGTCTCATCATAAGCACTCATATGTTCATTGAGTAAATATTT 

ATTG AGTA TTTGCTATG GTCCA GGCACTGTGCTAAGTATTGAGGATAAAATGGTGATTGA 

AACATTTTCCCTTCTTGATTTTAACATCTACAAAATAAAAAGTATGTTAATATCAAAAAA 

AAAAAAAAAAAAAAAAAAAAGGN 

Sequence 1313 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAAAGCCTCACTCACAATTATT 

TTTGAGATAGCTCCCAATGAGTTTAATCACTGCTATGCCAGGTGTGTGAGGCTGCTGTGG 

GACAACAATCTTG ATTCC TAGAAGAGTCATAAATTTCTAGGGACTACAGGCTCCTGCCAC 

CATGCCTGGCTMTTTTTGTAGAGATGGGGTTTCACCATGTTGCCCAAGCTGGTCTTGAA 

CTTCTGAGCTCAAGCGATCCACCTGCCTCAGCCTCCCAAAGTGCTGGGATTACAGGTGTG 

GGCCATCACGCTTGCCCTAGAGTAATATTCTCTATTATCAAGGTAGAAAGTTCAACATAT 

ATTC ATTAGATCTACTTTATAGATACTGTTACTCAGATCACTTATATCGTTATATGTATT 

TTTTGTCTTCTTMCTTCMGTCTTGGATGAGAGMGAGGTGTTTTAAATTTCTCTGTTA 

TTTCTAGGGTTCTATTCATTTT 

Sequence 1314 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTAGATGACAACATC 

AAAACATACTCTGATCACCCCGAGAAAGTAAACAAAGATGATGAGGAATTCATAGAAAGC 

AATAAAATGCATGCTATTAATGGAAGAATGTTTGGAAACCTACAAGGCCTCACAATGCAC 

GTGGGAGATGAAGTCAACTGGTATCTGATGGGAATGGGCAATGAAATAGACTTACACACT 

GTACCTGCCCGGGCGGCCGGCCGCCCGGGCAGGTCCGGGCAGGTGCTGTGAGTGCTCTGG 

CGAAGTTTGGAGCCCAGAATGAAGAGATGTTACCCAGTATCTTGGTGTTGCTGAAGAGGT 

GTGTGATGGATGATGACAATGAAGTAAGGGACCGAGCCACCTTCTACCTAAATGTCCTGG 

AGCAGAAGCAGAAGGCCCTTAATGCAGGCTATATCCTAAATGGTCTGACTGTGTCCATCC 

CTGGTCTGGAGAGGGCTCTGCAGCAAGTACCT 

Sequence 1315 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGCAGAT 

CAGACGTGGCGACCCGCTGAATTTAAGCATATTAGTCAGCGGAGGAAAAGAAACTCTGAA 

TCCGACCAGTGTAGGTGATTACATTAGCCTTTGAAGTCAACACAAAGTTTAAAACACCTG 

CCCGGGCGGCCGGCCGCCCGGGCAGGTGTACAAGCTTCGACCCACGCGTCCGGCTGAAGA 

CATCCCTAGGGCAGGTAGCAGAATACCTAATTCAACCTAGAGAGGCACAGGCTGCACGAG 

AGTCTCTCAGATAAAGCCCCATTGAAAATAAATTTACAATCTAAAATTTAAAAACCCGTT 

AAAAAAGCAGCACAGCATGAGGAGTCAGTAGATACACTGAAAGCAAGATTAGATCTTCAA 

GACTTTCAAACTATAAAATTTAGAAAATTATAATAAATTATGAAATAGAGGCCCTTTCAT 

GTCAAAAAGTCATGAAAG 

Sequence 1316 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGCG 
AAAAGATGAGGCAACAAGTAAGAGAAAACAGCATTGAGCTTAGAGAATTGGAGAAGAAAT 
TAAAAGCAGCTTACATGAATAAAGAAAGGGCAGCTCAGATTGCTGAAAAGGATGCCATTA

MTATGAACAAATGAAACGTGATGCTGAAATAGCCAAAACCATGATGGAAGAACACAAGA 

GAATAATAAAGGAAGAGAATGCTGCAGAAGACAAACGAAACAAAGCGAAAGCACAGTACC 
TGCCCG 

Sequence 1317 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAGCGTGCATAG 

GGACTCTTGCCTTMGGAGTGTAMCTTGATCTGCATTTGCTGATrrGTTTTTAAAAAAA 

CMGMATGCATGTTTCAAATAAAATTCTCTATTGTAMTAAMTTTmCTTTGGATCT 

TGGC AAAAA AAAAAAAAAAAAAAAGTGCGGCCGGCCGCCCGGGCAGGTACACTTGTTGAT 

AAGAGTTTTCTGAAAACAGTCTATCAAATATAAAGAATGGTTTCTATCCAAGAATCAGCA 

GTGAGGGAAGAAATACTAAACACCTGTCAAGAAATCAGTTATTCATTTTAAAAAATAACA 

GMCCAGTGCTGCTCTCTGTCATAAAAMGAACATGTAAMTTTATTTTTATAGGCTTTG 

GTAACATTATATTCCCCACAGAGGCCTTCAATCCTACTTAAAGATA 

Sequence 1318 

AGGTCAAGCTTCGAC CCACG CGTCCGGTTACTAGAGCCCTTGAAGGGCAGCACTTGGGTT 

TAGTTGGCCTCTTCTGTTTTTACCTMTTTCTCMTCCCACCTATTCTTTTTTCTCACTT 

TCTTAAATCCCAGTAATTATCTTAGGCATAAAGCAGTTTTCTATGCATAGACAGTTCATT 

TAGATTGGTAACATGGGCTAGGTAAAAAAAAAAAAAAAAAAAAAAGTGCGACCTGCCCGG 

GCGGCCGGCCGCTCGTGATCTAGATCGCCGACCT 

Sequence 1319 

GCGAATTGGAGCTCC CCGC GGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTCCGGTTT 

GGGTGGAATTATMTATTTTAG ATAAG ATTTAAGAGGATTCTAATTCTAGCTACTTGATA 

GGAATGCGAATGATGATAAGGCTTTTAGAGTTAGATAAGAGAGAGGGCTAGCACCCTGAT 

ATTCTGTMTTG AAAC AGAGTTTCAAGTCCTTTGGTCAAGTATTACCCTTATTCCTTCAG 

GMTAGTAGATATTTTAAGATTACAGATAGGTTATCTTATCTAATTTACCTACCTATTGT 

TGAMTTATTTAATTTGCATTTAACTGTGTTTTCACACCTGCCCGGGCGGCCCTCTTACC 

TGCTTCTGACCTTATGCTCAAGAACTCCCCTAACTCTGGCCAGAGCTCAGCTTTGGCAAC 

TCTGACCGTTGAGCAGCTCTCATCCCGGGTTTCCTTTACGt 

Sequence 1320 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCA CI 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 

AAGCTGCTCCTTGAGGATAAGGGCTAACTCACAGGCAGTGCACCAAGAGCCACTATAAAA 

AGATCCTTMTGAGCAA MTATA TCCCCTATTATTTTCCTACMGTTGCTTTTTACTTGA 

GTAGGAACCCTTGATTGATTTTTGCGGACNCGTGGGTCGAANCTTGACCT 

Sequence 1321 

GG GCGAATT GGAGCTCCCCGCGGNGGCGGCCGAGGTCAAGCTTCGACCCACGCGTCCGGT 

TTGi 1 1 1 1 i iCTTACGGCAACTCAAAGCAAAGAGCTGGAGGAGCCAGCCATTATNATTGC 

TTACTCTCATCGCTTAGCGCCCCAGGTGGGATGTGTTTCCAAMCACATTTTTGTATTTA 

TMGGAMTGTAGTTAGGATTMTTTTATTGTCCTMTTAGMCTCACATTTTGGTTAAA 

TCCTC AATTTC ATTAAAAAAAAAAAAAAAAAAAGTGCGGCCGGCCGCCCGGGCAGGTACA 

AmAATTTTTCTGCTTGCCCAAGAAACAAAGCTTNTGTGGAACCATGGAAGAAGATGAA 

AATGAGACTGGCAAAGAACAAATGCTGAATCTGAAGAAGATTTGGGCAAATAATCTGCAT 

ACTTTTMTTGGGMTAAGATGGAAAATATGAATGCTAAATCAAATTTTTTA 

Sequence 1322 

CCGCGGTGGCGGCCGAGGTACAAGCTTCGACCCACGCGTCCGCTCACTTCATCCTCCCAG 

CAACCTATTATGATCCATTGCCACACCAACTTGCTGATGAGGAAAGTGGGGCTTAAGGAA 

ATTAAAGAGCTGTTGTGGGACTTCCAAAGCAGAAGACAGTAGGCTTTCAGAAATTTGATA 

AAMTAGCACTTTGCATTTCTTGAATCTTGAGCTAAATGGAAATTAATACTAAACATTCT 

CCACTGGTAAAATAGAGAATAAGGATATTAACAGTAAAAGAAAAGAAGAAGAAAAGGAAA 

TGTGCTTCCACAGATTTAGAAACATAAGTAACAATCTAAGTTAAGGCTTTGGCACCTGCC 

CGGGCGGCCGGCCGCCCGGGCAGGTTCAAAGACTACCAAAGTATGTATTTGATTTTCACA 
TG CAAAC AACTTAAA 
Sequence 1323

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTC 

CGCAGAAACCTTGCCATCATTCTTACTGCTGGTTTGCATCTCATTTATGGTGNNTCTGGG 

ATTTCTTCTCATGAACAGAAAGATTCAGTGGCTGTTATATGCCTTGTCAATTTAATGTAT 

TGCCCTATCCTCTTTTTGATCAAAGATAGAGACTAAGACTGGGAATTATGACAGAAAAAG 

T CATATTT TTCTTTAMTGATTTTGAAATGTTAAAATAGGCCAATATGAGTCAAAGTGCA 

MTTTTTTGGTGACCTGCCCGGGGCGGCCGGCCGCCCGGGCAGGTACTAAGCATTTCAGT 

TCCAGGAGAATAAMGAMTTCCTATTTGAAATGAATTCCTCATTTGGAGGAAAAAAAGC 

ATGCATTCTAGCACAACAAGATGAAATTATGGAATCAAAAGTGGCTCCTTCCCATGTGCA 

GTCCCTGTCCCCCGCCGCCAGTCCTCCACACCXJAAACTGTTTCTGATTGGCTTTTAGCTT 

TTTGGTGGIIIIII II I III I 

Sequence 1324 

CCGCGGTGGCGGCCGCCCGGGCAGGTGCCTAATATATTTACTCTCTGGTCCTTTACAGGA 

AAAGTTTGCCAACCTCTGGCTTAGATGATCACCTGAGGCCAAGGAGCCTCGCCCTTGAGC 

ACAAGACTATGTAGTCAGTAAAGCACAAACAAAATTGGGGCTTTCCCTAGCAAGGTTGGA 

AAGGCGGAGAAGAAATGGATTTGGATAGGTAGTCAACAATGTCTGTTTTATGTTACCACA 

CATTTTCTCGAGAAATTTCAATCAGCTCTCTGAGAACAGATTCATCTTTAAATGAATGTT 

CATAGGTAACAGCAACTCATGCATCAATGTTGCAAAGTGAGCTCATTTTCACATTGCTTC 

AGGTTAGGCAGAAGGTTTGGTAAAGGGATTAACGTAATTGTTTCCTTGNTGTTTACAAAA 

AGAAGTCCCAGTTGGCATGCCACATAAAATCTTCTGNATCTCACTCTTGGTTACATTTC 

Sequence 1325 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCACGCGTCCGCAGAGGCCC 

AGGCTCCCAAACCG ACAAA GTGAAAAGAGACCAGAGAGGCCAAGCATATTGACTGGTGCT 

GTTCAGGGCCTGCTCTTTTCCACTCACCACTTGTTTTGCTGCTTGTCACGAGGAGAGTTG 

TTCCTGTATGTGGCTGCTCTCAGATCTTTCCAAGCAAGCCAGTCATTTGAAGAGGTTTTC 

TTTTCATGCTGGAGGGCAGGCTAAGATCAATGAGTGGAAGAGAGAAAGGCTGTTTTAGCT 

CAAGTTAAAGGAACACCTTCTAGCCATCAAAGCCGCCCAACAGAGGCAAGGGCCACCACA 

CATGAGAGAGCGCTCTNTCCTTAA 

Sequence 1326 

GCGAATTGGAGCTCNCCGCGGTGGCGGNCX3CCCGGGCAGGTACCAAAATAATTACCAACA 

NTACATTATGTACACCATTTACAGGAGGGTAACACAAACCTTGACAGGTAGTAACTTTTC 

ACCCCACATNACTGMCGCTTAACACTCCTGGCTGTTAATTGTCAGTTCAGTGTT7TAAT 

CTGACGCAGGCTTATGCGGAGGAGMTGTTTTCATGTTACTTATACTAACATTAGTTCTT 

CTATAGGGTGATAGCGGACGCGTGGGTCGAAGCTTGACCT 

Sequence 1327 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATCTGTAGCCTATGACTTG 

AGTCTCTTGAACTTCTAGGAAGAGGCAAACTACAAACTACTAGGATTCTGATTTCAGATA 

TAGGCATTCCAGAATCTTCTCTTTACGAGTTCACCTGCTAGTATAATCTCCACAACTTGA 

ATGGCCTTGGTTGTTCTGTAATTGCTGCCAAAATCATCACAAGCTGTACCTGCCCGGGCG 

GCCGGCCGCCCGGGCAGGTCAAGCTTCGACCCACGCGTCCGGATGGGAATTCAGGTATGA 

AAGAAAACAGGCAAGGAGGCACTGAGGGAGAAAGACACAGACTTTATCGCTCTGTGGCTC 

ATTGTTACTGGAATATTCTAAAACTCTTGTTCACATGCTATTATGACTTATAAAGCAGCA 

ACAGCTGAGGCGCACCAGGACACAGCTTCCATTTCTTTAACGT 

Sequence 1328 

AGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTCCGA 

AAAATGGGAGACAATTTCACATGGACTTTGGAAAATA 1 1 I I 1 1 I C CTTTGCATTCATCTC 

TCAMCTTAGTTTTTATCTTTGACCAACCGAACATGACCAAAAACCAAAAGTGCATTCAA 

CCTTACCAAAAAAAAAAAAAAAAAMGACCTGCCCGGGCGGCCGGCCGCCCGGGCAGGTA 

CAAGCTCGACCCACGCGTCCGAMTMTAMGCTAGMGTMTATTTTTCCTTTTGTCTA 

TTTrCCAAATTGACTCGATATTGATGGCTACTTTTGTMGTTTTTATTTAAGTTTAAAGG 

GAATATTTATTGATCACCTNTATGTGCTCAGTACCT 
Sequence 1329

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGCTTCGACCCACGCGTCCGC 

TTGGGGATTTCGAGGMGGGTTCATMGGGAGATTTTAGCTGAGAMTACCATTTGCACA 

GTCAATCACTTCTGACCAAGTTATCAGAAAAAGGAGAAAAGAATGTCTCCCCACTAAATG 

TTCTAGGGTGGTGAGAMTCTAGGGTGGTTATCTAAATCAACAATATTTAGATATTCCAA 

TATCTAAATATTGTTGGAAATACTCTCCTGAAGTGTTCATTGAACTCTAAGAGAGACAGC 

TTGTGTATCAGTGGCAGGGTTTMGGTTCATTTTTTATTCCCATATTMTCCTTTM 

TTAGACMATTTCCTTCTGAGTTTAAGGATAAAATGGGATGGGTTCTGCCTGGGCCTGGC 

CCTCATGGGGACATCAAAGGGCAATGTTGCAAAAAAAAAAACC 

Sequence 1330 

AGGTCMGCTTCGACCCACGCGTCCGTGMC7TTTATCMGGCTTTTGCTCTTTTAGACT 

TGAGTTTATCTTTATMTTAAGGAGMTGGTTTTTAAMTTTAGTTCCTCTGACACCCCA 

MATTATCAAMTA AATTA TGTTGTAGTGAATCTGTGTTTTGAAAGTCATTGATAGGACT 

TATATGAGTCAAAATTTTATGGATTATAMCTAGGCTTTATCTGGTTGGAAATAATTGCA 

ATACMGAAGCAACTTTATTAMTTAGACCTAMGTCACMTCTTCTTCTTTGCTGCTTT 

TTAAAMTTACCTATTACCTTTAAMGATCCCAMTTTAGAAGAGGAATTAAAATAAAAG 

TTAATGCAATAAAACACTTCCACAATATTCTATTACTTCAACCTCTAATCAATGAAA 

Sequence 1331 

AGGGCGA ATNGGAGCT CCCCGCGGTGGCGGCCGAGGTACCCTAAAATTTAAAGTATAATA 

ATAATAAATTTTTGTTTTAAAAAMGAGTGTTGTCTTTGTCTTGTATTT^ 

CATGTGATATTCTTA GGTAT AGA 1 1 M 1 1 1 \ AGTATTTGTCCTGTATATTGTTA7TCGAG 

CTTCTGGGATCTGTGTTTTGGTGTCTATCATTMCTTTGGMTATTCTCAGTCATTACTG 

CT TCAM CATTCATTCTGTTGCTTTTTCTCTTCTGGTATTATCATTACACATATATCACA 

CCTT TTGT M TTCT CCCACAGTTCATAGATATTCTGTNGTATTTATTTATTTT^ 

TGCCTTTTAGTTTTAGAGATTTCTATTGACATGACTTAMGATGATTGATGAGTTGATGA 

GAATTGAGAGAATTGATGAGAATTGTTGATGAGAATTATT 

Sequence 1332 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTTTCCTGCATTTC 
TAATGMGAMTAGGCTGGTCCTCMTTTTGCAGAAGTTGTATCATCATAGGTCATACCT 
AACATTCGTTTGTCAAGAGCAAAAAAACCCCCTTGGGTTCTCTGGATCTCACACAGCCCA 
CAMCCTTCAGMTGTGGTTCCTTCCCGCAGGCTTTGTCACACTTAAGATCCAAGAACAA 

ATCAGCCTGGCTTTAACATGGGGTAGATGGCMGAAGGATAATGCGGACGCGTGGGTCGA 
AACTTGACCT 

Sequence 1333 

CCGC GGTGG CGGCCGAGGTAGCGGTCGCAC I I 1 1 I I I I I I 1 1 I I I I I 1 1 I I AAATAGAGA 

TGAGGTTTTGCTATGTTGCCCAGGCTGGNCTNCTGGACTCAAGCAATCTCCCACTTCAGG 

CTACCAAAGTGCTGGGATTTACAGGCATGAGCCACCTCTCCCAGTCTCAGTTATTATTTT 

MTAAATGAGACTGAACGTCCTCTTATAAGGCTCACTCCCTTGTTCCTACTACATTTGCT 

CTGTTTMGTATCTCTTTAMTTCTTCAGTTAAGCGGACGCGTGGGTCGAAGACCTGCCC 

GGGCGGCCGGCCGCC CGGG CAGGTATTMCAGGTGCTTGCAGTTTGTTGACTTTTTTGAA 

AAMTCAAGTTGTAAACTTTTATTACAMTTAAAMTGMGTTCTTAAAAATCTCAACTT 

GACCAGATATGMACMTTTAAAMCCTTTAAGGCGTATTGAGAAAMCCAGGCTTTTTT 
AAA 

Sequence 1334 

ACTTAGGGCGAATTGGAGCTCCCCGCGGNGGCGGCCGCTCACCTGCCCGGGCGGCCGCTC 

GAGGCCGCTCACCTGCCCGGGCGGCCGGCCGCA C1 II I I 1 I I 1 1 I I I 1 » II I I NNCAAAA 

CAAMCATGCTTAGCATGCACACTTTTACCACTTTTTTCGAGTGGAMGTTTATTGGCAA 

TATTAMTTTCACCCTAN ATAGG ATATGAGMTGTTTTGATAMTCACMTTTATAGTAT 

ATTMTGCCATGTGAGMTTTTGTTTCCCAAGTAAGAGCTCACATGGAACTTGGTCATTA 

MCCTTAAAGAAA CCTTTC TCACATATCTATAGGCCTCAAATTGAAATAATCTATAAATG 

^"nTGTAGATTTCTTTTTAGTTTMTTCCTGAGTATACAGGGCAAAAGCTTATATCCTT 
TATATAMCTTCTGCTTT6GTCTAAAACTGATATATCTTCACGTTGAGGTTTCATCTGAA
ATGCNCCACCGTTTGCTGACTTGCTTCAATATGAATTTGGATGGCTATAAAATTGACCTC 
GGCCGCTCTAGAACTAGTGGGATCCCC 
Sequence 1335 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCGGCCGCTCGAGGCCG 

CACI 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 GGTAAACAGGCGGGGTAAGATTTGCCGAGTTCCTTTT 

AG I I I 1 1 1 I AACCTTTCCTTATGAGCATGCCTGTGTTGGGTTGACAGTGAGGGTAATAAT 

GACTTGTTGGTTGATTGTAGATATTGGCGGACGCGTGGGTCGAATCTTGTACCTGCCCGG 

GCGGCCCATAGTTTGTCAACCACTGGTGTAAAACCTTAGTTATATATGATCTGCATTTTC 

TTGAACTGATCATTGAAAACTTATAAACCTAAGAGAAAAGCCACATAATATTTAGTGTCA 

TTATGCAATMTCACATTGCCTTTGTGTTAATAGTCAAATACTTACCTTTGGAGAATACT 

TACCTTTGGAGGGAATGTATAAAATTTCTCAGGCAGAGTCCTGGATATAGGAAAAAGTAA 

TTTATGAAGTAAACTTCAGTTGCTTAATCAAACTAATGATAGTCTAACAACTGAGCAAGG 

ATCCTCATCTNGAGAAGTGCTTAAAAT 

Sequence 1336 

CCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTCCGGAAGATCCTGCGGAAGGAA 

TATGTTTTTGCTGACTCCAAAGTAAGTGACAGCAAACTTCTAAAATGGGCTGTGAGGTAG 

GGAGGGGACACAAGCGTTTTGAGGCTCGCTGNGTGCCAGGGAGTGTATCATTAGCTCACT 

CAATTCCCAGAACAACCCATTTCACACCTGGGAAAGGTGAACTTAGAGAAGTTGAGGATC 

ATGTTCCAGGTTGGCCTGGATTTGAGCCATCACTGTCTCAGGAGTAGGGAGGCTTCCCAC 

TTTGCCCAGCTGCCTCCCAGCCTCGAGGCCACATCCTTTATGACCCACATCTAACTCAGC 

CCCACACCTGGGGGAAAGGCTTTCAGCTTCTCTGGGCTGGACTTGGGAAATCTTTGGGAC 

ACTCCTGACCTGCCCGGGCGGC 

Sequence 1337 

CCGCGGTGGCGGCCGCCCGGGCAGGTGTCCCCATGAGGGCCAGGCCCAGGCAGAACCCAT 

CCCATTTTATCCTTAAACTCAGAAGGAAATNNGTCTAAATATTAAAGGATTAATATGGGA 

ATAAAAAATGAACCTTAAACCCTGCCACTGATACACAAGCTGTCTCTCTTAGAGTTCAAT 

GAACACTTCAGGAGAGTATTTCCAACAATATTTAGATATTGGAATATCTAAATATTGTTG 

ATTTAG ATAAC CACCCTAGATTTCTCACCACCCTAGAACATTTAGNGGGGAGACATTCTT 

TTCTCCTTTTTCTGATAACTTGGTCAGAAGTGATTGACTGTGCAAATGGTATTTCTCAGC 

TAAAATCTCCCTTATGAACCCTTCCTCGAAATCCCAAGGT 

Sequence 1338 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGCTTCGACCCACGCGTCCGCTT 

GGGGATTTCGAGGAAGGGTTCATAAGGGAGATTTTAGCTGAGAAATACCATTTGCACAGT 

CAATCACTTCTGACCAAGTTATCAGAAAAAGGAGAAAAGAATGTCTCCCCACTAAATGTT 

CTAGGGTGGTGAGAAATCTAGGGTGGTTATCTAAATCAACAATATTTAGATATTCCAATA 

TCTAAATATTGTTGGAAATACTCTCCTGAAGTGTTCATTGAACTCTAAGAGAGACAGCTT 

GTGTATCAGTGGCAGGGTTTAAGGTTCA I 1 1 1 1 1 ATTCCCATATTAATCCTTTAATATTT 

AGACAAATTTCCTTCTGAGTTTAAGGATAAAATGGGATGGGTTCTGCCTGGGCCTGGCCC 

TCATGGGGACATCAAAGGGCAATGTTGCAAAAAAAAAAAAACCTGCCCGGGCGGC 

Sequence 1339 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGATTCTCTGATTAGATTTTAAACTT 

TTTTGATGAAATATTGAGTCTTAACTACTTTAAGATGCCATAATACTGAATACAGNGCTA 

AGCAAMTAAATATTGACTAGTTCTCATTTCTATCTTTCAAATATTTCTAATGCTCCTCT 

TTTATAGCATGGGCTCAGGTATCAGATGGCGTAGGTCAAGATCTTGGCTCTACTGTTTAC 

TTACGGGAMTACTTTTATGTTGCTAAATCTCAGTTTTCTCTTCTGTAAGACGGGATTAA 

AGTACCT 

Sequence 1340 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCTACTCAAGTAGTCT 

TTACCCCCTACTCAAGTAGGGGGTAAAGTGTAGAACAAGGAGTTTGATNTGTGTTNGCTG 

ATTGTGAACCATCAATTGAGATAACTCACTACCTTCAGGCCAGCCAGTTACATACTTTTG 
AAMGCCMGAGTGMGCAGGGTTGTTTTTCATCCMTTCTTGGTCTTTTTGTTAMGGC

AGCMTAAGATAGGGTGGTTTCGGGCAATCACTTAGCTAATTGGCTCTCTATAGTCATAC 

CTGGATMTATTTGTAGTCATACCTGGGATAATATTTAAAGGGAAGAAACTAAACATAGT 

CCTTAAGTAGGAACCAACTACAAT 

Sequence 1341 

CCGCGGTGGCGGCCGAGGTCCTAGCTTGAGTCGACCCACGCGTCCGGCCGCTGTTCGTAT 

TTCTTATTCTACMCMGGGNGCAGCCTANAGGCAMACACATCCCATTGTCATTTTTTT 

GTAMTAAAGTTGTATTGGMCATGGCCACTCTCATTTGTTTTCTATTATTTATGGCTGC 

TTTCACTTACAA CCTGA GTGGTTGCCACAGAAACTGTATGGCCTGCAAAGTCTAAAATAT 

TTACTATGTAGCTTTTTCTTTTCTTTTTGGAGACAGTGTGCCACTCTATTGCCCAGGCTG 

GAGTGCGGTGGTGTGATCATGGCTCATTGCAGCCTCAAACTCCTGGGCTCAAGCAATCCT 

CCCGC CTC GGTCT CCCMGTAGTTGGGACTACAGGCATGAGCCACCATACX)CGGCTAATT 

TTTTTMAGTTTTTGGTAGAMTGGAGTTTTTTMTGTTGCCCAGGCTGGTC^ 

CCTGGTCTTAAATGACCCTTTTCCCATCAG 

Sequence 1342 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAGTTATTTATTGATTTAATCAT 

TGTMTCTCCAATAGAGATTACMTAGAGATCTCCMCATGATTTCATGCATTTAGAGGA 

GAMTATTTCCTGGTTMGTGGAAMTTGTGCGGATGTGGCXITCTGGAAGACCTTCATTC 

TAAAGCAGCGGACGCGTGGGTCGAAACCTGCCCGGGCGGCCGGCCGCCCGGGCAGGTCTG 

CAATCCAGCTAGGCATGGGAGGGAACAAGGAAAACATGGAACCCAAAGGGAACTGCAGCG 

AGAGCACAAAGATTCTAGGATACTGCGAGCAAATGGGGTGGAGGGGTGCTCTCCTGAGCT 

ACAGAAGGAATGATCTGGTGGTTAAGATAAAACACAAGTCAAACTTATTCGAGTTGTNCA 

CAGTCAGCAATGGNGATCTTTTTGCTNGTCTTGCCCATTCCTGGA 

Sequence 1343 

CTATAGGGCGAATT GGAG CTCCCCGCGGTGGCGGCCGAGGTACCAGCTTTAAAACCCATA 

CCCCTCC AGGGTT CTTTTCTGTTGCTGGTGAAGTGCACTTrTTAAMGAGTNATTCATAC 

CATCMGATTTTTGACMGAAAATTTTAGAAAAACTGTGGAAGAAAACTGATTGCTCTTA 

GTTCTAGCCATGTGTMTTGCTGACCACCTGAAATGGTCCAAACTGAGATTTGCTAAAGC 

ATAAAATACACACCATATTTCAMGGTTTTTAAMGMTGTAMACATTTCATTMTTTC 

GGACGCGTGGGTCGAAGCTTGACCTGCCCGGGCGGCCGAGGTGGATGGACCCATCCATTC 

AGGCAGGGGGTGTGGGGTGTCCCCTGTGCTTAGAAACCACCTAGCATCATAAGCTGCAAC 

AGCACTTTATTGGGATCTGAGTCTACAGTTCACATAGGGAGGTGAAGCCGTGGGAGMGC 

AGGGNGTAAAAAAAAAAAAAGGGGGGGGGACTTTACCCCCCTAAGGACAGGNTGCTTTCC 

AAACCTAACAAAAAC 

Sequence 1344 

CCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTCCGGTAATTTGTTGCACACTAT 
GTMCAAAACAACTGMGATATGTTTAATAAATATTGTACTTATTGGAAGTAAAAAAAAA 
AAAAAAAAAAAAAAAAAG TGCGGCCGGCCGCAC I I I < I II I » 1 1 I I 1 1 1 1 GCTGGGGTTT 
TTTTCTTTTC1 I 1 I I I I 1 1 CAGCTACAGGAATTTAGCCAATTCANAGGAAATCTTCCCCA 
TMTTATGGMCTTTNTTACAGATTTTACCAAGTCTGGTCAACCCAATAAGAAAAAGACT 
GAAATAACAATAACAACTTCAACAAATAAAAAAACAGTTAAGCTAAATAAACAGATGATT 
GCAGAATTTATGTGATTACTGGGTACCTCGGCCGCTNTAGAACTAGTG 
Sequence 1345 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCMGTTTGAGTTGAAACGGTATGTGACTTCCC 

CAGCTGCACCCTGGGCAGTGACTGCATGCATCACTGAGAGGTCCTGTCTACAGCAGATAA 

AACTCCACAGATCACTCCTCCTGTAATCCCTCTAAGTGCTCCAAGGCAGCAGAAAGGCCC 

AGTGCATTGAGGCTGGAAGCAGGAGCAGAGACTCTGGGATATAGTGCGAAAGTCTCTTTC 

CCCTGTAGTTGGGCTMTCTGGAAAMCTCAAAAACCTGGCCTGATTACCGAGGTTTCTT 

TTATGGATATTTAGTATTTAGATAMATTTrTACAGTATTCTTGAAATGAACCCAATTAA 

ACACATAGTTCTCAGTCTTGACCACACATTAAGAATCATCTGGTAGACTTCTGTAAACTA 

CCAATGCCTGGCCA 



Sequence 1346

CCGCGGTGGCGGCCGCAC I I II I I M 1 1 1 1 1 II I J 1 1 I ACCTGAAAATGCTTATTCTAGC 

TTCACATTTGATTGTTTGGCTMGMGAAMTTATTTATTAGACTTAATTTTCC 

GTTTAAAGATTGCTTCAGATCTTAAACTTCTAATGAGGAAAGCTGAGAAGTCCAATGCCA 

TTCTGATTCTTGCMCTTACMGTAGTCTTTTTTTGTCTANACGCTTTCAGGAC 

TTTTCCTCAGTCAGTGTATCCAMCCTTCACAGTGATATCTTTTGGGTACCT 

Sequence 1347 

CTCCCCGCGGGGGCGGCCGCCCGGNAGGCNAAGCTTCGACCCACGCGTCCGCTTTAAAGG 

GAATTCTNTGTAGAGTGGGAGGCGAACACGNCTGGNNCTTCCAACTCAGGAATTCTCGTG 

GCTGGGCTGGGTCAGCGATGGCTTTGTCTCTTTATGTCTAAAGTGCX)CTATGGCATGCTG 

AAGGTTACCTAACCATTCTTTAAAAGGAGAATGACCCTCCATGGGAATGGCCAGCCTGCC 

AACTGTGCAATTGAAGAAGACCCGATGGATCAACCCCATGTCTTCCTTGGGGAGAAAGTG 

CATAAACCAGGGGTCCCTTTTTTTTTT 

Sequence 1348 

AGGTCAAGCTTCGACCCACGCGTCCGCAAAAATCAATCAAGGGTTCCTACTCAAGTAAAA 

AGCMCTTGTAGGAAMTAATAGGGGATATATTTTGCTCATTMGGATCTTTTTATAGTG 

GCTCTTGGTGCACTGCCTGTGAGTTAGCCTTATCCTCAAGGAGCAGCTTAAAAAAAAAAA 
AAAAAAAAGT 

Sequence 1349 

GCGCGTATACGACTCCTATAGGGCGAATTGGGAGCTCCCCGCGGTGGGCGGCCCGAGGTA 

CMACTATGTATCTGAMCACTTCTATTTGGCAATTTTATAACAAATCAAATTTTAAAM 

GMCAAAAGAGATTGCAGATTACTTCGCAGATACAGMTAMGCMTTGATGAAGTGCTT 

AAGCAAAAGAAACMCAMAAAAGAAAACACACTGCTTTTCTTTTTAAAAATAAAATCAC 

ATTGCTATA GATCA AATGGATMTACCCTTATTAAACAACCATTCCAGAATGTCTTATAG 

TAGCAGTGCTTTTATTTGCACTrCACTTMTTTTATMGACTC^ 

TCTTTACCCCCATTGTTMCGAATAAAGTCTCTCATAATT^ 

AAAGCAMTG AGAM TGATTTATGTATCGTGGAACCTTTCCCATTTTGGAACCAAAGGTT 
TT AATTCT AT ATTTTTG N CT AAT ATTTCTTT AAAAAAT 
Sequence 1350 

CC GCGGTG GCGGCCCGCCCGGGCAGGTACTATCTATAMGGAGGTTTGATGTTTTCATTA 
CTGTTTTTGTAMTATTTCAGCATTATCTTTAAAAAGTAAGGACATTGGCCGGGTGCGGT 
GG CTCAT ACCTGTAATCCCAGCGCTTTGGGAGGCNGGTGGGTGGATCACCTGAGGCTAGA 
TAGTTTTATTCACTTGGCTGTTTCACCAAAAAAAAAAAAAAAAAAAAAGTGCGGCCACCT 

Sequence 1351 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGAGCAGGATTACCATGG 

CAACAACACATCATCAGTAGGGTAAAACTAACCTGTCTCACGACGGTCTAAACCCAGTAG 

AAACAAAGTGC GGCC GGCCGCCCGGGCAGGTGCCGCAC I I I I 1 1 I I I I ) 1 1 I I I I I I I AG 

MTAGGATTGAATTTTATTAAMACAAAAATAMTCTAAAMGCTTCCTTCAGTTACAAA 

TATGCACMGAATTTCTGCATTACATCATTTGACATAAAATGTTCTGAATGACAGAAGTA 

GMGTAGAACTTACTACCATTTGMGACAGGAGTTGAGCGCTGAAAACCACACACATTTA 

TAGAAAGAAACCAAAGTTTCACAGGGAAGACCTGTGATCTCTGGCTACAGGAGCTGAAAT 

TAGGAACATGAAAGAAACTTGGAGAGAGAAGACATTCAATACTCTAAAATACTTCAGCAA 

AAATAGTCAAACATNTGTNAACAACTTGGNACAAAACTTTATATGGTGGGGGGTGGCTAT 

GCCGGMMANTCTTNACTGGNTATTATTCACCTCAMMGGGGGNTTTTAATGNTCACG 

AATCCTTCCTTTAAMTAMANAMGCNTGGNTTNTTTTTCTGGNGTCAAGAGTAAAANG 

TANTAGGGNNTACTCAGGGATGGTTTGGMTTTTAAAACGGGGCNTTCCACCCTTGGTGG 

CTNGTGGCANTTTANCCCAAAACGGCNAAANAACCGGCCCGNGGTCAACTTGNAAAACCT 

GGCGGNTTNAAATANGGACCCCGGGTGGGGGATTNANTNAGNTTTGANCCGNNNCTTGGG 

GGGGCCGGCCC 

Sequence 1352 
CGAGGTCAAGCTTCGACCCACGCGTCCGGTTACTAGAGCCCTTGAAGGGCAGCACTTGGG

TTTAGTTGGCCTCTTCTGTTTTTACCTMTTTCTCAATCCCACCTATT CI 1 1 1 1 I CTCAC 

TTTCTTAAATCCCAGTMTTATCTTAGGANMGCAGTTTTCTATGCATAGACAGTTCATT 

TAGATTGGTAACATGGGCTAGGTAAAAAAAAAAAAAAAAAAAAAAGGTGCGACCTGCCCG 

GGCGGCCGCTCGAGGCCGCCCGGGCAGGTACTATGTCGATTCGACAGAACATTCAGAAGA 

TTCTCGGCCTTGCCCCTTCACGAGCCGCCACCAAGCAGGCAGGTGGATTTCTTGGCCCAC 

CACCTNCTTCTGGGAAGTTCTCTTGMCTCMGMCTCTTTATTTTCTATCATTCTTTCT 

AGACACACACACATNAGACTGGCAACTGTTTTGTAGCAANAGCCATANGTAGCCTTACTA 

CTTGGGCCCNTTTCTAGGTTTGAATTATTTCTMGCCTTTTGGGNATGATTAGAGNGAAA 

ATGGCNCCNGCAAACTTGNAGGGGCTTTTGGNNCCANMTGATTTTTAATAAAAAAAAGG 

GGATTGAATAGNTAMNTCMGGGMNGGTTTATGNAAAGGAAAAAAAAAAGCCTCCTTC 

NTGGTTTMTTTACAAMGGTNTTTT^ 

CCCGCAAGGTGGGNNGGTATTNACCCNTTTNNGGNTTNAAAAAAAAAANTTNGNNNGGGT 

NAACCCTGGAACNGGGGGGGTTNGGNG 

Sequence 1353 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGCAGGTCTTCGACCCACGC 

GTCCCGGGGTTTCAMCTGMTAACTCATTGTTGGGTCTGGATCTGTAAAMGGTTTTGT 

CACTGATGGGCAATATGGAGAGAATGTAAAAATATCTAACTTCAAACAGAAAAAAGAAAC 

AACTGGATGAAAGCTATAAACAATAGTTCAGAAGATTGGCGTAGAGGATTTACCTACAGA 

ACTTCAGGAGATTCCTAAGMGGCCTTCAGTGATCTTTTCATCAATATCATCAGGCCTTA 

TCATTGTTTCACATTTGCTTCTCTTTACCATAGGGMTATAATAATTATTTACTGG7TAA 

CTTCCTAGGGAGATTGCCTGCGGCTTATTTAAGATCCAAATTTTAMGTMTAATTTCTG 

TTGAAGCTGCTTGTGAGGTGGTTGGGTGGGCAGATAGAGTGAAGCCAGGGACACACACTA 

AATGAGCCCGGGATGTAGGCAGGTTTTGATGTTTTGCTTGCTTTTATCCCTMCAT^ 

Sequence 1354 

TATA GGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGATTCTCTGATTAGATTTTAAACTT 

TTTTGATGAMTATTGAGTCTTAACTACTTTAAGATGCCATAATACTGAATACAGTGCTA 

AGCAAMTAAATATTGACTAGTTCTCATTTCTATCTTTCAAATATTTCTAATGCTCCTCT 

TTAAGCATGGG CTCA GGTATCAGATGGCGTAGGTCMGATCTTGGCTCTACTGTTTACTT 

ACGGGMATACTTTTATGTTGCTAAATCTCAGTTTTCTCTTCTGTAAGACGGGATTAAAG 

TACCT 

Sequence 1355 

AGCTCCCCGCGGTGGCGGCCCGCCCGGGCAGGTCTTCGACCCACGCGTCCGGGGTTTCAA 

ACTGMTMCTCATTGTTGGGTCTGGATCTGTAAAAAGGTTTTGTCACTGATGGGCAATA 

TGGAGAGAATGTAAAAATATCTAACTTCAAACAGAAAAAAGAAACAACTGGAATGAAAAG 

CTATAMCAATAGTTCAGMGATTGGCGTAGAGGATTTACCTACAGAACTTCAGGAGATT 

CCTMGAAGGCCTTCAGTGATCTTTTCATCMTATCATCAGGCCTTATCATTGTTTCACA 

TTTGCTTCTCTTTACCATAGGGMTATMTAATTATTTACTGGTTAACTTCCTAGGGAGA 

TTGCCTGCGGCTTATTTAAGATCCAAATTTTAMGTMTAATTTCTGTTGAAGCTGCTTG 

TGAGGTGGTTGGGTGGGCAGATAGAGTGAAGCCAGGGACACACACTAAATGAGCCCGGGA 

TGTAGGCAGGTTTTGATGGTTTGCTTGC 

Sequence 1356 

CGCCCGGGCAGGTACTATCTATAAAGGAGGTTTGATGTTTTCATTACTGTTTTTGTAAAT 

ATTTCAGCATTATCTTTAAAAAGTAAGGACATTGGCCGGGTGCGGTGGCTCATACCTGTA 

ATCCCAGCGCTTTGGGAGGCNGGTGGGTGGATCACCTGAGGCTAGATAGTTTTATTCACT 

TGGCTGTTTCACCAAAAAAAAAAAAAAAAAAAAAGTGCGGCCACCT 

Sequence 1357 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTCGCGTCCCGAGTGTTTCCA 
CTCTGTCCATAAAATGGGAGCTAATATTCTCCAACCTGTGTGCCTGACATGATGGTTAAA 
GGGATTAAACAMACAATAGTTTGTAATTTATTCTGTCAGAGCAAACTGCTGGTAAATAA 
AAGGGCTAGTGACGAAAAATAAATTTTAAAAMCCTMTAAMCAAGTTTGTAATTTATA 
ATTQTATACAAATAAAAGATGTTACAAAAAAAAAAAAAAAAAAAAAGGACCTGCCCGGGC

GGCC GGCCG CCCGGGCAGGTTTTATTTAACATTCAAACTTCATTAAGACATGTGCAATAT 

GGCMTTTTACTGGGGATTAMCCCTACCTAGGATTGCTTGCTGGGGCTTAGCAACAGGG 

TCCAGTTCACACTTAGCACTAATTAMTACTTTATTGAATAAATACAATACCAAACAAAA 

TGCATTCAAA 

Sequence 1358 

CCGCGGTGGCGGCCGAGGTCAAGTTTCGACCCACGCGTCCGCATTATCCTTCTTGCCATC 

TACCCCATGTTAMGCCAGGCTGATTTGTTCTTGGATCTTAAGTGTGACAAAGCCTGCGG 

GAAGG AACCACATTCTGAAGGTTTGTGGGCTGTGTGAGATCCAGAGAACCCAAGGGGGTT 

TTTTTGCTCTTGACAAACGAATGTTAGGTATGACCTATGATGATACAACTTCTGCAAAAT 

TGAGGACCAGCCTATTTCTTCATTAGAAATGCAGGAAACCTGCCCG 

Sequence 1359 

CGCG GTGGCGGC CGAGGTCAACGCTTCGACCCACGCGTCCGGGACCTCAGAATATAAAAA 
TATGGI I 1 1 1 I I I I CAGACTTACTAGTTTTTTTTGATAATT(X)TCTACGAATGTTG ATTT 
MCTTAGAMTATGTAMTTTMTATTCAAMCCAMTTATTTTTTAAAGAGGAAAAAAA 
TATAAAACCTGCCCGGGCGGCCGGCCGCAC \ 1 1 I I 1 1 1 1 11 1 II ) I M I 1 1 INAAACTTT 
AATAGNGTNCGGAAGNTGAATAATTTATGAAGGAGAGGGGTCAGGGTTGATTCG 
Sequence 1360 

ACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTGAGTCGACCCAC 

NCGNCCGGAGATGTATACTGCCACTATAGGAACTATAAGAAAAAGTCAAATGGAAATNTN 

ATAAATAAAAACCACAGTCACTATAATGAGGAAATACTTTGATANGGNGTCAGTGAACTC 

AAAAATN ANTCAATNGAAACTACTCAAACTAAAACTCAAAG AG AAAAAAAANG ATGGGAG 

ATMTTAT TTTr TAAGAATTGGTCATCAAAATGTAGCMCAAGTTCGCCTTATCCTATAT 

CATTTGAATTTTCAAAAMTAAGCTCATTATACAATCTTTAAMTATTTTGAATAGAACT 

GTTTCATGTGTTATTNGTGAAAAT 

Sequence 1361 

CCGGGCAGGTCTACTCAAGTAGTCTTTACCCCCTACTCAAGTAGGGGGTAAAGTGTAGAA 

CMGGAGTTTGATCTGTGTTCAACTGATTGTGAACCATCAATTGAGATAACTCACTACCT 

TCAGGCCAGCC AGTTA CATACTTTTGAAAAGCCMGAGTGMGCAGGGTTGTTTTTCATC 

CMTTCTTGGTCTTTTTGTTAMGGCAGCMTAAGATAGGGTGGTTTCGGGCAATCACTT 

AGCTAATTGGCTCTCTATAGTCATACCTGGATAATATTTGTAGTCATACCTGGATAATAT 

TTAMGGMGAAACTAAACATAGTCCTTMGTAGGAACAACTACMTTTTAAC 

Sequence 1362 

ACTGI I I 1 1 I I I I ATTTGTTGMGTTGTTGTTGTTATTTCAGTCTTTTTCTTATTGGGTT 

GACCAGACTTGGTAAMTCTGTAAGAMGTTCCATAATTATGGGGAAGATTTCCTCTGAA 

TTGGCTAAATTCCTGTAGCTGAAAAAAAAAAAAAAAACCTGCCCGGGCGGCCGGCCGCCC 

GGGC AGGTTACMGCTTCGACCCACGCGTCCGGGAAATTTTAATTAAAAATAGGTGAACA 

TTTTA MTGAC CTMTACATATTTAGTCCACATTGAMCTTTGGCATTTTGNCATTGCCA 

TTAAMTTTTTGATGGCATTAAMTTTTNATGCCATTAMMTTTTTGATCAG 

CA 

Sequence 1363 

CCGCGGTGGCGGCCGAGGTACCACGGTTGTCCCCTGAAAGGTGTTGTGTCCCTCACCAGA 

CTGGGAGCACCTCMGGGCAGMCCCATGTCATGTTCCTTTTTGTATTTCCAGACCTGAA 

ACTGCC AGTAAATAAACCCTAA AAGTAGAMGAAAAAAAAAAAAAAAAAAAGTGCGGCCG 

CCGCACI I I I I I I 1 1 I I I I I I I I I I INGGAAMCCAMCATGCTTTATTTCATTTTTTTC 

ACAATTTATTTAAACATCTCANATATACAAAATAGGTACCT 

Sequence 1364 

CCGGGCAGGTCAGGAGTGTCCCAAAGATTTCCCAAGTCCAGCCCAGAGAAGCTGAAAGCC 
TTTCCCCCAGGTGTGGGGCTGAGTTAGATGTGGGTCATAAAGGATGTGGCCTCGAGGCTG 
GGAGGCAGCTGGGCAAAGTGGGAAGCCTCCCTACTCCTGAGACAGTGATGGCTCAAATCC 
AGGCCAACCTGGAACATGATCCTCAACTTCTCTMGTTCACCTTTCCCAGGTGTGAAATG 
GGTTGTTCTGGGAATTGAGTGAGCTAATGATACACTCCCTGGCACACAGCGAGCCTCAAA
ACGCTTGTGTCCCCTCCCTACCTCACAGCCCATTTTAGMGTTTGCTGTCACTTACTTTG 

GAGTCAGCAAAAACATATTCCTTCCGCAGGATCTTCCGGACGCGTGGGTCGAAGCTTGAC 
CT 

Sequence 1365 

TACTATAGGGCGNATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCC 

GATTGATTMTACCTGTCACAGATACATTTTGGTTTACAAATCAATGAACAATGGAGGGA 

ACTCTGTCCTTAATCTTGGTACGAGACAATGAACCCCAGGTACTTACCCCAGACAACGAC 

GCCGCTTNACCATGATGATGGACAACAGGCAAC I I I I I 1 1 I I GGAGTTTCAGCTTGCTTC 

CAACAGGGACGGTGAGTGTGAGGTTTATTCCCATTTCTAAGACGATAGMGTTTTCAGCC 

TMGCCGTATTCCTAGGTMGCAGCTGGATTGCAGTTTT^ 

TTGNNTNANNCGTTMNATTMCMACTAGCT^ 

TANAGGTNCCCCCANAMTTNAAAATNAAATTTNTGCAATTMTTTTTGAACCTTGCCCC 
GGGGTGGGCCCTGGCCCCCCCCTNGACAANGN U M I 1 1 M I I II I I I I II I I 
Sequence 1366 

CTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCX5GGGCAGGTCTTGAGTCG 

ACCCACGCGTCCGGAGCTGCTCAATAGTGAGAATCAGGTGATATAATGCATGTGGAAAAA 

GAATGTGAAAAATCTAACACTTTAGATTGTATACAGTGTTTTTTAAAAAGACACAAAAAA 

ACTGTCAACATGAGAAACATAAGCAAAGTTTTACTCAAGACAAACATCCACGAGTCACAA 

CTTCAGTTATTCCCAGTCTTCAAAATAACAGAAGGGCAAAGCAAAGGTAAACATGCAAA 

Sequence 1367 

GAC TNCTATA GGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGATTCyrCTGATTAGATTTT 

AAACTTTTTTGATGAAATATTGAGTCTTMCTACTTTAAGATGCCATAATACTGAATACA 

GTGCTAAGCMAATAMTATTGACTAGTTCTCATTTCTATCTTTCAAATATTTCTAATGC 

TCCTCTTTTATAGCATGGG CTCA GGCATCAGATGGCGTAGGTCAAGATCTTGGCTCTACT 

GTTTACTTACGGGAAATACTTTTATGTTC 

GATTAAAGTACCT 

Sequence 1368 

CCGCGGTGGCGGCCGTTAAAGGAATAATCTGCAGAACATCTTGATTTACAAGGGACAAAA 

TGATGCAAATTATATGCTGTCCAACCTACTGGTGAACTGGATCAGAATGGTCCAAGGACT 

GTTAMCAGAGGAAGTATTTACATTCTGAAAACTTGCGGACGCGTGGGTCGAAGCTTGTA 

CACCTCGGCCGAGGTACCTTCTGTCACAAAGACCCAAGCTTCCTCCAGCTTCCAGGATAG 

CAGTCAGCCAGCTGGAAAAGCCGAAGGGATCAGGGAGCCAAAGGTGACTGGGAAGCTAAA 

GCAACAATCACCTAAATTACAGTCCTCCAAGAAAGTTGCTTTCCTCAGGCAGAATGCCCC 

TCCCMGGGCACAGACACACAAACACCGGCTGTGTTATNCCCATCCAAGACTCAGGCCAC 

CCTGAAACCTAAGGACCATCATCAGCCCCTTTGGAANGGGCC 

Sequence 1369 

CCGGGCAG GTCG AGCGGCCGCCCGGGCAGGTTTCCTGCATTTCTAATGAAGAAATAGGCT 

GGTCCTCMTTTTGCAGMGTTGTATCATCATAGGTCATACCTMCATTCGTTTGTCAAG 

AGCAAAAAAACCCCCTTGGG7TCTCTGGATCTCACACAGCCCACAAACCTTCAGAATGTG 

GTTCCTTCCCGCAGGCTTTGTCACACTTAAGATCCAAGAACAAATCAGCCTGGCTTTAAC 

ATGGGGTAGATGGCAAGAAGGATAATGCGGACGCGTGGGTCGAAACTTGACCTN 

Sequence 1370 

CCGCGGTGGCGGCCGCCCGGGCAGGTGTCGACCCACGCGTCCGACGACTCACTATAGGGA 
TCTAGATCACGAGCGGCCGGCCGCCCGGGCAGGTACAGAGATTTAAATGAAATCTTCGAA 
AGMTAMTTTGCTTTTCAGTCCACTGTATTTTCAAAATTT 
Sequence 1371 

CCGCGGTGGCGGCCGAGGTACTTCAMGTTATTGCACATACACTTGTTTACTTTGTATGT 
TTTGCAGGATTAAACTTTGTATAATCTTTTTACAAAA j I I ) 1 1 1 ) I CAGTATGCAAGCTT 
GCAAGATGAAAATAAAACCTGTTTGCCTGATAAAAAAAAAAAAAAAAAAAAAAAGTGCGG 
CCGGCCGCCCGGGCAGGTCTTGAGTCGACCCACGCGTCCGCCGGAGAnnnnnnnnnnnnn 
nnnnnnnnnnnTGATCTAGATCACGAGCAGCCGCACTTTN




CGAATTCAGGAGTCCTTTATTAGCCGGCAGCCGANAGACAGCTAGCGCTCAAAATTNTCT 

TGGCCCCGAAGAAGGAGCTAGATTTTCTTTTATAC 

Sequence 1372 

TTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACATAGCACATTACATGTCAA 

ATCCCTTCTCGCCCCCATGGATGACCCCCCTCAGATAGGGGTCCCTTGACCACCATCCTC 

CGTGAAATCAATATCCCGCACAAGAGTGCTAAANAAAAAAAAGNNANAGGNCTGCCTNTG 

TnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTNGNTATCAAGCTTAN 

TCGATACCCGTTNACCTTGNGGGGGGGGCCCGNTANCCCANTTTTTTNnTTCCNTTTTAN 

GGGGGGGGTAAATTTNGCCCCNNNTNGGGGNNAAAAAAAANGGGNAAAAAA AI I 1 1 1 1 1 I 

CCTNGNGGGNAMNTTTTTTATNTNNTTTTTAANANANTTTTNNCCAAAAAMTNTAAAN 

NGGNCCNGGGGGNNNNNTAAAAGNTAAAAAANNNNTNGGGGGGNCCNTTTNNGGGGGGGN 

CCNNNANNNNCCAATTNNNNNNGGGGGGNNCGNCNNTNTTNCC C I I II 1 1 1 1 AAAAAANN 

AAAAAAANCNCNGCCCCCCXJCNTATTTTTNAATTNAAANCTCCCCCCCCNNGGGGGGGGG 

G 

Sequence 1373 

CCGCGGTGGCGGCCGCCCGGGCAGGTTATAGACAATATGCTCCTTAAGGTCCCTTTCAGT 

CCCGTTCTATGGATCTGTGTAGTTTCGCTTC I I I 1 1 1 CAATATGCTCAGAATTAGGACAC 

CAATGTTAATGGAAGATAAGGAAACTATACCACCTATCCCTTATAGAAGATTTGTGCACT 

AACTAATATGAGCCCTGGAAGATCAAGCCAGTAGAAGATAGAAGATCTATCCCTGCTTTA 

TACTTTGGATCATTTATTTGTGMGATCACMCTTTCAMGTTTTATTATTTCTTAGGTC 

TTCATGGAAGTTCGGGGAAATTAACTGGATCTACTTCTAGTCTAAATAAGCTCAGTGTTC 

AGAGTTCAGGGAATCGCAGATCTCAGTCATCTTCCCTGTTGGATATGG 

Sequence 1374 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCG 

TCCGAAGMGTGCTATGTGGCGAGAAAAAAAAGTTTTAATGTATTGGAGAAGTTTTAAAA 

AACCCAGAAAAATGCC 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 GAACATAAACTCAAGATTTTATTGTCTTC 

ATAATAAAAGATGACACTTAGAACTGGATCACTTGGCCCTTCTCTTCTTATCTCCTCCCA 

GTTCAAMTGCTTGCATNTTTTMTAGCCAGCATTNTTTTAANATCTGCAGGTNGGGCTT 

AAACNCCNCTTAAAGCCTTTAAACACAAA 1 1 1 1 1 1 1 1 1 1 GGNGGNTTTTAAGCCCTTTTT 

TCCCGGAAAAAANTCGGNTTTNAATTTtGGNCCNANAAAANNACCCCTTTTG I HI I 1 1 I 

NNNAANAAAAACCNCNTTTnTNNTNNGGGGGGGAACCNTGTCCCGGGGGGGGGGNCGTT 

TTTAAAAAAAAAAGGGGGNNCCCCXJCCGGGCNGNGGGGGA 1 1 1 I 1 1 I I NTAAANNTTTTT 

nNNNCCCCCCCCCCC 

Sequence 1375 

CCGCGGTGGCGGCCGAGGTACTTCAMGTTATTGCACATACACTTGTTTACTTTGTATGT 

TTTGCAGGATTAAACTTTGTATAATCTTTTTACAAAA 1 1 1 1 1 1 1 1 I CAGTATGCAAGCTT 

GCAAGATGAAAATAAAACCTGTTTGCCTGATTAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

GTGCGGCCTCGAGCGGCCGCCCGGGCAGGTACTCTATAAGAGGTGTGGGTGTCTTGTTTG 

GTCAGGATGTTAGAMGTGCTGATMGTNNCATGATCAGTGTATNCCAAAAGGTTTTTAG 

GAAG TATGGCAAAAANTGTTGTATTGGCTTTTATGGGGACATGATNNTAGTCNNCTTCCT 

TTTTMANANGNNTTATNTTGTTCNANTGGNTTAANTGGTTTTTAAAAA 

Sequence 1376 

TACGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGAC 

CCACGCGTCCGGCCACATTTTCMTTTAGCCATTTTTCTCnTATTCACCTTTTTCTGCTA 

ATTACTCTGTAATTCCACTAAGAAAAGTCAATAGATAATTCCAATAATGACTTCACTCCT 

GAGMTTTTATTAGCTGCTAACGCTTGTCTCATCATAAGCACTCATATGTTCATTGAGTA 

AATATTTATTGAGTATTTGCTATGGTCCAGGCACTGTGCTAAGTATTGAGGATAAAATGG 

TGATTGAMCATTTTCCCTTCTTGATTTTAACATCTACAAAATAAAA 

Sequence 1377 

CCGCGGTGGCGGCCGCCCGGGCAGGTAATCACACAACACTTTCTTTTCCAACTGCTGCAA 
AGTGCATC TACMT ATGCTATTACAGATCCACTTTTAAAAGGTTTCCtGTGACATTACAG

CMGCCTCTTnTTCAAACAGAGGAATAATCCCAAATTCTTCCTCAAATAAACTCCATTC 

CAGTAAATGGTAAATACATAAAAAATTACAGTAAGCCAGACACTTAAAAGGACAGCCAAG 

AAGTCTTCCAACA GTTTA TTAGAAAGAATGTAGACATTTAAAAAAATCCCCACTGTCATG 

AACATAAATTGAGGTTTT CAGCC CGGGTATAAGCTGAATCAAAAAAAGGAAATAAAAAAT 

CCMTAGTGTATTAMCATTTTTCACTCATTTGCCATACTGACAGTGCAAATCAAATCTG 
GACTAA 

Sequence 1378 

CCACTGGATTTGACTCAGAGAGGACCCCCAGAGGGTGTCTCCATCTTCCCTATTTATTTT 

CAGCCCTTGAGGGCTTCATTGTAGATCAAAGCCAAGGCCCCCAGGAAGGNGACATACTCC 

TGGMGTTCACCTCCTGGTCCTTGTTCCGGTCCAAGTCTTCCATCAGCCTTGCAATTTCA 

GCATCCTGCAGCTTCNAGCCAATGGTGAGCTCCTTCTGGATCAGCTCCTTCAGCTCCTTC 

TTGCTCAGGGTGTGCTTGTCACCCTCCCTGCCGGAGATACCTGCCCG 

Sequence 1379 

CTACTTAGGGCGAATTGGAGCTCCC CGCG GTGGCGGCCGAGGTACTCCAGGGATGGGAGG 

CACMGTTG TGATTGGG CAAAGTTTATTTTCTATGTCAGCCTGTCAGTCCACTGCCCCAT 

TTTGCAAGAC I 1 1 1 I 1 1 1 AGCCTTGACAAAATGTCTCAGTTAAGTATAAAAGTTTTTCCA 

CTACTTAGTCCAAAAAAAACTATTAAATCTTAATGAAATAGCCACTCTCAAAAAAAAAAA 

AAAAAAAAAAGTGCGGCCGGCCGCX3CGGGCAGGTCTTCGACCCACGCGTCCGTCTTTTTC 

TTCCCAAACATAGACTTGCAAGACATGGCCTGTATGAGAAGAAAAAGACCTNAAGAAAGC 

AACGAAAGGAACGCAAGAACAGAATGAAGAAAGTCAGGGGGA 

Sequence 1380 

CTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATAGCACATTACAGTCAAA 
TCCCTTCTCGCCCCCATGGATGACCCCCCTCAGATAGGGGTCCCTTGACCACCATCCTCC 
GTGAAATCAATATCCCGCACAAGAGTGCTTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAGTGCGGCCTNGAGCGGCCGCCCGGGCAGGTACATNTATTTTGGATTGTATATTGNG 

TTTGTGTATTTACGCTTTGATTCATAGTAACTTNTTATGGAATTGATTTGCATTGAACAC 

AAACTGTAAATAAAAAGAA 

Sequence 1381 

CGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAATCTGTCAA 

ATATACTATGAAATGCATAGTCTCCACTTAAAATGCTGAATGACACACACGTTTTGCAAG 

CATTACTGCTTTCCACAAAAACTGCTGAATAGGAGTTCCGTCCCTGCCAAGATCAGTGTT 

TAAGAGATACTTTATGATGCTGATAAGTATTATTGGTGGTGGTGGTGTTCAGAAAGTTTG 

TCACTCA TGCAGAT GTCTGAAATCTTGTTCCGAATCCATGGAACATAGGGTGGAGGCCAG 

CTCCCCCTTTTTTTAGATGATCACATAGTTCTGAGCAGAGATGTGGTCCTCACCCTGCAG 

TTCCTGCAGGAGCTGCTGCTGCTGGGATGGCTGCTGG 

Sequence 1382 

CCTAT AGGGCG AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCG 

TCCGAI 1 1 1 1 I IAAAGTATTTTCTAGTCTTTTCTCTCTCTGTGGAATGGTGAAAGAGAGA 

TGCCGTGTTTTGAMGTMGATGATGAAATGMTTTTTAATTCAAGAAACATTCAGAAAC 

ATAGGMTTAAAACTTAGAGMATGATCTMTTTCCCTGTTCACACAAACTTTACACTTT 

MTCTGATGATTGGATATTTTATTTTAGTGAMCATCATCTTGTTAGCTAACTTTAAAAA 

ATGGATGTAGAATGATTAAAGGTTGGTATG A 1 1 1 1 1 1 1 I I A ATGTATCAGTTTGAACCTA 

GMTATTGMTTAAAATGCTGTCTCAGTATTTTAAAAGCAAAAAAGGAATGGAGGAAAAT 

TGCATCTTAGACCATTTTTATATGCAGTGTACCTGCCC 

Sequence 1383 

CGAGGGAGTCTATTGGAGCCCTTGGGAACTTCTGGATCAAATTGGACCTGAATTGAGATC 

^ATTTCTCAGCTTTCACTTATGTGAGCCAATAMTTCCTTTTTTGTTGAAGGCAAAAAAA 
AAAAAAAAAAAAAAAAAAAGT 

Sequence 1384 

TACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCA C I I I 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 
TTTTMCGTTTTAATTAAATGGATTTATTTAAAAAGACTATAAAATCTGACATCAAGAGA

GATAAAAAAAAAAGACCCATAAGATTTAAATTGACAAATGTAAAATGATTGGCTACAATG 

TAAAAATACATTTNCCAGCCCCCAMCAAMCACMGTATAGTAATTATAAMTTTTTGG 

ACCTGCCCG 

Sequence 1385 

AGGTCCTAGCTTGAGTCGACCCACGCGTCCGGCCGCTGTTCGTATTTCTTATTCTACAAC 

MGGGTCAGCCTACAGGCAAAACACATCCCATTGTCATTTTTTTGTAAATAAAGGTTGTA 

TTNGGMCATGGCCACTCTCATTTGTTTTCTATTATTTATGGCTGCTTTCACTTACM 

TGA GTGG TTGCCACAGAAACTGTATGGCCTGCAAAGTCTAAAATATTTACTATGTAGCTT 

TTTCTTTTC 

Sequence 1386 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTC 
CGAGMGAG TTTG CAMTGCMCAAMTATTTMTTACCGGTTGTTAAMCTGGTTTAGC 
ACAATTTATATTTTCCCTCTCTTGCCTTTCTTMTTTGCAATAAAAGGTATTGAGCCA^ 
TTTTAAATG ACA I I I I IGA 
Sequence 1387 

CTACTATAG GGCGATTGGAGCTCCCCGCGGTGGCGGCCGCA CI II I I l l 1 1 I I I j I I I 1 1 
I I I II 1 1 I I GCCTTCAACAAAAAAGGAATTTATTGGCTCACATAAGTGAAAGCTGAGAAA 
TAGATCTCAATTCAGGTCCAATTTGATCCANAAGTTCCCAAGGGCTCCAATAGACTCCCT 
NTCACCCTGGTACCTGCCCG 
Sequence 1388 

TAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTCCGG 

GGAGGACCTAGGCAACG GCCT GAGACTCCGAGACTCTATGTTGAAGATGCCTGGACTAAC 

CTACTGMGATACCGTGGTTTTACCAACAGCCAGCACCAATAGGAAGATATGAATGAAGC 

CATCTGAGACCAGCCATCTGGCAGCCAAACTGCCAACTGACTGCAAATGCATGAATGATC 

CCACTGACACCACGTAGAGCACAAATGAGTTGCCTCCACTGAGCCCAGCCCAAATTGTTA 

TCCTATAAMTCATAAAAACATAMCAGTTGTTTTAAGTCAAAAAAAAAAAAAAAAAAAA 

A7TAAGTGCGACCTGCCCGGNNCGGCCGGCCGCCCGGGCAGGTACCCATTAATTTGCTCA 

GATATAGCAGGCTTMTGGTTCTATATTTTCAAMGTTTTTAAGAATGGTT 

Sequence 1389 

TTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAATAAGCCCACC 

CCACTAGGMCTATGTTAAAAAAAAATTCMGAAAGMTTTAAGGGAGATTACAGTGTTA 

CTGTGACACCAGGAAMCTTAGMCTTTGTGTGAAATAGACTGGCCAGCATTAGAGGTGG 

GTTGGCCATNANMGGMGCCTGGACAGGTCCCTTGTTTCAAAGGTATGACACAAGGTAA 

CCCNTMGCCMGGCACCCAGACCAGTTTNCATACATAGAAAGTTACAGCTGCTTTTATA 

CCCCCTTGCCCCGCCAACGTAGTTAAGAGAACAGCAGCATAAGCGGCTGGCAGAGGCAAG 

GAMGACCAGTAGAGAGAAMAAAGGCCATCTATACCAATTNTAAGTTAATTTAGACTAA 

ACAAGGTCTTMTAGCAMGGATMTTGAMTCCCAMCTTACAAGGTTTTTTAAC 

Sequence 1390

CCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGGGTTTCATGAATGGAAACCTAAG 

TAAMCTAAGCTCATTAGTGACAGACTTGTTTTCTTCTTGTTATTCCTCCAGCAACTCCC 

TCACCACCACGCCTCCCTGCCTACCATCCCCGGAAGGGTGCTTATTCTTTAACAAAGAGA 

ATCTAAAAAAAAAAAAAAGTGCGGCCGGCCGCCCGGGCAGGTGAGAAAACAGACCATATT 

TACTCACATMTTCGCCTTCTACCTTTCACCTGCTTATGTAATAATTTAACACTGTAGAG 

GGGACATGGAGGTGACCGGAGTATTTAGTGGGTTCCTTGCTCCTGGGCTGGGCAGGTTCA 

CAGGCCCCACAGGCCTTGGGCCCCAGCATC 

Sequence 1391 

CCGCGGTGGCGGCCGAGGTACTCTGAGGTCACGTCAAACTATGCTTTAGAACCTTCATCT 
TTTGCTTTCTTGGGCTTTACTTTCCAAMTGGACTACAGGATMTGAGGCTTTTTTAA^ 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGT 
Sequence 1392 
CCGCG GTGGCGGCCGCACCTGCCCGGGCGGCCGCTCGAGGCCGCACI 1 1 1 1 I I 1 I I II I 1
I 1 1 I I M I I I I IAAAMTTCAAMMTTAGTTTATTAGCTTAATATAATTAGGTCAATGG 
AATCCTGTTTTGATCTCMTACTTCCCATATTGCAATATATAAATGNGACAAATTCAGCT 
GTTTTGTGGCATAAATAAGTGTCTAAGCTGGGCAGTTAGTCTACCC 
Sequence 1393 

CTTAGGGCG AATTG GAGCTCCCCGCGGTGGCGGCCGAGGT C I I 1 1 I U l I I I II I 1 1 I CT 

TTTGAMCAATTTTTCTGAMTTTATTTCTAAMGTCAGAGACAAMCTTTAGGAGTGAC 

ACATTTATACTAAGCATACATGCGTGAGCAAAAAAAAATAAGCACAGAATACAAAAATGA 

MTAGTAAMTTTTAATACAGTATTCTGAATACAAGTAGAATACCACTAGATAAGAATTG 

TATTTACCTAAGAAATCTATGATAGNGNGGGNGGAGATAAACCAGTTTAGGATAGCCACT 

TCACTATTCACATTTTAATCAGTGCTGACCAGAAGCTAAAGCAA 

Sequence 1394 

ACTTAGGGCGATTGGAGCTCCCCGCGGNGGCGGCCGNGGTACAAATAAGCCCACCCCACT 

AGGAACTATGTTAAAAAAMATTCMGMAGAATTTAAGGGAGATTACAGTGTTACTGTG 

ACACCAGGAAAACTTAGAACTTTGTGTGAAATAGACTGGCCAGCATTAGAGGTGGGTTGG 

CCATCAGAAGGAAGCCTGNACAGGTCCCTTGTTTCAAAGGTATGACACANGGTAACCCGT 

ANGCCMGGCACCCAGACCAGTTTCCATACATAGAMG^ACAGCTGCTTTTATACCCCC 

TTGCCCCGCCAACGTAGTTAAGAGAACAGCAGCATAAGCGGCTGGCAGAGGCAAGGAAAG 

ACCAGTNGAGAGAAAAAAAAAGGCCATCTATACCAATTCTAAGTTAATTTAGACTAAACA 

A 

Sequence 1395 

CCGGGCAGGT ACAAM GGGTTCCTCTATATGCCMCTMTTCCAAATTTTTACTTTTACT 

GCAAAAAMCCTTTTTGGCATCAMACTCCATTGTTTCTCTGCACTCTGACACCATCATT 

TCAMGGGGCTCACATAAATGATCACTACTGCTCTCTCCCTAATTTTTGAAAAAGGAGTT 

TTGAGMTA AMCA GTGCTTTrATTATTAGCCAACACAAAGTGTGAGAAAATCATTCCTG 

AGAATTMCATTTTMGCTMCAGMATTCAGTATACTTAAAACATAATTATATTTAATG 

AGTCATTATTTGGATCTAAAACGGACGCGTGGGTCGAAGACCTCGGCCGCTCTAGAA 

Sequence 1396 

CCGGGCAGGTACCAGTTTGAGTTGAAACGGTATGTGACTTCCCCAGCTGCGCCCTGGGCA 

GTGACTGCATGCATCACTGAGAGGTCCTGTCTACAGCAGATAAAACTCCACAGATCACTC 

CTCCTGTAATCCCTCTAAGTGCTCCAAGGCAGCAGAAAGGCCCAGTGCATTGAGGCTGGA 

AGCAGGAGCAGAGACTCTGGGATATAGTGCGAAAGTCTCTTTCCCCTGTAGTTGGGCTAA 

TCTGGAAA MCTCAA AMCCTGGCCTGATTACCGAGGTTTCTTTTATGGATATTTAGTAT 

TTAGATAAMTTTTTACAGTATTCTTGAAATGAACCCAATTAAACACATAGT 

Sequence 1397 

AGGTACTTTAATCCCGTCTTACAGAAGAGAAAACTGAGATTTAGCAACATAAAAGTATTT 

CCCGTAAGTAAACAGTAGAGCCAAGATCTTGACCTACGCCATCTGATACCTGAGCCCATG 

CTAT AAMGAGGAGCATTAGAAATATTTGAAAGATAGAAATGAGAACTAGTCAATATTTA 

TTTTGCTTAGCACTGTATTCAGTATTATGGCATCTTAAAGTAGTTMGACTCMTATTTC 

ATCAAAAAAGTTTAAAATCTAATCAGAGAAT 

Sequence 1398 

AGGTTTGAGTCGACCCACGCGTCCGGATTGATAGCTCTTTCTCGATTCCGTGGGTGGTGG 

TGCATGGCCGTTCTTAGTTGGTGGAGCGATTTGTCTGGTTAATTCCGATAACGAACGAGA 

CTCTGGCATGCTAACTAGTTACGCGGACCTGCCCGGGCGGCCGGCCGCCCGGGCAGGTGC 

AAGATTCCTGATCGGTATACAGTGATGTATTTACTAAACAGAGACCTGTGCAGAAATTAC 

ATAC TATC CATCTAGATAGGTTGTTACACTTTTGCCTATTGATGGMTAGTTCCATTTAT 

C AAGTTTTATAC ATC AA AA AG CTTTTG AAGTTC ACC AG ACTGTCCAT 

Sequence 1399 

CCGGGCAGGTACTGTAAATCTACTGTAATCCTGTTTTGCAGAATACTGCACGACGGAGAT

TGAGAAGTGAGAGCTCTTATGACATAGATAACATTGTGATTCCCATGTCATTAGTAGCCC 

CAGCTAAATTTAGAGAAACTCCAATATAAGGGAAATACTTACTCCCAGGGTATGGTATAC 
TTTACCATCTTCATA NTTTTT TCTTTCCCTTCCCTTCCCTTAAAAAMCTNAA N I II 1 i C
NMGGTGGAAGMNTTTTTMTTNMNTGGAAAGGGANGCTTCCCTTCTTCCCCAGTTCC 
CTTCTTAGCCNATGGGAGGGGGAAACCGGG 
Sequence 1400 

CCGCGGTGGCGGCCGATTCTCTGATTAGATTTTAMCTTTTTTGATGAAATATTGAGTCT 

TAACTACTTTAAGATGCCATAATACTGAATACAGTGCTAAGCAAAATAAATA7TGACTAG 

TTCTCATTTCTATCTTTCAMTATTTCTAATGCTCCTCTTTTATAGCATGGGCTCAGGTA 

TCAGATGGCGTAGGTCMGATCTTGGCTCTACTGTTTACTTACGGGAAATACTTTTATGT 

TGCTAAATCTCAGNTTTCTCTTCTGTAAGACGGGATTAAAGTACCT 

Sequence 1401 

CCGGGCAGGTACCAGTTTGAGTTGAAACGGTATGTGACTTCCCCAGCTGCACCCTGGGCA 

GNGACTGCATGCATCACTGAGAGGTCCTGTCTACAGCAGATAAAACTCCACAGATCACTC 

CTCCTGTAATCX)CTCTAAGTGCTCCAAGGCAGCAGAAAGGCCCAGTGCATTGAGGCTGGA 

AGCAGGAGCAGAGACTCTGGGATATAGNGCGAAAGTCTCTTTCCCCTGTAGTTGGGCTAA 

TCTGGAAAAACTCAAAAACCTGGCCTGATTACCGAGGTTTCTTTTATGGATATTTAGTAT 

TTAGATAAAATNTTTACAGTATTCTTGAAATA 

Sequence 1402 

AGGTACTCCCATTT(XCTGAAACMGCAGCCAGCAACTATCTCAGAAATGTGTCATTTTT 

ACTGGTTATAATTCTTAAAA AGCTT GTTTTCCTAAGATATGAAATGCCTGCCAGTATACA 

MCTGCTGTAACTACTTCCCTTTTTGCTnTAGCGGGGAAAAAATAGCTTAATGACAGCA 

TAGM TCATG TAGTAAATATAATTCATTTTTTGAMGGTTCAGCTATATCCTCTTCCATT

TGTTTATT TTAAAT GATCTMTTGCAAACATGTCATCACTCCCTTGATGTTTACCTNCTT

GTTATGCATTTTTAGCAGGCTTTATTGTCACC 

Sequence 1403 

AGGTCCTAGCTTGAGTCGACCCACGCGTCCGATTTTTGCCTCCAGACTACAGATCAGAAA 

ACTGAGACTCAGMTGTTTCAATTCCTTGTTTAAGATCACAAAACTAGTTTGAGGTATAA 

TGGAAACTGAAAAAAAAAAAAAAAAAGTGCGGCCGGCCGCAC I I I I 1 I I M 1 1 I I 1 1 I N 

CMTATTATTTATCA MATAA ATTTATTAAAAGTATTCAAAGACCACTTCAAAGNGTAGC 

TGCCTTCAAGACAGATTTTTGGCACTCATMCGGACACTGCAGTTTTCAACACCATAGCA 

CTCATTCTATTTCACACATCATTTTTAACAA 

Sequence 1404 

AGGTGTTAGTTACCACTTCATTACTGGAGGGCACTGTCACAAACTTCTGACTATCCAGAC 

TTGAAGCTGGAAGCAAATACAAGTCTGAGGGGCTCTAAGCTGGGAGGTTCTGGCCTCTCC 

CTAGCTCTCTATGGCTCTACCTCTCTGCTTGAAGCTCCCTGCACTGCACTCCCATTACTC 

TGACTGGGGATAGGACCACTGCTGACAGGGCCCCACCTTCAACTTCTTTCATTGCTCCTC

TTTTCAGGAAATCCCCACCCTGGGGATACTTCAAAAGACCT 

Sequence 1405 

AGGTGATTCAGCAGGTCTGGGGTGGGACTGAGAGCTTGCATCTCTAACAAGCTCCCAGCG 

AGGCTGATCCTGTTGCTCCAGGGACCACACCTTGAGAACCACTGGTTGGGCATTGATGAG 

GTCAACCAGGAGAAGCAGTGTCCCCTAGAACTGGCAGGAGAGAAAGGACAAGGCTAAGAA 

ACAGTGAACAGGAGTCAAGTAAATGCAGCTGCCAACAGGCGGGGGTCCTTGAGTTCACAT 

TCTTGGTTCCAGGTGACGTTTCCTGGGAGTCAACAACCCTTCTCCTATGAAAAAGAAAAG 

GGCCAGACACAGTGGCACACGGCTGTAACC 

Sequence 1406 

ATAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATGTCCCTAMTATTTAAC 
TGTTACTTGTAMCTTGTGTAATTTATTATTTATTTTMTCAAAATTCTGAATATTTC 
TTAAATGAAAGTTGCAAAAAAAAAAAAAAAAAAAAGTGCGGCCCGGCCGCCCGGGCAGGT
ACATACCTCCTTGCACAMTGGAGGGGAATTCATTTTCATCACTGGGAGTGTCCTTAGTG 
TATAAAMCCATGCTGGTATATGGCTTCAAGTTGTAAAAATGAAAGTGACTTTAAAAGAA 

MTAGGGGATGGTCCAGGATCTCCACTGATMGACTGTTTTTTMGTAACTTTAAGGAC 
Sequence 1407 
AG GTACTCA ATCTMTCCAAMTmcmCTTAGCMTCTATTTTCTGTAmAGA^

ATGTTTTTTATTTCAMGAGCCTCTCAMGAGCATITCACGTATCTTTTACTGTT^ 

CTCCACCTCCAAGGGGTCTGTCTAGATCAGTGCGGACGCGTGGGTCGAAGACCTGCCCGG 

GCGGCCGGCCGCCCGGGCAGGTACTGAGGACAAATCAGTTCTCTGTGACCAGACATGAGA 

AGGTTGCCAATGGGCTGTTGGGCGACCAAGGCCTTCCCGGAGTCTTCGTCCTCTATGAGC 

TCTCGCCCATGATGGTGAAGCTGACGGAGAAGCACAGGTCCTTCACCCACTT 

Sequence 1408 

AGGTACATATCACACATTTCCAAATTTGAGACCACTMTGTTTTTTMTTTCAAATATGT 

ATATAMTATGTATTC TTATTTC CMTTATTTCCTTGGCATGMTTCCTAGAAATTGATC 

TATTTAGTATMGTGCTTTTTTAGCTATATGTCCACTAGTATGGTATGAGAATGCCCTGT 

TTATGCCAGTATTATCATCATTGAATATATTACTGCTGATGTTGTGGTAATACATTTAAA 

CCMTGTGATGGGGCAAAMAATTATATTTTTACTTACATCTTTAAAATTACTGGNGATC 

TCTGNTATTGACAAGCTGGGCATANAAAAAGTAAATTAATAGAATT 

Sequence 1409 

CCGGGCAGGTCGGACGCGTGGGTCGAAGCTTGTACAAAAACCCAAGTATCACCTGAATTA 

CMTTATTCTTAAAATTTGTCCTTAMTAGC7TACTCTTGGAAGATTTGTTTCTATGTAG 

ACATTATGGTAAAAGTT ACTC TGAMCTCTTTTCTTTAGTTATCTGTTTATTCTGAGCTC 

AACAAGATTGMGTAAGTTTTCGGGAGCTACAGAAATTAAATCAAGAAAAGAATAATAGA 

GGATTATATTCAATTGAAGTGCTGGAGCTCTTCTGATATTATCAATTCTCCTTCATAGAC

ATTTTATAAAGCTCTTTTATGTGAACTCTTGCTTCATCCAGGCAAG 

Sequence 1410 

AGGTCTTCGACCCACGCGTCCGTTTTAGATCCAAATAATGACTCATTAAATATAATTATG

TTTTAAGTATACTGMTTTCTGTTAGCTTAAMTGTTMTTCTCAGGAATGATTTTCTCA 

CACTTTGTGTTGGCTMTAATAAMGCACTGTTTTATTCTCAAMCTCCTTTTTCAAAAA 

TTAGGGAGAGAGCAGTAGTGATCATTTTATGTGAGCCCCTTTGAAATGATGGTGTCAGAG

NGCAGAGAANCAATGGGAGTTTTGATGCCAAAAAGG I I I I I I I I GCAGTNAAAGTAAAAA 

TTTGGMTTAGTTGGCATTATAGAGGAACCCTTTTTTGTACCTGGCCCGGGCGGCC 

Sequence 1411 

AGGTGATTCAGCAGGTCTGGGGTGGGACTGAGAGCTTGCATCTCTAACAAGCTCCCAGCG 

AGGCTGATCCTGTTGCTCCAGGGACCACACCTTGAGAACCACTGGTTGGGCATTGATGAG 

GTCAACCAGGAGAAGCAGTGTCCCCTAGAACTGGCAGGAGAGAMGGACAAGGCTAAGAA 

ACAGTGAACAGGAGTCAAGTAAATGCAGCTGCCAACAGGCGGGGGTCCTTGAGTTCACAT 

TCTTGGTTCCAGGTGACGTTTCCTGGGAGTCAACAACCCTTCTCCTATGAAAAAGAAA 

Sequence 1412 

CCGGGCAGGTGCCTAATATATTTACTCTCTGGTCCTTTACAGGAAAAGTTTGCCAACCTC 

TGGCTTAGATGATCACCTGAGGCCAAGGAGCCTCGCCCTTGAGCACAAGACTATGTAGTC 

AGTAAAGCACAAACAAAATTGGGGCTTTCCCTAGCAAGGTTGGAAAGGCGGAGAAGAAAT 

GGATTTGGATAGGTAGTCMCAATGTCTGTTTTATGTTACCACACATTTTCTCGAGAAAT 

TTCMTCAGCTCTCTGAGMCAGATTCATCTTTAAATGAATGTTCATAGGTAACAGCAAC 

TCATGCATCMTGTTGCAMGTGAGCTCATTTTCACATTGCTTCAG 

Sequence 1413 

AGGTCAAGCTTCGNTCCACGCGTCCGGGAAAAACGGGGTTACTAGTAGCCGCCCATAGCC 
TGCAACCTTTGCACTCCACTGTGCAATGCTGGCCCTGCACGCTGGGGGCTGTTNGCCCCT 
GGCCCCCCTTTGGTTCCTGGCCCCTTAAANAACAGGCNGGTTTTATTAAACCCCAANNNN 
CCCGGNTTANAAGGGGAATTNAAAAAAGGGCCCCGGCTTTNGNAAAAAAAAAA 
Sequence 1414 

NCNGNCCAGGTCTACTCAAGTAGTCTTTACCCCCTACTCAAGTAGGGGGTAAAGNGTAGA

ACANGGAGTTTTGATCTGTGTTCAACATGATTGCGAACCATCAATTGAGATAACTCACTA 

CCTTCAGGCCAGCCAGNTACATACTTTTGAAAAGCCAAGAGTGAAGCANGGTTGATNTTC

ATCCMTTCTTGNNCTTTTTGTTAAAGGCANNAATAAGANAGGGTGGNTNCGGGCAATCA 

CTTAGCTAA 




WO 01/070979 



PCT/US01/09126 



TABLE 1 




Sequence 1415 

AGGTCTTCGACCCACGCGTCCGTTTTAGATCCAAATAATGACTCATTAAATATAATTATG

TTTTMGTATACTGAATTTCTGTTAGC7TAAMTG7TMTTCTCAGGMTGATTTTCTCA 

CACTTTGTGTTGGCTAATMTAAMGCACTGTTTTATTCTCAAAACTCCTTm 

TTAGGGAGAGAGCAGTAGTGATCATTTATGTGAGCCCCTTTGAAATGATGGTGTCAGAGT 

GCAGAGAMCAATGGAGTTTTGATGCCAAAAAGGTTTTTTTGCAGTAAAAGTAAAMTT^ 

GGAATTAGTTGGCATATAGAGGAACCCTTTTGTACCTGCCCGGGCGG 

Sequence 1416 

AGGTGTACMGCTTCGACCCACGCGTCCGGGATGAGTTTGTATGTGTAAAGTGCTTGAAA 

CAGTGCCTGCCACATACTAAGTGTTGGATMGTGTTTGTTATTAAAAAAAAAAAAAAAAA 

AAGTGCNGGCCGGCCGCCCGGGCAGGTCAGATGATTGCAGAATTTATGTGATTACTGGGT 

ACTCTMTGGTAAGGAGAMTTAAGACCAGCTAGTTGTTAATCTTAACTTTTAGTCATTA 

AGGAGAATTTCCAAGACAAAACTGCAATCCAGCTGCTTACCTAGGAATACGGCTTAGGCT 

GAAAACTTCTATCGTCTTAGAAATGGGA 

Sequence 1417 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCX5GAGGTACAGTAATCCTGTGAGAAAG 

ACAGGACAGAAACCACTGTGCCTATTTTACAGATACGAAAACTGAGGCACAGGTAAAGGG 

GCTTGTCTGTAGTCCCATAGCTAGCAGATGGCTGGAGCCAAGACTGAGGCTCGTTCTTCA 

ATGCTGAGCCAGGGCTCCTTCCGCTGCACCACAAGAACGCTAGACCACTCGCCACCAGCC 

TTCTCATTCCCTCTTCCTCCATTCTMTCATTTCTAGCTGGCTGGCCTCCACAGAGCATA 

GGAAAACAGCCAGGGCCGGGCACGGTGGCTCATGCCTGTAATCTCAACACTCTGGGAGGC 

CGAGCCGGGTGGATCACCTGAGGTCAGGAATTCGAGACCAGCCTGGCCAACATGTTAAAA 

CCCCATCTCTACTAAAAATATAAAAATTAGCCAGGCATGGTGGCGCACACCTGTAATCCC 

AGCTACTCAAGAGGCTGAGGCAGGAGAATTGCTTAAATCTGGGAGGCGGAAGTT 

Sequence 1418 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCCGGNCCGAGGNNCAAAAGAGAGACAAAAGG 

GTTCTCTTGGAAACAAGAAGAGTGACTCCAGATGTGGCCTGAATAATTGCCATGTTAAGT 

TAATGCAAAAGATCAGAACAGGGCTACATTTGCACAGGCAGTTTCTCTCCGGGCCGTAGT 

TTTCACTGATGATCACCTTTCACAGCATTTTCCCCMCCAGCATTTCACTTAGTCTTCTC 

TATACCCAGCACCTCCCCCGGCACCCCCGGCAAGCCCACTATCACTTCCGACTTCCAACG 

TGGCATCCGTGAGATCTGTCCACATTAGGCGAAGCAGGAGAACACTGAGAGCAGCAGGAT 

GGGTTTGGAAAGAGCATGCCTCTGGAAACACAGCTTCCTGGGAATTCACATGAGGCCAGT 

CCTACAGAGAGCAAGATGCACCCCAGGATTTCTTCATTTTCTAATAGATGTGGGAGTGCT

CCATTTTCCCCGACAGCGMTTTCCCCTGAGAAACGATACTAGACCCTGGGTTTGCCCAC 

CTTGTMCTCTTCCTTATCTNCTCCTTTTCATCCCTAATTCA 

Sequence 1419 

CCGCGGTGGCGGCCGCCCGGGCAGGTACATCACCCTGCTGAGGGACATCCAGGACAAGGT 

CACCACACTCTACAGAGGCAGTCAACTACATGACACATTCCGCTTCTGCCTGGTCACCAA 

CTTGACGATGGACTCCGTGTTGGTCACTGTCAAGGCATTGTTCTCCTCCAATTTGGACCC 

CAGCCTGGTGGAGCAAGTCTTTCTAGATAAGACCCTGAATGCCTCATTCCATTGGCTGGG 

CTCCACCTACCAGTTGGTGGACATCCATGTGACAGAAATGGAGTCATCAGTTTATCAACC 

AACAAGCAGCTCCAGCACCCAGCACTTCTACCTGAATTTCACCATCACCAACCTACCATA 

TTCCCAGGACAAAGCCCAGCCAGGCACCACCAATTACCAGAGGAACAAAAGGAATATTGA 

GGATGCGCTCAACCAACTCTTCCGAAACAGCAGCATCAAGAGTTATT 

Sequence 1420 

CCGCGGTGGCGGCCGAGGTACACTGTAMTAGCCTTTACCAAACGTGTTTGACAAGGACC 

ATMTTMCATCACTTAGTGMTTGTGATAMGAAMAAAAGCCATGATTTATTCGATGT 

GATTGGCTTGTTTTTATGTGGCGCCAAGAACGAACCTGTTTAGCAGCTGTAACCAATGGT 

ACGCGGGGGAGGCGAACAATGGCGGAGCTGGGCGAAGCCGATGAAGCGGAGTTGCAGCGC 

CTGGTGGCCGCCGAGCAGCAGAAGGCGCAGTTTACTGCACAGGTGCATCACTTCATGGAG 

7TATGTTGGGATAAATGTGTGGAGAAGCCAGGGAATCGCCTAGACTCTCGCACTGAAAAT 
TGTCTCTCCAGCTGTGTAGACCGCTTCATTGACACCACTCTTGCCATCACCAGTCGGTTT

GCCCAGATTGTACCTGCCCCGGCCGCTCTAGAACTA 

Sequence 1421 

CCCCGCGGTGGCGGCCGAGGTACTTTGGGAGACCACCCCCAGCTATGGTTCCATACACTT 

ANACTGCGCCCAGCTACAGNTTNATACACTTNGGACAAANTATCTGATAAAATAGAGAAA 

AAAATCTTATTTACTATAGCATTACATAATAAATTTNTGAGAAAAAAATTAACCAGGGAT 

GTAAAAAACCTTTACAATAAAAAATAAATAAAAAAGGAAGATCCAAATAAATTTTAAAAT 

ATTTTATGTCTTTGGATTGAAAGAATAAATATTAATAAAGTGCCATATTATCCAAAGTGA 

TCTATAGATTCAATACACTTCCTATCAAMTTGCAGTATTTTTTTCACAGTAATGGAAAT 

TCMTTCTAAAATTTACATGAAACTAAAATAAACTTTGAATAGCCAAAACAGTCTTGAGG 

AAAAGGMCAAGGCAGAAGAATATCATACTTACMTTTCAATCTATATTTGAAGACTTTA 

TAGAANTAAM 

Sequence 1422 

CGGGCAGGTACGATGGGAGGACAGCTTTGTAGAAAGGACATTATCCAGCTAATAGCAAAC 

TTTGTGGATCCCAATCCGAGATTTCCCTTGCTGAAAGACAAGAAAGTATCTCATATAAAA 

GTGCTGTAGCAAGTATTTGTATACTCCAGAAATAAGCTTCTGTAATTCTTAGCTGCCAAT 

GTGTTCAGGCGTGATGACTCGGTTTCTGTTTCTCTGAACATCAATACTAGGGTCTGTATA 

ATTTCAATGCATGCCACCAGCTTCATCAACCCTT 

Sequence 1423 

AGGTACAATCAGAATGCTGCATTCTCCAGCCATAAAGATCGCTCCCTCTTCTTTTCAAAC 

ATCCCTGTCCCTCAAGGTCTAGCTCAAGACGGTCACCTTAAGAAAAGCTCCCTTTGTCGA 

GCAGTGACTCCATACCAGGCCCTGCT7TAAACGCTTTATCTGCATTATCTTACTTGATTC 

TCGCAATAGCGCTGGGTGGTAGGTGCAATTATTATCTCCAGTTTATAAAAGAAGATACTG 

AGGGTCAGAGAAGTTAAGTGACCGGCTCAAGGTGTCACATTCAGTAAGCGTTGAAGGGGC 

CTGTGTTGGTCTGTCCTTGAAGATGCCCCCTACCGACTACACTTTCAATGATTTTCTGCC 

TTGAACCTGGCCCCATGACTAAA 

Sequence 1424 

NNCAAACCTCCTATGCTTTCCTTGGCATCGGCTACACATCATAGTATTCATTGCCTCCTT 

GAGGTCATCTTGCAGCTTGGACAGAAACTCATTTACTGACCGGCTCAGCTCATTCTCTGC 

CATTCGTTTCATCTCATACTCCTTTCGCTTTTCAGCATTGCTGACAATGTCCCAAGCTGC 

TCGCAAAACCTTGAAGGCCTCCTCAGCCCGGGGATGATGATTTTTGTCAGGATGAACCAT 

CACTGCCAGCTGTCTATAG 

Sequence 1425 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTTGAACTGATTTCCCC 

AAATGGCTCTTTGTGCAGCCGCAGCAGCTGTTCAGACTTATGAGCAGACAGGAACTGCCA

GAGGGCAATGGTGTGTTTTAACTGGCATCTGTTTAAGGCCTTTAACACGTGAATCGTCTG 

ATCACCCATTTGCAGGATGTCTTGAGTATACACATTCAGCTGCATGTTTGGATCCCCACC 

AGCTGTGCTCAGAAACCCCAGAGTGACTTCTACGACAGACAGCACTTCACAGGCATCGCT 

GTAGGACTGCAGCTGTCCACTGATGGCACTAATGACCGAGCTGGGGAGGGAGTCCTGGGA 

AATGAAAAGCAGGAGAGGGATGTCTGTGGGCTGGGTTTCTGGCATCTCACCACCTGGTAA 

GAGAGCCGAGCCCCTTCACTGCCCAAGCCACATGCG 

Sequence 1426 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTGGGCCAAGGCTGCAAT 

CAGTGATTCAGCCGACTGCTCTTTGAGTCCAGATGTTGATCCAGTTCTTGCTTTTCAACG 

AGAAGGATTTGGACGTCAGAGTATGTCAGAAAAACGCACAAAGCAATTTTCAGATGCCAG 

TCAATTGGATTTCGTTAAAACACGAAAATCAAAAAGCATGGATTTAGGTATAGCTGACGA 

GACTAAACTCAATACAGTGGATGACCAGAAAGCAGGTTCTCCCAGCAGAGATGTGGGTCC 

TTCCCTGGGTCTGAAGAAGTCAAGCTCGTTGGAGAGTCTGCAGACCGCAGTTGCCGAGGT 

GACTTTGAATGGGGATATTCCTTTCCATCGTCCA 

Sequence 1427 

AATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACATCACCCTGCTGAGGGACATCCAGGAC 
AAGGTCACCACACTCTACAAAGGCAGTCAACTACATGACACATTCCGCTTCTGCCTGGTC

ACCMCTTGACGATGGACTCCGTGTTGGTCACTGTCMGGCATTGTTCTCCTCCAATTTG 

GACCCCAGCCTGGTGGAGCMGTCTTTCTAGATAAGACCCTGAATGCCTCATTCCATTGG 

CTGGGCTCCACCTACCAGTTGGTGGACATCCATGTGACAGAAATGGAGTCATCAGTTTAT 

CAACCAACAAGCAGCTCCAGCACCCAGCACTTCTACCTGAATTTCACCATCACCAACCTA 

CCATATTCCCAGGACAAAGCCCAGCCAGGCACCACCAATTACCAGAGGAACAAAAGGAAT 

ATTGAGGATGCGCTCMCCMCTCTTCCGAAACAGCAGCATCMGAGTTATTTTTCTGAC 

TGTCAAGTTTCAACATTCAGGGTCTGTCCCCAACAGGCACCAC 

Sequence 1428 

AGGTACAAATAGATACCTTCAAGGAGAATGAAAACGGGGAATATACTGAGCACTTACACT 

CGGCCAGCTGCCAGATCAAAGTTTTCAAGCCCAAAGGTGCAGACAGAAAGCAAAAAACGG 

ATAGGGAAAAAATGGAGAAACGAACACCTCATGAAAAGGAGAAATATCAGCCTTCCTATG 

AGACAACCATACTCACAGAGGTAAAAAGATTTCTTTTGGTGACMTTTCAGTTCATAATT 

TTTMTCTTAAAAATTCATCACTTCCAAACTGGTCAGMTTTACTTCTCCTAAGCCTTGA 

GGGACACAGTATCACATGGATTCTGTGTCCAGCGGCCTTMCAGGAAGATTGCTTTAGAA 

TTTGGCACGAACCATGCCACTGTCTCTGT 

Sequence 1429 

NCNGNCCAGGTACTCNNNNACANTGNAAACTNNTCANGNGCCCATCATTGCTGGATTTGT 

ATTTAACATTATGTTTCACCCAGACAACAGCTCAGAGAACTGGGCAATGGCTGCTNATGT 

GTTGAGCCGGGGCATACAGGATGAAGAGGGACAATGAGAGGGAATGAATTCTATTCTANA 

CACCCTGAGTTTGAGGAACCTATGGAAATGTCCAGGAGGCAACTAAATGAAACAGCCTGT 

GGTAGACAGAATAATGGCCCCAAAGATGTCTACAGCCTAATCCCAGGAGCCTGTGAAAAT 

GTTCCCTTCGCATGGTAAAGGGATGTGGCAGATATGATTAAGCTAAGGATCTTGAGATGG 

AGAGTTTATCCAGGATTATCCAGGTGTGCCCAGTATAAT 

Sequence 1430 

AGGTACGCGGGACACAGGGTCCTGTGCAACANGGNGGACTAACAGTAACACCGCCACGCC 
GGCAGCAMGCTCATTTTGGTCCCCGCCCCGTTCCTCTTTCT C I I I I I AACTCCTTCCCT 
CTTTGCGGATTCTAGMCGGAACCTTTTTTTMTTCTTCCCAGTAGAAACGTAGGAACAA 
TTTCGTGAACGCAATCNGGAGTGCCCAACATGGC 
Sequence 1431 

AGGTACCCCTGTTTAACAAGGGGTAGGGGCCTTCTGAGACTGTTTCCTCTACAGAGTAAG

GGTTCGTTCAGCCTTTTCCGTGGCCTGCCAAGAACTCAACTCCATGTTCCCTCACTTCCT 

GTMTTGACCTTGTCCAGGACTTrCTGACCTTGGAGAATTCACCTTTGCTCTTTCTGCTG 

CTTCGTGCATTCTTCCACCAAATGTCTTAACTGACTGGGCTCCTTTCCAAACTCAAGGGC 

TTTGCCAAATGCCACCAGCTCAGGGAGGCCTTTNCTGGCCATGACACTTGAAGTTGCAAC 

ACTCCCCCGCAGTCTCCCGTGCCCCAGATGTAAGTTCCATGAGGGCAAGCCCTGTGCTTT 

TACCACCATATCCCCAGCATCTTGAGCTGTGCCTGGCCCAAGAAATATTTGTTGAATGAA 

TGMTTTAAAAGGGGATATTCATGANGGCTTACACATTCTCAATGGGTT 

Sequence 1432 

G GGCNAATTGGAGCT CC CCGCG GTGGCGGCCGCCCGGCAGGTACl I 1 I 1 I 1 1 I ICI I I I I 
CI I I I I I I 1 1 I I 1 I I I IMGTTTTTGGGATCGTGTCTCACTCCTGTTGCGCATGCCTGTA 
GTCCCAGCTACTCAGGAGGCTGAGGCAGGATAATTGCTTGAACCCGGGAGGTGGAGGTTG 
CAGTGAGCCGAGATCATGCCACTGCACTCCAGCCTGGGCAACAGAGTGAGACTTTGTCTT 
CGGAAAAAAAAAAAAAAAAGATTTGGCGGATGAAAATAACCAGAATGAAAATAGCTNGAA 
AACTCANCAAGCAGGMGCTCCCCTTCTCACCCTTTTGTTCCCTTGCCGATAGAATCAGT 
CACTATTAGAAAAAATGAAAGACGCTCTGTTTAAAACAATGATGACAGCAGTACCT 
Sequence 1433 

GCGGTGGCGGCCGAGGTACTTCCCTTTTAAGAGATGAGTCACCGCAACTGAAACTTCTCT
ATTTCTTTTCTTTTCTGATTGTTCTCCAGAATTAGGACTAGTAACAGTCCTGAANNCTTG 
T^flTTCCTTATCTAGAAMCTCAGTATCTTCCCTTCCGTTTGTCTTAAAATATTAGTACA 
CGCTTTCTCAAGCCTAGCCGATTAGAAGGGGCTGCCGGGCTTCCACCACACCTCATCGAG 
GNAATGGTTTTNTGGNNAAAAAGCCCATGGAAATACTGAGCCCATGCCNCTCACGTTGNA

AAAGCCCCGTTCCTTGCC 

Sequence 1434 

AGCTCCCCGCGGTGGCGTMCTTATCTCATTTTAGATNAGTTTGCAAAGAGAGTTGGTGG 

CTMGGCCATAGCTTAGCCTCCTGACCCCTACCTTCCCACGTTCTTTCCAAGAGATTCTC 

CTCAGGAATMCACTTGCAAGGGAGTTCCTGATGAAGTGGATTCTTGTTATTCTAGGAAT 

AGGCCTACATGGTGCACTGGCAATGTGAGATTATACCTCAGCATTTTCAAAGAGCATAAA 

MTC TAGAGC TGGGGGGTTTAAMCATGACAAACCTMTTTTAAGTAGGCAGACAAATAT 

TTAAAl I I I iCCCCTACCCTTGTTTCTACATCGGTCCATTCAGACTCTGCACCATCTGGT 

TGGGCAGGTGCTACTGTGGMGATCTTCGTTTTCGACTACCATTTGGTGATTCTTGCTTT 

AMGTCTCMTATCAGTAMCTGAACAGATTNCCACCACCCCTTGTTTTATAAATATCAC 

CCTTMTTAGTTTAAGTTTCAATCTCCCCATCGGAGGCTAGTTCCTGGTGGGTCAGCATG 

TACCTGCCCNGGCNGGCCGTCTANAACTAAGTGGATCCCC 

Sequence 1435 

TACTTAGGGCGAATTG GAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTAC^ 

M I M u l l I M I I I l IGGAGCCAAAATTGTGTGCATTCCTACTGGGAAACACAGTGGCC 

MATCCTTTTGAATTGTTTCCTTCTAGAGACTTTMCTCTTCTGACTGCAAATCTTAGTG 

TCCTGTGAGTATTAGTTGATTAATTATACTTGCTGCTTAGTGAAATACAGCCAGCTATAG 

GTATCTTCTGGAGTAGCTCMCACAACTTTTCTCTTGCTAGAGTGACTCTTGCTAACAGA 

ACCCAAAGATGCGCACATATACCCACAGGAGCTGGAGGTCCCTCGCATGCTCCTCTCGTG 

CCAGCCTTTGCCTTACCCTTCACTCTCTCCCTCCAGGAGCCGTCGGTACCTCGG 

Sequence 1436 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAGATTTGTCTTGCTC 

ACCTCTTACCAAATCCAAGAATGGTTTCTATCCAGTGAATGGCAAATTCATCTGATAGGT 

MGGGA ATAATG GGTCAAAATGGTAGCAAACACTTCTTTCAAA7TTCTACTAAAAGACTT 

GCTGTTGTTTTTCTT ATAMG GGGCMTTTCATCAACATACATCTTTTAAAGGAATCTCT 

AGAAATTTGAGTGACTTTTTGGCCATMTCCTGTTTGATATATTTTTGGTCAGCTGCTCA 

MACAAACATTCTCCTTGTMGGTTATCTATCTGAMGATACTAATTCATTTAAAGCAGC 

TGCAGGTGAACAACCTAAAGATGACATGATTTGGGAGAAGAGGAAGGCAGATTACTGAAC 

TGACAAGTGACCCAAAGCATAATTAGGTTTGTGCACATGGTAGCATGGAGGTTCCACACC 

TACCTTCTACAGCGTATTAMATMGAATATTGTCTTTGAMCATCTTCTAGCACCTTTT 

T AATAAAAC AAA ATTTCCC ATCTTC AATTCT A I I I I I 1 1 CCCAAATCTACCTTTAAAAAA 

TTGT 

Sequence 1437 

CCGCGGTGGCGGCCGAGGTACAATMACAGGGAATGAGMCTATTTACATGGAAGTTTCT 

TTCTCATGATGCGGTGGAGAAGCCTCGGCCACTTGGTTCTGCCAGATGTTCCTGGGGTTA 

CTGTAAATGGGAAGGACAGGCAGAGCTAAACAAGGTAGGAGAATCGCCCCCCTTTTTTGA 

ATGTTTAAAGAGTTTGCTGCAGTATGCTGCATTCCATGTGTGCTGCTTACGGGAGCCAGG 

GA MCTG GGATTCCACTMTTCMTTGTMTACTTGCGGGGGACCCTGGAGTTTTACGTA 

ACATTTTGATTTGGGAAAAAAAAAAAAANAAAAANANTGTTCCTGCCCGGGCG 

Sequence 1438 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGCCAGGC 

ATGCTCCTGCCTTGGGCAGAGTGTATATGTGTGAGTGCACCTGCCCTTCCCACAGCCTAG 

MCGTTCCTCCTCCAGACAGCACATATGGCCTGCTCTCTCACTTCCTTAAGGTCTTTATT 

CAMAGTGACTTTCTCAGT GMGCCCT GTCTGCTCACCCTGCGTAAAATTTCAGCTCTTC 

TTTCTATCTCTCTTCCCAG A I I I I I I 1 1 CTCCTTCATGTTCGTTGGTGTCTAAGGTTTAT 

CATC TATTTCGCTAATGGTCAGTAGMTGTMCCTCCACGTMGCMGGAGTTTTGTCTG 

TTTTGTTCATGTCTATGTCCTTAGTGCCTGGAGCATTCCCTAGTATGCAGTAGGTGCTCA 

ATAAATGTCAGTTGGATTAATGGCTGAAAGAAAGGTCACCGCTATAAGGATGGAGTCAGA 

GAACAAACACAGTTMTTCCTGGTCCACTGTTTTTGCTTCCACTAMTTGTATTTGGTCT 

ACGGCTTCTCCGCTTGCCCTGGAACCCTGCTCAGAACACTGCTCCCTTCTCCTTCTTCTT 
CTCCCTCCGGATAAATTCT
Sequence 1439 

CCGGGCAGGTACCGCGGGGGGCCGTGGTCAGAGCGAGCTTCGGAGAAGCAGTGGTGGGTT 

CCATGTGATGGTGGAGTAGGAGGCAGGTCTCCGCGGTTCATCTGTGTTGCTCTAAATGAC

ACTGTTTCATTATTTTGATGGCtGGAGMTATTTCCTMGTGTATGTATATGMGAAGTT 

TCTTGATCTCTTTATCTGTGGATGMCAGCTACTTTGAAACATATGGTACCTCTGTGGTC 

AGACCATTTGCCAAGCTTGTGAGGCCTCCTGTTCAGGTATACGGTATTGAAGGTCGCTAT 

GCCACAGCTCTTTATTCTGCTGCATCAAAACAGAATAAGCTGGAGCAAGTAGAAAAGGAA 

GTTGTTGAGAAGTAGCACAMTCCTGMGGMCCCAMGTGGCTGCTTCTGTTTTGAATC 

CC 

Sequence 1440 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCANGGTACGCGGGTGCT 

TTCATAGCCGCGGACACAACTTGGGCCCACAGTTAACCGAGGAGGAAGGCAGAGCGTGCT 

GAGCAGAGCACCMGGAAGAGAGCTCGGCTAGCCGGGAAGGTCCGAATGGATTTATTTGG 

TGAGGCCAAGGAACCCACTGCCTCCACGGTGTTCCTCCAGGAGGCTCCTCCCTTCACTAA 

GGCAGCAAGGAGTGCACTGGTGAGGGGMGMTTGTTGTCATCCTTTCTCTACGACTCCA 

AGAGAAACTTTATACTGGAGGMGMTATTCTACCACTTTGGGATGCTTCCAAAGAAATG 

GGATACCAAAGGAGTGAACTTCAGGTCATTGGAAGTGGCCAAGCTGGAGGCGACCTATGG 

AGACATGACCTTCAGTTCTTCCTGAGATCGATAAAGCAAAAAAAAAAAAAAAAAAAAAGT 

ACCT 

Sequence 1441 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGATATTGTCAATGCGCAACATGGAGAG 

ACTTTAAAACAAATGCTAGGGATTAGAGTATAGATCAGATAGCTGGCAAATCTATAGGAA

GGGAM AGTAATTTTAAACACACAGCATTTGTTTCCTGCTGCTCTATCACAATAGCTAGG 

TTTTTAAATMGTAGGCT TTATA CCAAGCCATAAAMTGAATTGCTGGGGCTCTTTGGGA 

CTAGGGAAGGCGGGAAATTTTAGATATTGCTGTTGGCTTAGTGAAAATGCATGCTTACCC 

GGTCACCTGTGGCTCCAGCAGGACCAGGGGCACCTACAGCACCAGGAGCACCCTAGTACC 

T 

Sequence 1442 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGGCAGGTACCTTATTACATA 
TGAI I 1 1 1 ATTAGTTTCTGGAGGCAAATTTAATTT™ 

AAAAGAAATAGTTCTCAAAAAGACAACGATGACTGGGTGTGGTGGTGTGTGCCTGTAGTT 
CAGGCTGCTCGGGAGACAGAGGCAGGAGAACCACTTGAGGCCAGTTCAGTCTAGCCTGGG 
TAACATAGCAGGACCCTGTCCCTTAAATAATAAAAAATTTAA 
Sequence 1443 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTAGGGTGCTCCTGGTGCTGTAGGTGCCCCTGG 

TCCTGCTGGAGCCACAGGTGACCGGGTAAGCATGCATTTTCACTAAGCCAACAGCAATAT 

CTAMATTTCCCGCCTTCCCTAGTCCCAMGAGCCCCAGCAATTCATTTTTATGGCTTGG 

TATAMGCCTACTTATTTAAAAACCTAGCTATTGTGATAGAGCAGCAGGAAACAAATGCT 

GTGTGTrrAAAATTACTTTTCCCTTCCTATAGATTTGCCAGCTATCTGATCTATACTCTA 

ATCCCTAGCATTTGTTTTAMGTCTCTCCATGTTGCGCATTAACAATATCCTAATGCACT 

GAGGCTTCTCAAAGCCTTCAATTATTACCAAAAAAAAAAAAANNTTTNNNAGGTACCT 

Sequence 1444 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAAGACCAAT 

TCCTTCCT MCCT GGATTCCACTGTCCTTGGTGAAAACTACTTTGATGGAACCTACCAGA 

TGCTTTATCTTTTGGTTAAAGGAACTATACCTGTNGAAATTCACACTGCCACAGNGATAT 

TTGTTTCTTTCCAATTATNTGTTGCMCANMGATGACTTTTATACCTCTCACAATCTGG 

NTAAAAATCTTGCCTTGTTCCTAAAGATACCAAGTGACAAAATCCGTATCAGCAAAATAA 

GAGGGAAGAGTCTGAGGAGGAAGAGATCCATGGGATTCATAATTGAAATAGAGATTGGAG 

ACCCTCCTATTCAGTTCATAAGCAATGGCACCACAGGTCAGATGCAGTTATCTGAACTCC 

AGGAANTTGCTGGTTCTCTTGGACAAGCTGTNATTTTAGGAAA 
Sequence 1445

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGAGAGTCACTCCTGCC 

TTCACCATGMGTCCAGCGGCCTCTTCCCCTTCCTGGTGCTGCTTGCCCTGGGAACTCTG 

GCACCTTGGGCTGTGGAAGGCTCTGGAAAGTGTAAGTTGGAGTCACTCTGGTCTAATCTG 

GGCTGCAGGGTCAGAGGTGGGGTCTCCTTGTGGTGTGGGTGTGTCCCCTTCTGTAGGCTC 

TGATCCCTCAGCTTAGTTTCGGGAGACCTCCCTGAGGGTGGAATACATGTCTGGCTGGGC 

TCCMGGTTTGTGTGGCAGTTTGAGCTTCTGGAAATGCTTCCTCTATGCAGCCATGCTGT 

CAGCCCAGGTCCCACTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCATACTNTGCCT 

TCT7TCTTTCACCTGTCTGCG ACTCTTCAAAAAAAAAAAAAAAAAAA 

Sequence 1446 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCTTAATCCAAGACAA 

MGM TAGCTCCTTTCAGAGT TCTCA TCATTTCTTCACCCCAGCCTTAATGGGTATATTC 

CTTTTCCCCAGGTGTCCAGAGTTTTCCAGAATTGTCCCAAGGCTCAGTCCACACCAGTTC 

TTCTCCAGTGTGCTCTCCTGAGAGGCCAGGCACACTCAACAATTATCTAGATGAGTTCCC 

ACTTCTTTCCAAGTGAAGGTTCTGCTATCCTAGCATAGTCAGAATAAGTTAAATTATGTC 

TCTTMGAGGCACTGTTCCACTCCTTTTCAAGGNGTGTGGCATATTTGAAATATGTGACT 

TAAAAGTCTACAGTCTCTTACCAAAAGCCTTGGGCCAAATACATGTCAAAATTCAGAATT

TTNCAGATTTTAGAAAAGTGACCCATATACCATACATTGCA 

Sequence 1447 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATCACCCTGCTGAGGGACA 

TCCAGGACAAGGTCACCACACTCTACAAAGGCAGTCAACTACATGACACATTCCGCTTCT 

GCCTGGTCACCMCTTGACGATGGACTCCGTGTTGGTCACTGTCAAGGCATTGTTCTCCT 

CCAATTTGGACCCCAGCCTGGTGGAGCAAGTCTTTCTAGATAAGACCCTGAATGCCTCAT 

TCCATTGGCTGGGCTCCACCTACCAGTTGGTGGACATCCATGTGACAGAAATGGAGTCAT 

CAGTTTATCAACCAACAAGCAGCTCCAGCACCCAGCACTTCTACCTGAATTTCACCATCA 

CCAACCTACCATATTCCCAGGACAAAGCCCAGCCAGGCACCACCAATTACCAGAGGAACA 

AAAGGAATATTGAGGATGCGCTCAACCAACTCTTCCGAAACAGCAGCATCAAGAGTTATT 

TTTCTGAC 

Sequence 1448 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAATTGTCGTTTTTATTCCT 

CTTATTGGGATATCATTTTAAAAACTTTATTGGGTTTTTATTGTTGTTGT^ 

ACCCTACAAAGAGCCTTCCTATTCCCCTCGCTGTTGGAGCAAACCATTATACCTTACTTC 

CAGC AAGCA M GTGC TTTGACTTCTTGCTTCAGTCATCAGCCAGCAAGAGGGAACAAAAC 

TGTTCTTTTGCATTTTGCCGCTGAGATATGGCATTGCACTGCTTATATGCCAAGCTAATT 

TATAGCAAGATATTGATCAAATATAGAAAGTTGATATTCAACCTCACAAGGGCTCTCAAA 

GTATMTCTTTC TATAG CCMCTGCTMTGCAMTTAAMCATATTTCATTTTAACATGA 

TTTCAAAATCAGTTTTTCATACTACCCTTTGCTGGAAGAAACTAAAAATATAGCAAATGC 

AGAACCACA MCMTTCG MTGGGGTAGAMCATTGTAMTATTTACTCTTTGCAAACCC 

TGGGNGGTATTTTATTTTGGCTTCATTTCMTC 

ACCTGCCCGGCCG 

Sequence 1449 

CCGCGGTGGCGGCCGCCGGGCAGGTACATCACCCTGCTGAGGGACATCCAGGACAAGGTC 

ACCACACTCTACAAAGGCAGTCAACTACATGACACATTCCGCTTCTGCCTGGTCACCAAC 

TTGACGATGGACTCCGTGTTGGTCACTGTCMGGCATTGTTCTCCTCCMTTTGGACCCC 

AGCCTGGTGGAGCAAGTCTTTCTAGATAAGACCCTGAATGCCTCATTCCATTGGCTGGGC 

TCCACCTACCAGTTGGTGGACATCCATGTGACAGAAATGGAGTCATCAGTTTATCAACCA 

ACAAGCAGCTCCAGCACCCAGCACTTCTACCTGAATTTCACCATCACCAACCTACCATAT 

TCCCAGGACAAAGCCCAGCCAGGCACCACCAATTACCAGAGGAACAAAAGGAATATTGAG 

GATGCGCTCAACCAACTCTTCCGAAACAGCAGCATCAAGAGTTATT 

Sequence 1450 

CCGCGGTGGCGGCCGAGGTACAMTTGNCGTTTTTATTCCTCTTATTGGGATATCATTTT 
AAAAACTTTATTGGGTTTTTATTGTTGNTGN^fTGATCCCTAACCCTACAAAGAGCCTTCC

TATTCCCCTCGCTGTTGGAGCAMCCATTATACCTTACTTCCAGCAAGCAAAGTGCTTTG 

ACTTCTTGCTTCAGTCATCAGCCAGCAAGAGGGMCAAMCTGTTCTTTTGCATTTTGCC 

GCTGAGATATGGCATTGCACTGCTTATATGCCMGCTAATTTATAGCAAGATATTGATCA 

AATATAGAAAGTTGATATTCAACCTCACAAGGGCTCTCAAAGTATAATCTTTCTATAGCC 

AACTGCTAATGCAAATTAAMCATATTTCATTTTMCATGATTTCAAMTCAGTTTTTCA 

TACTACCCTTTGCTGGAAGAAACTAAAAATATAGCAAATGCAGAACCACAAACAATTCGA 

ATGGGGTAGAMCATTGTAMTATTTACTCTTTGCAMCCCTGGNGGTATTTTATTTTGG 

CTTCATTTCAATCATTGNAGTATATTCTTAT 

Sequence 1451 

CCCCGCGGTGGCGGCCGNGGNACMATTGTCG^NNTATTCCTCTTATTGGGATATCATN 

TTAAAMCTTTATTGGGTTNTTATTGTTGNTGTGGGNTCCCTAACCCTACAAAGAGCCTT 

CCTATTCCCCTCGCTGNTGGAGCAMCCATTATACCTTACTTCCAGCAAGCAAAGTGCTT 

TGACTNCTTGCTTCAGTCATCAGCCAGCMGAGGGMCAAAACTGGTCTTTNNCAT7TTG 

CCNCTGNGATATGNCATTGCACTGCTTATATGCCAAGCTAATTTATAGCAAGATATTGAN 

CAAATATNGAAAGTTGNTATTCAACCTCACANGGGCTCTCAAAGTATAATCTTTCTATAG 

CCAACTGCTMTGCAMTTAAAACATATTTCATT^MCATGATTTCAAAATCAGATTTT 

CATACTACCCTTTGCTGGAAGAAACTAAAAATAT 

Sequence 1452 

CCCCGCGGTGGCGGC CGAGG TACMATTGTCGTTTTTATTCCTCTTATTGGGATATCATT 

TTAAAMCTTTATTGGGTTTITATTGTTGNTGTNGGGNCCNTMCCCTACAMGAGCCTT 

CCTATTCCCCTCGCTGTTGGAGCAAACCATTATACCTTACTTCCAGCAAGCAAAGTGCTT 

TGACTTCTTGCTTCANTCATCAGCCAGCAAGAGGGMCAAAACTGTTCTTTTGCATTTTG 

CCGCTGAGATATGGCATTGCACTGCTTATATGCCAAGCTAATTTATAGCAAGATATTGAT 

CAAATATAGAAAGTTGATATTCAACCTCACANGGGCTCTCAAAGTATAATCTTTCTATAG 

CCAACTGCTAATGCAAATTAAMCATATTTCATTTTMCATGATTTCAAAATCAGTTTTT 

CATACTACCCTTTGCTGGAAGAAA 

Sequence 1453 

GAANCCCCCTTTNGACGAANANNCGCGAATCGNGAGCTCCACCGNGGNGGCGGGCCCGAG 

GGGGACNANGANTTTTCTTGNNCN I I I I I I I I NNAAAAACNGGNGACTATTTAATCCATC 

TAAAAATACAAATCAGGNAANGGGGGGAACCATAGGAAAATCCTCCACCTNTAACAAGAG 

CCGAAGNTACNGGGCTTTCTGCTTGCTCCAAANAMTCCCAAAGGGCTTGGATAGTTTGN 

GGAANGGGGAATTATCTGTGTCTTCAAACTAACTCCCAAGGATACCTCAAAGGACATTAA 

AGGTNTACCACCACCATTCTGGGGGMGAAAAAGGGGGGTTTCTTGCCCCTTGCTTGAAA 

AGCCTTANAAATNGGGGGAAGCCTCCAAATNGCCNTTNGGGGGNGNAAAAAGGGGNNCCC 

TNTN AATTTTTTN AAAA AAAAA AAAATTG G GTTCCAAAA AAN AACCCCCCTTTN G AAAG G 

GANCAAANGGGGGGGGGGGCCCCCTNCCCCNAAGGCNAAGGGGGGGGGCCCNNTTNGGGG 

GGGCCNCNTGGGGGGNNGGNNCCCCAAAAACCCCCCTAAACAAAANNTNGGGGGCTGGGC 

CCNTAANGAGGA GGGCCCCN CCAGAGCCCAAAANTTNGGGGTTTNNACCCCANNNAAAAT 

TTTGGGGGGGTN 1 1 I I I I 1 1 I ACCCCCNTTTNNGNNCCCCNCCCCCCTTGGGGGGGAAAA 

AAAACCCTTNAAAACCCCNTTNTTTNAAAACCCCCCCGGGGGGGGG 

Sequence 1454 

CTACTTAGGG CGAATTGGAGCTCCCCGCGGTGGCGGCCGGGCAGGTACTTTN 1 1 I I I I I I 

nTTTTTTTl I ATACCAGATTTTAAGGAAAGACTGCTTGCTTCTGANAAAGAATTCTCGG 

AGTTGATTCTCTGCTCCATTTGCTCCTTTCTCMCCTCTTTAGTCCTCGTTCTCTGCAGT 

MGAAGCTAGTGACMCTGGMTTTAGCTCCCAGNGGGCTTTCTTCCGGGTGGNGCCTGG 

ACAGGCTGCTCCTGCTGCTAAGGCTTCTGGAGCTGTTATTGAAGATGTCAGCTTCTGCCA

CTGNGGTCTGCTTT7TGGAGTCTGCATTGGTTTTGCCTCGATCTCTATCATTCTTCTCAT 

TATTTTCATGAATGAA 

Sequence 1455 

CGAGGTACTGACCTCGTNTGTCCCTTCCCCTNCACCGNTCCCCACAGCTTTGCACCCCTT 






TCCTCCCCATACANACACNNNCCATTTTATTNTTTGGGCCATTACCCCATACCCCTTATT 

GCTGCCAAAACCACATTGGGGGCTTGGGGGGGGGCCAAGGGGCCTTGGGCATGGGACCAA 

GGACCACCTCCCCCTACCCATATCCCTCCCCGTGTTGTGGGGTTTNGGGGAAAAAAACCT

TTTTGGI I I 1 1 1 1 IGGGGGGGGTCI I I I 1 1 1 1 1 CCTCGGAAATTAAANAAAAAAAGGATT 

NCTTACCTACAAGAGAAANAAAGAAAAAAAGGGGTACCCTTGGCCCCGGGGGCXX3GGGCC 

CGGCTTTCTTAAGGAACCTAAGGTGGGGAATTCCCCCCCCGGGGGCCTTGCCAAGGGAAA 

TTTCCGNATTATTCNAAAGGCCTTTAATCCGGAATNACCX3CGGTNCGNACCCTTCGGAGG 

GGGGGGGGGGCCCCCGGGGTMCCCCCAAGCTTTTTTGGTTTCCCC 

Sequence 1456 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGAGATAAGACCC 

TGAATGCCTCATTCCATTGGCTGGGCTCCACCTACCAGTTGGTGGACATCCATGTGACAG 

AAATGGAGTCATCAGTTTATCAACCAACAAGCAGCTCCAGCACCCAGCACTTCTACCTGA 

ATTTCACCATCACCAACCTACCATATTCCCGGGACAAAGCCCAGCCAGGCACCACCAATT 

ACCAGAGGAACAAAAGGAATATTGAGGATGCGCTCAACCAACTCTTCCGAAACAGCAGCA

TCMGAGTTATTTTTCTGACTGTCAAGTTTCAACATTCAGGTCTGTCCCCAACAGGCACC 

ACACCGGGGTGGACTCCCTGTGTAACTTCTCGCCACTGGCTCGGAGAGTAGACAGAGTTG 

CCATCTATGAGGAATTTCTGCGGATGACCCGGAATGGGTACCTGCCCGGGCCGGCCGCTT 

CGGCTTTAGAACTAGTN 

Sequence 1457 

GGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAGTCCAAATTTTAAGGAAAATGA 

GTCCCGCAATGAGTTTCCTCATGCTTCGCCTGTGCGTGGACCGGNCAGCTTCTGGGTGTG 

ACTGGAGCAGGGCTTGTCTCCTTCTTCAGAATCACTTTGCAGGGGTTGGCAAAGCCGCTC 

CCATCCACGTACTCTCTGGACACAATAATTTTGGCCTATTGCCATCAAATGCCCATTTTC 

CACTGCTGGAAGCAATGTCAAAAAAGGGCTGGCCCAAAAAAAGACCCAGAGCTGTCAATA 

CAACACTGGAGACAGATGCAACTGAATAAACCCTGTTTTACCCAATTGCACTATTTGGTC 
CT 

Sequence 1458 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGCAGGTACTGGAACAGGGATAA 

GTTCTTGGATAAGGNGCCMCATACCTATAAMGCTGATTTTTGAGTAAATTATTGATTC 

TMCATATGTAATGGATTTGGTGTGATMTTTTCTGATCT7TMCTATAAGTGACTTTTT 

ATTCTCCACCAGAAAAGATAAATGACTGAGAATGTAAGTCTGCGCTCTGATTAACACAAT 

GGAGAAACGGAAAAACTATCTCTGTTAAAAACTGATTCCTGTCATTCTTCTGATATCAAA

TAAGAGGAAGGAAAATAAACTTTTTGTGTGTAGATAGAAAAACATACCTGAGGCCAGGTG 

CAGTGGATCACGCCTGTAATCCCAGCACTTTGGGAGGCCAAGGCGGGCAGATCAGCTGAG 

GTCAGGAGTTCGAGACCAGCCTGGCCAACATGGNGAAATCACGTCTCTACTAAAAATACA 

A 

Sequence 1459 

NGGGCGAATTGGAGCTCCACCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGCACTC

AGGGAGCTCAGATTTTGAGACAGTAGCTGGCCGATGCTCCCAGCTGAATAAAGCCCTTCC 

TTCTACAAAAAAAGAAAAAGAAAAAAGAAAACAGGATATCTGAAATTAAGACTGCAGGAT

GGAGTAGTTTTCTGAAAATGACAGGGTCCAAGGTGTGACNCACCGGGACCAAGTGGCTGA 

ACTGGAATGAAGTTAAGAAGCCAGTAAGAAAACATCNCGGATAATATGGTGGATCAGTTC 

AACAGNAATGACATTATTTCACCCATGGTCCCCAAAGGGAGGGAGATGACTGGAGNATTT 

CAAATTCTTCAAGGCAAGCCCTCAATGCCAGCCCAGAGGATTTTAANGAGGGGCCTATTG 

TTGTTCCCAGAAAGGAGGACTCTGNGGCCAAAACCGCCAAGAATGGGATTTCAAGAATTT 

ACTTCAAAATTCCTGTCAGGATTTCTTTAACCCTGGGTGGGGCTTATACCCAAAACCCCA

AAAAAACTTTTMGCCAGCCTTCNTACTTTTTGGCTTAATTTTTCTCCCTAAAGCCCCAA 

CCCTTGGGCN I 1 1 1 1 TT 

Sequence 1460 

ATTGGAGCTCCCCGCGGTGGCGGCCGCCGGGGCAGGTACTCAGAAATCCTTCCCCGAATT 
TACAGCACAGGCAGGATGACCTAAGAGGCAGTTTACTTCCCTGAGACCCACAGTTGGGCT 
GTTCTGGAAACACATCTGTGAATCATAGCCAATTGCCACAGAGAAAACAGAACCAAGCCT

CCGGTGAGGCCACTCCACCCCAGAGAAGTCTGCAGAATTCCAAGGACTCGGATTGGATGT 

TCAGAATTCAGCAACTGGAAAGTCCTTAAAAACAAACAGGCCAAACCAAATCAATATTGC 

TGTTTCTAGATGTCCCTTCTGTGGTTGAGCTAGTTTTACAGAGATAAATATATTAAGACA 

AGGAGGTGGGGGTGTTA TATGATCMTGATAGCCATTTTGAAAGAGAGGGAGGAGTACTT 

M I 1 1 1 1 I I I I I I I I I I I I CCAAGCACGTGCCACTTTATTGAATGACACTGTAGACAGGT 

GTGTGGGTATAAACTGCTGTATCTAGGGGCAGGACCAAGGGGGCAGGG 

Sequence 1461 

AGGTACGCGGGGCTCAAGAATAAGCTGAAATATGGCCAGACTATCAGGCCCATTTGTCTC 

CCCTGCACCGAGGGMCMCTCGAGCCTTTGAGGGCTTCCTCCAANCCTACCCACTTTGG 

CCCAGTCANACCAAAAAAGGGGAAAGNAGGCCTGGCTTCCCCTTGNCAACAAGGGNATTA 

ATTCCMAAAGNCCTTCCTGGTTTTTTGGNTGGTNCCTTGAAGGGNAAGGGNAGGAAAAA 

AAAAGNCCTTGNAACCTTCGGGGAAGGGGNAGGGTTCCTTAACAATTCAAAAGGAAATTG 

GGGGGGGGAATAAAAGGAAAAANGGGGCCAAGNCCTTGGTTTGGAAGGAAGGNAGGTATG 

GCTTCAAATTAATGGCCCCCCAANGGCTTAATTGAACCAAAAAAGGGTNCAAAAGGGGAA 

CMTTCTTCMNAAGGGGTGGGGTTCCACCCCCCCNTTCGGGGTTTTCCCC t I I I I GGTT 

ACCCCTTGGCCCCCGGNNNCCGGGCCCNGCCTTCCTAAGAAANCNTAGGGTTGGGGNANT 

NCCCCCCCCNGGG 

Sequence 1462 

GCGGGCAGGTACATGGATGGGAGCAGCTTCACCAACCCCTGCAAAGTGACTCTGAAGAAG 

ACGACAAGCCCTGCTCCAGTCACACCCGGNAAGNCTGACTGNNTNCCACCGCNACAGNCT 

GAAAGGCATTGAGGGNAAAACCTCMTTCNTCGGGGNACCTAMTTTTTTCX^CCTTTAAA 

MNATTTT7TAAGTMCCTTTGGCCAACCMGGTAAAGGGGGMCT1TTCCAAAACNTN 

AACCCCTTTCCCTCC AAG N ATTTG N AAG G AAACNTGTTNTTTCCC AG GTT AATTAATT AC 

CNATTTCCAAAAGGTTCCAACCTTTNGGAANGGGGTTAAGGGGGAAACCAANAAAAAATT 

TTGGCTTAACCAAGGTTCCCTNMTTAAAMTTTTTAAAANGGGGGG I I I M I > 1 1 1 1 I A 

TATAAAAAAGGGNGNTGGTNTNANNCCCCTTTTNGGNGGTCCCCCC I I I I 1 1 I I I I I NAA 

GAAAAANCCNTAAGGGGNGGGGGAAATTCCCCCCCCCCCCGGGGGNNCCNTTGGGCCAAA 

NGGGAAAAATTTTTCCNGNATTTAATTTCCAAA 

Sequence 1463 

AGGTACGCGGGGAGGCATTGAGGCAGCCAGCGCAGGGGCTTCTGCTGAGGGGGCAGGCGG 
AGCTTGAGGAMCCCGCAGATMGTTTTTTTCTCTTTTGNAMGANTCCCNCNTTTAAM 
TAACCAAAACCTTACCTTTAAAAAAAAAATTMTTANGTTCCAAAATTAANGGGNTTTTA 
ACCTTNAAMGGNMTTMTTTTTNGGCCTTTT^ 

I H H 1 1 I I AAAAACCC GGTTTAAAAA I I I I I I I I I I AAAAATTAAAGGCCCTTTTTAAA 
AGN NAAI I I I I I I I I I I AAAGGNGANGGAAAAAAAA A l I 1 1 1 11 1 I I GGNANAAGTGAAA 
ANCTTTTTTTNTTAMAAAATNTAMNN 

NAMTGGGNGGGGTNNNANANAAGGMTTTAAAAMNTGGGNTNTTTTTNTTTAAAAAAA 
AAACCAMTTTTATCCCTTGNTNAGG 
CCI I I I INCTT 
Sequence 1464 

TTTGAGAAGCCAGCGCTCACCCACCCGGGGTCTCTGTGCATTGACCTTTGGGTGCTGACT 

TGGAGAAAAGCACAAACACGACCAGTCCCATCCTGGGCTCCCGNNGGCX^GNCCTTTCTTT 

CCTMTNCTTACCGCCATTTTGGTATTCCGGACCTGGCCMTTTTAGGTTTNGGGACCTT 

MMGNMTTGGMTTGGAACCTTCCMGTTTTTTAAAAAAGGGGGAAGGGGAAGNAACC 

NAAANATTNNCCTTNGAACCCTTGGTTCCTTAAAAGNCCAAANGGAAATTGGGGCCCCCC 

AANAGNCCCTTGNNNGCCAAAAAGGNAAAAAAAAAAAAGNNCCNAACCCAACCTTTGGNN 

CN MTT TAACCCNAATTTAAAGGGGGGAAATTACCAAGGAAAAGGGGGGNCCAAGGAAGG 

CCTTTTCTTGGGCCCCTTGNGCCGGGGGGMTTTCCTTNGGCCAAAMCCCMTTTTT^ 

ACCCAN I | | | | | | IGGGl I I I I I I I I GGGCCCCCTTTGGCCCAANAAAACCCTTAACCCC 

CCCCGGGCGGGTTNACCCCCTTTCGGGGGGGCCCCNGGCCTTTCCTTAAGGNAAAACCTT 
AAAGGNNTTGGGGGGNAATTTCCCCCCCCCCCCGG
Sequence 1465 

AGGTACTTTATTTTTC I I I I I 1 1 1 I I I 1 1 1 1 GTGGATGGGGACTTGTGAATTTTTCTAAA 

GG TGCTAT TTAACATGGGAGGAGAGCCGTGTGCCGGNATCCAGCCCAAGCCCNCCCGNTT 

CACTTTTTCNCACACNCTTTNTTTTrCMCCCTTGGCACCTTCTGGGGCNTT^ 

MGNGGNCCCCTTTNTTTGGNCATTACATTAGCAGCGGMAACANCCTTTTANTTNCTTT 

CCCCNTTNATTGGNGMNANMCCTCCCATTTTNTTTTNTTCAAANAAGGAGCATTNGNC 

AMNANCTNCCCAMTNTNNCCTTNNACCACNGNGGGNCNNTANNCCACN^fTNCTANCTT 

NAGG NGNNGCCATTTGGGTTNCNCCCTTTGNNCCNGGMNTTCTNCCCTTTTCCNTTNGN 

Tm^CCCCNCNGGGNNGTTTTT^^ 

CNCNAANAAAAANGTTNAANCCNAAAAAAAAGNGCCCAAGAG I I I I ll I II I II I I CCCC 

CCCCCC 

Sequence 1466 

AGGTACACTGAAACATAAATCCGCAAGTCACCACACATACAACACCCGGCAGGAAAAAAC 

AAAAACAGCAAGTTTACATGATCCCTGTAACAGGCCATGGNNCTNCAANNCTTCAGGAAT 

GCCTTNCCNTCNCATTCTGGCCCAAAGNTGGTTGTTTNCNTGGGAATNACCAGGNAGGCC 

ACCMTTCGGTGGGGCCT^CCTGGGGGGNGGTTCAACCAACCTNCAAGNCTTTTAAGGG 

CTTGGTTGGGGGTTCCCCACCANGAGGCCACCTTNCAATTCTTGGGCTTGGGGGACCTAA 

TTGNGGNTGGGGTTGGGNTTGGGACCTTCCTTAACCTTCCAAAAGTAAGNCAANAAGCNT 

GTTTM CCCMGCCMCAATTTNCAAAAMCCMGGTTGGTAATTTGGNAAACCANTCCT 

TTTTTTNAAAAATNATTCCAAAAAAGTTNGAAGGAAAAANCCANAGGAAANGGGNCAAAC 

CATTAAANTTAAAATNGGTTTAATCCAAGNAAAAAAG 

Sequence 1467 

CCGGGCAGGTAC I I 1 1 I I I I I I 1 1 I I 1 1 I I ATAACTGAAGCTTTATCTGGAGTGGGGGAA 

TGGGGGTGTGGTCAGTTGGGGCACCCAAAGACAAGCCATGCCNCCNNCCCNGGAAANGCC 

GCNCAGAGGG TTCCC TTGGGCAATIVITGGTATNTACTGGGGTATCCTTCTTNTGCGGTTNC 

TTTCGGGNCMTTTTTCCGATNNCCACCTTTNNCCTTACAAAGGGNCCCCAANNNTTGNC 

CTTTCTCCANCNCCCCAAAAGGTTNNGTGCCCCTTTTNNNCCCCGNAANNNGGAATTGTT 

NMGNCCTTNGNMGNGGTTATTGGGGAGGACCATTTTCCTTTCCTTTNCCCCCCCCCCA 

AGGAAAAAAAAMTTTTCCNGTGTTNAAGGNGGTAATGNNAAGTAACCTCCCCCCAACCA 

MTTGGAAAGGGGNTTAAAGMNTGGNTTTTTTNACCTTTTTTCCTTAC^ 

CAMNNNCCTTTTTTCCAMGGGGGGGGCCCTTTTTCCTTrCCTTTMNCCCAM 

TTNGGAAATTNGGGGGGGGGGGCCCTTGGGAATTTGGAAAAAANCCGGNTTNGGCCCGGN 

CCTTTGGGTCTTTGGTTTTCCCCAAAAAACCCCCTTNGGGCCAAACCCCCCT 

Sequence 1468 

AGGTACGCGGGTGGTGAAAAAAGAAGTAGAAATCGTGGCCCACCTCCCTCTTGGGGTCGT 

CGCCCTCGAGATGATTATCCGTAGGAGGGAGTTCCTCCACCTCCNNCGCCACAATCTTCC 

C MAGAAA GGGGAGMAGTCTTTTCTTCCTTCGGCAGNCNCGNGTAGTCNAGNGGTTCCC 

CCTTTTTTCCTTAGGNAGGCATTTMGGGGNMGTAMGGAAGGNAGGAAGNAATTCCGG 

CC TTGG TTCNTTCGGGGGNAGTAAGNAAAAATTCCACCANAGTCCCGGTCCCCGGAATTC 

CCTTTTCNTTCTTMGGGGGTTCCTTCCGGTTMGGTTCCGNMTTCCTTAAGGGGTTNC 

AAAATGGAAMAGNGAAAAATTMGGAMGNAACCMGGTTTTTTGGCCAAAGGAAGAAA 

AGGTTGGGGTGGTTAACCCCTTGGCCCCCCGGGGGGCNGGGCCCCGGCTTTCCTTMGAA 

AACCTTAGGGTNGGGAAATCCCCCCCCCCGGGGGGCCTTGGC 

Sequence 1469 

CCGGGCAGGTAC I I I I I t I I I I I I I I I I I I ATAACTGAAGCTTTATCTGGAGTGGGGGAA 

TGGGGGTGTGGTCAGTTGGGGCACCCAAAGNACAACTCATGCCTCCTNCNNNGAAAGGNC 

GGCCCMGGGTCCCTGGGCCAATTTGGTTTTCNTGGGATTTCTTCTTTTCGGNTACATCG 

GGNCMTTTTCCGNTACACCTNCNCCTNCNAAGGGNCCCNAGGTNTGGCTTTCNCNNCGC 

GCMNAGNTNGTNCCCTTTTCACCCGGAATGGMTGGTTAAGACCTTGNMGNGGATTTT 

GGGNGGACCTTTTNCATTANCTTACCCCCCCCAANGTAAAAAAATTTCCGGTNNTAAGGG
GNMGMMGGMCCCNCCCAACAATTGMGGNGGTTAAGGTTGGGTATTTTCCTTTTCC
TTTC CCCCT TTCCCCAMGGCCNTTTTTCNMGGGGG 

MCCTTTTTGGNMTTGGGGGGGGGGNCCTTGGGMTTGGGAAAAAACCGGTTTNCCCGG 

CCTTTNTCTTTGNGTTCNCCAAAAACCCCCTTGGGCCAAACC 

Sequence 1470 

AGGTACGCGGGTGGTGAAAAAAGAAGTAGAAATCGTGGCCCACCTCCCTCTTGGGGTCGT 

CGCCCTCGAGATGATTATCCGTAGGAGGGAGTNCCTCCACCTCCNCCNCCNCNATTCTTC 

CCAA AGAAAG NGAAGAAAGNCCTTTCCTTCTTCGGCAAGNCCCNGNAGGCCAGGGGNTGC 

NCCCTTTTTTCNTMGGMGTATTNAGGGGGAGTAAANGTAAGTAAGGAAGGAATTCCGG 

CCTTGGTTCNTTCCGNGGGTAGGAAGGAAAAAATTCCAACCAAAGGCCCCGNTCCCCGGA 

ATNCCCTTTTCCTTCTTNAGGGGTTNC^NCGGGTTAGNTTCCGGAATTCCTTAAGGGGT 

TCAAMTGGAAAMGGGAAAATTAGGGAMGTMCCAGGTTTTTTGGCAAAAGGAAGGAA 

AGGTTGGGGGTTGGNTAACCCCTGGCCCCCCGNGGGCGGGGCCCGGCTTCCTAAGAAACC 

TTAGGGTGGGGMTCCCCCCCCGNGGGGCCTTGCCAAGGGGAMNTTTTCCGNAATTAAT 

TCCAAAAGG 

Sequence 1471 

AGGTACAAACGAGTCCTGGCCTTGTCTGTGGAGACGGATTACACCTTCCCACTTGCTGAA 

MGGTCAAGGCCTTCTTGGCTGATCCATCTGCCTTTGTGGGCTGCNTGCCCCCTGGTGGG 

CTNGCCTGNCCCACCCAACCAGGCNTGGCTTCCCTGCCTTGGCCTTGGCCTTGGCAAGGC 

TCCCCCAGGCCTTAMGGGGTTTTGGNAAAGGNCNCCAAAGGGGAAAAGGAAGGTTCCGG 

GMGG GGMNTTCCNGGGAACCGAAGGGNATTMTGGGGGGAATTTTTTGGGGTCCTTNC 

TTTT TTGGAC CNTAAAAATTCNAACCCCAAAAAAAAAAAAGGNCCNAAANCCCCCCAAAA 

ANCCTTTTTTAAGNCCCCAAGG 1 1 I I I I 1 1 MTTTTTTGGCCAAAAAAAAACCAAAAGGG 

GGGAAAAAMNTTAAAAAAMGGGGGGCCCTTTTAMCCCTTTTTT 

AANMTAAAATANNANAANANNNNlsrTNNGNGGGTTTAACCCCCTTTGGGCCCCCCCGGGG 

GGGNCCCGGGGGCCCCCCGGGNTTCCTTAAAGGAAAAACCCTTAANGGGTNGGGGGGAAA 

TTCCCCCCCCCCCCCCGGGGGGGGCCCTTGGGCCCAAAGGGGGAAAAAA 

Sequence 1472 

AGGTACAGAGTCTTTTGCTTCCT(XCACCCCTAGGGGGAAAAACTGCTTTGTGCTTTGGG 

AAGTTGTCTCTGAAACCCGGGGACAGAGGACCGCAGGACAGANCTACCGNCGGGGAGNCN 

CGNGGNAGGGAATGGGGGCATGCAAGTCATGNTGAGAAGNGGAGNGTGATCTTACAAGCA 

NNGGAAGNACGTATGNGGTCCCGNGTAAGNAAGTCCANGTAAGNGNCCCCTTGGAAGNAA 

AGTCCCAAGTANCGACCAGTNTTGNGNAGTAAGTAGGGTGNTGGGNAAATAGGTTGNANC 

CCATNTCCGGGGGTCCTTGGGGGGNCCTTGGGTAAGNCCNCGNCCAACCAACTGCTTCTT 

CCTNCCCCAATGGTTTAAAAAATTAGGCCACCCCTTTTTT^ 

CAACCAAAAGGGTTCNCNCCCAATTCCCAACCATANAAAAAAAAANATAGATANACACAN 

AAAGGGGAAAAGGTTACCCCTT 

Sequence 1473 

AGGTACTGGTGTGTCCGGAATCCTACCCACTGTGATGACAGTGCCTGATAGTTTCTTCTG 

CCTTTCTATCCCAAMCGATTGGTCAGTTTACCCMGGTTTGGCAAATGGCCAGCCTTNA 

GNAMTCTTCC CCAG GGGMNCMTNCTTCTTTTCTTAGGTTMGNTTTGGCCCCTTTAA 

GNCCCAATTCCTTTTNGGTTAAGGTT^^ 

NTNTNriTGTGGCCTTTTCCCCAGGAAAMGGC^ 

AACCTTNGGCCANTAACCCCAAAA! I I I I I I I I I I I NGGGTTTTNCAATTTCCTTGGGTT 
TTTGGGGGTTNCMTGGMTGGGGTTTAAGGCCAAGAAAGGCCCTTGGANCCCCTTCCCC 
NTGGGTTTTMCCAAMTTMGGAAMATTGGAMTTCCGGGGTTTTTCCCCTTGGGGGG 
CCTT AACCAAGGAACCTTTTGGAAGGGTTCCTNGGTN I 1 1 II I I I I I I I I GGGTTTTnT 
TTTTAAAAAAANCCCCTNNTNCNCCCCCAAATNGGGTTGGGGGGCCNAAAAA 
Sequence 1474 

CGCGGTGGCGGCCGCCCGGGCAGGTACTT7TTC I I I M I I 1 1 I 1 1 I I I GNGGATGGGGAC 
TTGTGAATTTTTCTAAAGGNGCTATTTAACATGGGAGGAGAGCCGTGTTGCCGGCTCCAG 
CCCAGCCCCGCNTGCTTCACNTTTCCACCNCTmiNnTTCTCACCTTGCCTTCTTGGGCT

TTTCTNCAAGGGCCCCTNNTGGCCTTTCTTCCC^GNACCCCTTTGGTTCTTCNCTTCT 

AAANAACCCNCTTCNCTNCCAACCAAACCTNNGCAGNCTCCCATTNNCTTTCCCCGGGCT 

TCC CCCNT CNCCTTAAGGTTCTTGGTTCCCCTTGGCCGGTTQNCCTTCTTGGTTCCCCCG 

GGGTTTTT CAAGNAGNAACAAANCNTTNCCCCAAAAANGCCAACAAAAAAGGCCAAGGTT 

I I I 1 1 1 1 ICCCCCCCCTTAAAAGNGGGNNTGGGGGGGAAGGGGGAAAAGNCCAAAAAAAA 

GGAACCTTCTTGGTACCCCTTTTTNGGGGCCCCGGGCTTCTTAAAGAAAACCTAAGGGTG 

GGGGAAATNCCCCCCCCCCG 

Sequence 1475 

AGGTACTTCMTCCTGAATTMTCTTTAAACACTTTCAAATATGGAGATTAATCACCAAC 

TTCTTATTTTTTGGGCCAGTTGGATTCAA 1 1 1 1 1 1 ATTTAACCATGGANCTCTTTCNTAT 

TATTCGGTTACCTNGTNCGGAAATGGCCTAGGAAAGNAAAGGGCCTTCTTTTTCNCGGAG 

GGGTTN CGGGGAACCAAGNCNAGGAACC I I I I I GGTNAATTTTTTAATTGGTTTCCCCTT 

TTTTTTGGNTGGGGMTTTCCTTTAAATNGAACCCCCAGGTTCCGGGAACAAGGGAATTG 

NAGGCCCAMGGGTTCAAAAAAGGGGGAAAACCNTTTTTNTNAAACCCCCCAAGGGTATT 

GG CAAGTTT GGGACCACCATTAATTAAI I I 1 1 I 1 1 1 1 1 CCTTTGGGGAAAAGGGAATGGT 

TAATTTTTTCCCCCCAAAATTCCAAAANCCCCTGGGGTGGGGAAATTAAAGGGAAAATTT 

TCNTGGAAAAMCCAACCCMTTCTTAATTTTTTTGGAAAAAGGCCTTA A 1 1 1 1 I 1 1 1 1 1 

TGGGAATAACCAACCC 

Sequence 1476 

AGGTACAAAATTTTATTAAAGGTCTTTAGAGAGCAACATCCAGACTCCAGAATACAGCTG 

NGNAGGAGACCCTGTTATGCTGTGGGGACTGGGCTGGGGCATTGGAAAAGCNCCTCCNTC 

TGGGCCTTCCCMCNCCCTTTANTTGTTTCTTGNMGAAATGGGNGGGGNTTTGNTTGGG 

NGNCNATGATNATTANCTTTCAATTCTTTTTTAGGGGGGATTTTNACCAACCAAAAATTG 

NCCTTTCNAAACCGGTTGNGGNNTTCCATGTGNCCAAGGGNGGNGGCCTTTT C I 1 1 1 1 I A 

NGGGGGNCCCCCAM^r^TC^f^TTTNMCCCCCAAI^ANTmG^r^TmCCGGNNAANGT 

cccggtgtttntaccx:ggaaaattctccggngggggnancaaanggngnntttnccttgg 
ttgngtattnccccaaactaatttaaaaaaaaaaac 

Sequence 1477 

CGGCCGCCCGGGCAGGTAC I 1 1 1 1 1 1 I 1 1 1 1 I 1 1 1 1 1 1 1 1 1 I GATGAAATGCCAGGGTGA 

AAGGGATAGCCAAATAGGCTAAAGCACAAGTGCCACTCTAGGTTAATTTCGGCAGGACNN 

GCCCCAAATAAAAGGGTCCCACCNGACCANNTTACCCAATNCAACAACCATTTCCCGGNT 

TTCGGGG 

Sequence 1478 

AGGTACCTTMCTTGAGTTACAGGGCTGGTCCCTCTCTTTTCATTTTTTATCCCAGTAGG 

TGAGACCGTCCCTGCTGTGTTCGGTGGCTGTGGAGTTGATGGCATCTTTGCTCCAGGTGA 

CACCTGCATGCTGGCCAGCTGAGCGGCATAGAGCTGCTGCAAAACAGGGAAGACAAACAT 

TGATCTCTTTAAATGCAGCTGAMTTGAGTTTCAAAAGAAAAAACTTACCATATTGTTAG 

ATTTCTCAGACAGCCTTGTMTTTCTTTTTCACTATTTTTAAAAAAGGGAGCTAAGAGAA 

GGCAAATAATAAAACAGAAAGAAAAAGGACAGGTATGGGAGCCATAGTTCTGTTTCTGGT 

CCTTCTAGCMCMCTTTATGATCCTGGACMAGGATTTGATCTCTTTAGAATTCAAGTT 

III 1 1 I I I I A 

Sequence 1479 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAACACCTGCTAAAGACATAGC 

TTMTAAATATTTGTTGAATGTTGTTGAATGAAGAGCATTTTCCTGCTCATAAATTTCTC 

ATTACTTAACGAATGTAGAATAAATTCCTTACTCTGGCAGACTGGAATATGTAGTTACAG 

TTTATTCTAGAA I I 1 1 1 GCATTCCCCTTCTTCTGGTTCACTTGGAGAAGTTCTTTTCCAT 

CCTAAGTGACCCAGGTGAGTCCAGTGTCCTGCTGATTTCACTGCAGGGCAGGCATGTGAT 

CCAGGCTGGCCACAGTGGTCAGTGTGGACAACTGACCCTTGTTCATCTGTGGAGTCTAAC 

AAAAGTAGTGCCCCCTTATCCTCGGTGGTTATGTTCCAAGATGCCCAAGTGGATGCCTGA 

AACCGTGGATAGTACCTGCCCGGGCGGNCGNTCTAAAACTAAGTG 




Sequence 1480 

CCGCGGTGGCGGC( ^AGGT ACTTAGCATTGATCAMGAAATTTCAMTTACCGATCAATT 

GGGTGGGGAGAGGAATTTTCATTGTCCAAGCACCCTCAGGGAACAGAAGTCAAAGCAATA 

ACATATTCAGCMTGCAGGTCTATAATGAAGAGAACCCGGAAGTTnTGTGATCATTGAC 

ATTTMGACACCAAAAMTAAAAGACTCCTACGMGMCTGTTTTTGTTTTCCTCTTCCT 

TTTGAGMGACACTATGMTTAMTTCTACAGCTTTTTTTTGATATATGGAMTTTGTAG 

MCAGAMTATTTTAGTTAAAGTGTGACTTTCAGAAAGGGAAAATCAGGGCACAGCCTTG 

GTCTGTGTTCCCCAAATATTCACACTTTAAAGAATTCTTAAC 

Sequence 1481 

AT TGGA GCTCCCCGCGGTGGCGGCCGAGGTACATGATTGTCCTTTATGCATTAAATTCAT 

GCTTTTACATACTTGTGTTTCCTTTTCATTTCGTCCTTCCAMTCATTATTTTAGGAATA 

TAGAGAAAAGGGTTGGCCAGGTGCAGTGACTCACACCTGCAATCCCAGCACTTTGGGAGG 

TCGAGGTGGGTGGATCACTTGAGGCTAGGATTTTGAGACCAGCCTGACCAACATGGAGAA 

ACCCCATCTCTACTGAAAATACAAAATTACCCGGGTGTGGTGGCGCATGCCTGTAACCTC 

AGCTACTCGGGAGGCTGAGGCAAAAGAATTGCTTGAATCTGGGAGGCGGAGGTTGTGGTG 

AGCCAAGATCGTGCCATTGCACTCCAGCCTGGGCAACAAGAGCGAAACTCCGTCTCAAAA 

AAATTAAATAAATAAGGCTTATGTGCAGTTATTCTCAATGAGCTAAAAAGCTTCCTGAGG 

CTAGAAATTCTTGTCA 1 1 1 1 1 G GCTGGGA 

Sequence 1482 

CCGCGGTGG CGGC CGCCGGGCAGGTACTATCTATTTGGCCTTGGAAAG1TACTCTCTAAG 

TCAGGGCTTCTTTTCTTNAACTTTGGTGTGAGACTCCGCTGGAGAGCCGGTTAGAAAAAC 

ACMGTCTCGGGCCCCACCCCGCAGAGCCX5CTCATTCTCTAGTTCTGGGTTGGGGCCCAG 

GAGTCTGCTTTT CTAGC AAGCGCCCAGATGTCACTGATGCTTACAGCTCTCANACCACAG 

TTGGAGCAGNGATTrTTAAMGTCTTTTCATTTGTAAAGAGTNGTTCTCCATGCTCCAAA 

TGACTGNGACGACTGAGAAAATGCATGTATGTAAAGTCTGCANCTGGTGACATTGTACAC 

ACTNAGCAAATGGCCTTNGGTGTTACTGTNANTTTATTTTTACTAATTATTTNTNNCACC 

NACAAATTNGGANCTGCTNCAATCGGTNGGAATTTGGAAATTGGGC 

Sequence 1483 

GCTGCCATCAGCTCCCTAGGAGCTCTCCCTCCAGGAAGGGAATGTGTCCACCGTCAGACA 

CTCAGACCCAGCATGTGGGGACAGAGGCTGATGGCCTGTCTGGCCATTCCTCTCAGTTCC 

TCTCCTCACTAGCTTGTGTCCTTGTGCAAGTCACTTACCCTCTCTGAGGTTCAGTTCCCT 

CCTCTTTGAAGTGGGTTTAATAATAAGTACCTGCCG 

Sequence 1484 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTGCCCTTTCGTTAGAAGGCAGTGACTCCTTTC 

TGTGAAGCCGATTTAGTGAAGTGTCCTGTGCAGAAAAGAGTCCAGGGCTGTCAGTTAATT 

TCTTCCGGCCACTGGAGTTAGGGTTTGAAACTCTGCAGCTGCCTATTGCACTTGTGAAAA 

GGTTTGTATGTTCATCACTGCTGGCTGGCTCAGAGTTGGGAGTGAATCCTCCAAGGGATA 

AGCTTGGAGAACTTTCTGAACAGTCAATCTGTAAAGGTGTCTGCAATCCCAAGGCCAATG 

GACTAGATTCTGAAGGCTCTCGGTGGACCCACTGTTCCTCTCTGTTTATTAAG CI HU G 

AAGGAGAGAGATGAGGGCAGGACATGTGACAACGGTGCTTTTCCTTATGCTTATATCGCT 

CTCCAACAGCATCCTT 

Sequence 1485 

AGGTACATGCAAGTTGCATGATTATAATGACGTGATCCTGGGATTTAAGTTGATTATGAC 

AGGAACAGAAGGAACTTGGAGATTAGGGACAATGAAAAGGTGGTAGTGAAAGGAGTGTTG 

GAGTTAAGTTACTA GATGTC TGGAAGACAGACTGTGGTGGTCAGATAATGAGATAGATTA 

TGGAGGGGTTACAGTTTTTGGTAATAACGAGAGAGATCTAAGGTATGACTGAGAGTGAAT 

GGATGAGAATAGCAGAGAACAAGGTCAGTGGAAAACTACTCTTCAAAGAACCTATAGTCA 

GGGTGTTGAAAGATTAAAAATTGCTAAGAATTAAATCAGGAAGTAGTGCTGCGGGAAATG 

AAATGAGCAGTGAACTAAACTTACTGAGAAATGAGAGGGGATGACCCAAGGGTTTGTAGA 

T7TTGTAGATGATAGCA 

Sequence 1486 
AGGTACATGACATCCGACGAAATGAGACGCCACCTAA7TGATTTCCGGGAGTCCGCACCA

GGGGCCCTCAGGGAAGAGACCCCGCAAAGATCCTGGGAGACCAAGGTGGGGACCCTGGTG 

AGAAGAGAGAGTTCAGGGGAGTCTCTCTTCATTGCCCTTCTGCTAACCCAAGCATTAATT 

TGCTAAGTATTTACCAGGGGAGTGGGAAAAAGAGTTGAGCGGGATTCTCTTAGGCTATGA 

GAGAGTCAGGCAGCCCCCAAGATAAAATAATGAACTAGAAAATCTGGAACCTTACTTCTC 

TGGGAATNTTACCTATCTGGCACCGTGGGAAGAAAGAAAAAAGGCTACTGAGTACCTGCC 
CG 

Sequence 1487 

GCGGTGGCGGCCGAGGTACTGACCTCTGCCAGTCAGCTGCGTAACTCCAGGCCCTAGGGT 

GCCCCGTCTGTCCAGCCAGGGATTGCATGGATATGCTGTGATCTCCCTTTTGGTTCTGAT 

TGAGTTGG ACCT TGTGGGAGGAGAAACATAGATGTTGATACATGAACACATATGTTGGAG 

AGAGAAAGTTTTATCTTGGCATAGGACTTTTAAAACACAAGGTMTTTTTMTCAGTTTT 

GGGACCAAAAAACACTCAATATGGGAAAAATCCAAATTCTGCCAAAATGTCTAAAGAGGT 

TTATTCTGAACCAGTATAAGTGACTGTGGTCTAGGTTACACAATTTCAAGAGATCCTGAT 

AMGCGTGCATGAGAGAGTTGGGCTACAGCTTGGTTTTACACATTTCAGGGAGACAGGAA 

TTGTANGGTAAAATTATGGCACAGGACTTTTAAAATGAAGCTGTGAAAGTTTACAGTCCA 

TAGAGAATAAAAATCTAGAAGTTT 

Sequence 1488 

CCGCGGTGGCGGCCGAGGTACTTGACACGACATATGGTAAATGAATAAGACAAAGGCTCT 

GATGGCTTCTCACAGCCCTGTGGATCAAACTCAGATTCTTTTCTAGAACCCCAAGGCCCT 

GTCAG TCTCA CTAGCCTCTCTCCAGACACACCAGCCTTTTTCTACCTTTCCAACATCCCA 

AGTTCCTTTTCTCTTACAGGACTACATACACTCTCTCTTCTGCCAGAAACCATGTTCTAC 

CAGCTMTTTCCACTCAACTTTTAAGTCTCANCTGAAATGTTACTTCCAAAGAGAGGCCT 

CCACTGAACCCCAAGCCTGGGGTTCACAGCACCTGTCTCCATAACTACATAATAATCTCT 

CTGATGTTTAGGCTGGGCATGGTGGCTCACGCCTGTAATGCCAGCACTTTGGGAGGCCAA 

GGCG 

Sequence 1489 

GGGCGAATTGGAGCTCCCCGNGGTGGCGGCCGGGCAGGTACCCAGAGTTGCGAGGAGTTT 

TTTAACTGATTTAGCCAGGTGGCAANCATNAGTGAATGGATGAAGAAAGGCCCCTTAGAA 

TGGCAAG ATTAC ATTTACAAAGAGGTCCGAGTGACAGCCAGTGAGAAGAATGAGTATAAA 

GGATGGGTTTTMCTACAGACCCAGTCTCTGCCMTATTGTCCTTGTGAACTTCCTTGAA 

GATGGCAGCATGTCTGTGACCGGAATTATGGGACATGCTGTGCAGACTGTTGAAACTATG 
A 

Sequence 1490 

GATNAGCTCGATATNGAATNNCNNCANNCNNGGGGANNCANNNGNACAAGAGCGGANNCC 

NNCGCAGAGGAGCTNCAANTTTACACACTGTTTAAATGAGGGAATANGCNGCAGCGCTTG 

GATGTAACTGAAGAAGACAGTNNGAGCNNCAAGGAGGGACAACCACGACCTATGAGGACA 

CCATGCCAGAGAGGCCTGGACCCCACGCTAGGCTCAGTGCCTGTTATACTCTTGGGACCC 

AGCG CTTTC CTTCTCCATCACGTGGCATACTTGGCATTATTTGTTGNTTAAAATATTGCC 

CTTAGTTTTNACCTTTCNTAAGGAGACACAAGGNNGACCTTTGNGACATTAACAGTTGCC 

CC AAGTNGGGG GNANANAAANAATTTTTGGGGGNGNAAAAANCTTTTGGCTTTTTNNAAA 

AAA I I "I Mill I NAAANTTT 

Sequence 1491 

cttagggc gaatt ggagctccccgcggtggcggccgaggtacctggtgacccttgctggt 

gctttcAtgttttgtgccgaggcmttagactttgtgctgaatntgtttgtgctgccacc 

tcagggmggggtggmtgtgcagcgtggtttccatttgacattgttttccctgagagat 

gggagggctgaacgttacctcttgacaagtcttagtggacagaggggcccggatacccaa 

gcgccttagttcttagggctgggtattagttcattttcacactgctgataaagacatact 

cgatactgggaagaaaaagaggtttaattggactcccacatggct 

Sequence 1492 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTACCCTTTACTTTTTCCC 
CAAGACCATCTCAG6GTGGAGCATTCTGTCTAAGAGAAGAAAGATAAGGAGGCTCCCACC

CACCTCTCCCMGAGCAGACATTAMCATCTTTGTGCTTTGAAGAGAGTGAATTTTGGAT 

AGTCTTGTGATTCTCAGACTAACTTCCAGAATTATACTTTAACCCCTTCCAGATATGGTC 

CGCCTTTGGCATTGTGTGTACCTGTGATGGGGCGTGTGGTTTCCGGTTGTCTCACCTTTA 

ATTGTCAACCTCCAGTGTTGACTCTAGAAATATGAGGAAAGCTTTTCAGTTTTTAAAATT 

GCCATTTAMTTTAGTCTATTAAAAACAAACCTAGAGGTCTTGGTGCAGTTGATTTCAGA 

GTTTATTMTTTMGTGGTCCCAAMGTATTACATCTTTTATATTCTGGAAGAAAAGAAC 

TGTGAACAAATTAGA 

Sequence 1493 

CTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCANGGTACTGTGGAGGT 

CCCAGGAGTGCTGGTGATGGGCACAAAGTTCCGATGGGTGAAACCATTGACATAGAGACT 

GCCTCTGTCCAGGAGGNANGGGCCCAGCTCGATGATGCCATGGGTCAGTTTGCTCAGCTC 

CCAGTATAGCTGCTCTCTGTCCAGTCCAGGGTTTAGAGGGTCAAGGCGGTGAGTGCAGAT 

GGTGTCCACGCCGGTGGCTGNCCCACGTTTTTCAGGCCTGAGCAAGGTCAGTNTGCAGCC 

AGAGTACCTT 

Sequence 1494 

ATAGGGCGAATTGGAGCT CCCCG CGGTGGCGGCCGAGGTACTAATTTGAATTTGTAATGA 

GTCTGATGGTATATTTCAATTTTTTGCTTTGAGGGACTGGCTGCTACATTGCAGAATATC 

TTATATCCCTGACTGCTTTCCACTAAATGTCAGTGGTGACCCCAATCCAATTATTATGAC 

AACT GAACAT GCTTATGCATCCCTCATGCCTTTATTTTTTATTTTGGGAAATCTTTCAGC 

TTCAGI 1 1 1 IGCTGATATTTATGTGATTCTTTGTTCTGCAATTCAAATTTCTGGGAGCCA 

AACAGTCTCCTTGGTTCAGATTACTGTTTTTTGACTAGAGCTTCTCGCTTCAGATTCTGT 

CATAAGATTATGGCTTAACCTATGGTTGTCCTTTGATTTGGTGCCATATGAAATAAAACA 

TTATTT 

Sequence 1495 

GAGCTCCCCGCGGTGGCGGCCGCCCGGGCAAGGTACGCGGGGAGTTTCACGCGCGTATGC 
TTTGCCCGCCATGGCCGCAGTCAAGGNGCCTTTGGAGTTCCATGCCAAGCGGTCTTGGCG 
CCCCGAGGAGGCAGTAGAAGATCCGGACGAGGAGGATGAGGATAATACTAGTGAAGCCGA 
GAATGGGTTCTCCCTGGAGGAAAGTGTTACGGCTCGGAGGCACCAAGCAAGGATACCTTA 
TGCTGGCTACTTTGGATGAGAATGAGGGAAGTGATAGATGGAGGGCAAAAAAAGGAGCAA 
TCCGATTGACC TTTNA CCAAGGGGGANTTTTGGAAGC I I 1 1 1 1 1 1 1 I AAAA AI 1 1 1 1 NTT 
TTTNGGNGAAATTTTTNNAAAAAC I 1 1 1 1 1 1 I NTTTNAAANNANATTACCCCCCTTTTAA 
AAAAAAAAAATTTNCCCCCCAAAAAAAAAANGGGGNAAAANANCCCCCCAAAAANAAAAN 
ATTAAAAAAACCCCCCCCAAANGNTTCAAGGGGCCC 1 1 I 1 1 1 1 1 1 1 I T NGGGGNAAAAAA 
AMAAATTNGGCCCCCNCCNCNTrTTTTTGGGAGGAGAGGGGGGNCCCCCNCCCCCTTTA 
AAAAAAAAAAAGGGCCCCCCCGGGGGGGGGGGGNANTTTTANAAAN I 1 1 1 1 1 1 1 I CCCCC 
CCCC 

Sequence 1496 

CGGGCAGGACCATGGGAAATAAGAGCNGGCTNNNGGCATTCTGNGTANGGAGCCTGAGCC 

AAACTCTAAAGCTGTCTTTATAAAGGGAGGTCATGTGATGGCCAGAAATTGCCTTTGCTT 

CATGGTGCACTTGGTGGGGAGTCAGGTGTGGGGTGCTGGGTTTCACATCATCCCATTTTC 

TTTTNNGNNTTCAGACCTGCAATGCTTCTTTTGCAACCCGAGACCGTCTGCGCTCCCACC 

TGGCCTGTCATGAAGACAAGGTGCCCTGCCAGGTGTGTGGGAAGTACCT 

Sequence 1497 

AGGTACTTTTNGAAGTAACTGGACATGNGGGAGGNNAGGGGAANGGAAGTATTGNTATGG 

ACTGAACTGTGCCCCCAAAATfCATATGTNGAAGCCNTGAGCTCTGACATGATTGNATNT 

GAAGTCCTAAAGCCAGGAATGAGGAAGGCTGTGAATGTNCATTGTTCCATGCMGAATGA 

CTCTGGNGNGGGCTATTTAGAGATCATGAGGGATACTGCCCCAGTTTCCACAGGCCAGAT 

GGNCTCCAACAAAAGCCACGGGGGAGTCACCCCTGCCTGGCAGATCTATCGGGTCAGGAC 

CACCGCCCAGGGGGTCCTGGAGAGGACAGTATAGGACCAAAGAGGAT 

Sequence 1498 




CACCGNGGTGGCGCCCGAGGTACTTGGCCAAGCGCTCAGATCGGCAAGGGGCACCAAGTC 

TTGATCTGCCCAGTGCACAGNCCCACAACCAGGTCAGCGATGAAGGTATCTTCAGTCTCC 

CCCGAACGATGAGACACCATGACGCCCCAACCATTGGCCTGGGCCAGCTTGCACGCCTGA 

A GAGACTCG GTCACCGGAGCCATTCTGGNTTGACTTTTTAGCAAGAAGGNANTTNAAGGA 

Nil I II I n I ANCGNCTTNGGNAATCCTTTTGGGGGTGGGGCNCNTGGGNGAAAAACTCC 

CCCCC ANCCN TNGNGATTCNTTNCNNGNGGGNNGAAAAATTTTTTGGNNAAAANTNNCCC 

ATMNNTTTTTNGTGMNAANGGANTTT^ 

NNNAAAAANATTTTTTNTCTTCNCNGGGGGGGNCNNTTTTATAAAAAAAANGANNANACC 
CCCCCNCCGTGGNGGAGAAMTTNTTTTANATTA I I I I 1 1 I I CNTCCCCCCCCCGGGCCT 
TGGGGGGGGGGGCCCCCCANCC C I I I I I I 1 I I I I I I I I 
Sequence 1499 

ACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAGTTAA 
TTTGTGT CATC CCA TCAGC AATGAAGGTCCCTATCCAGGGTCCTGCTTGGAGCAGCATTT 
CATGTTCTTTTGCTGTTTTGTGCT7TGCCGATTTTGG ATTTTATTTT^ 
ATTTAAAAMCT CGTCA CCTTTTGGA^ 

AGTCCCT C CTGATTT TGTGTGTGTGTGTCTGTGTTTAAGCAAGCGTTCGGTTGGTATAGN 
i I I I I 1 1 I GTTTTTTTAATTTAAATTGAAGGTAGCTGCCTCCTGAAAGCCAGCATTAAGC 
CAGAACACCCAGNTTCAAGCAAAAGACCCACCTCTCTGCAGAGGCAAAGTCTACTTTCTG 
GTACCT 
Sequence 1500 

CTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTTAGTAGAGACGGGGTT 

TCACCATGTTTGCCAGGCTGGTCTCGAACCCCTGGCCTCAAGTGATCCACCCGCCGCAGA 

CTCCCAGAGTGCTGGGATTTCAGGTGTGAGCCACTATGCCCGGCCTMTACGTGGATTTT 

TAAAGCTTCAGGTTCTGGTTCAGAAGTTTCCTGGGTCTCATTAAAATAATGAGGCACTCA 

GAATTGGTCTAATAAAAATMCGACCATTTCTTTCTACTTCAGCTNTTTCACAMCTTTT 

TTMTGAAAATGACMGNGAGGNCCTTCAGTAGGGGCATTTTCAGGGGANAAAAATAGCG 

GGNA GACCTGAMCCTGGGNTAGGNAGTTTNTTTTTATTTTTTGAACMGMGANNAATT 

TTTTCANAAGACCCTAAAAMTTmTTTCCCAAAMCAAAAGNGNTTTh^ 

GGNGGGACCCCCTTCCTTTTGGGNNTTTTNCCTTCCCCCCT 

Sequence 1501 

CACTACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCCGGCCGCCCGGGCAGGTACGCGG 

GGGCCACTGACCACAGCTCTTTCTTCAGGGACAGACATGGCTCAGCGGATGACAACACAG 

CTGCTGCTCCTTCTAGTGTGGGTGGCTGTAGTAGGGGAGGCTCAGACAAGGATTGCATGG 

GCCAGGACTGAGCT TNTNA ATGTNTGCATGACCCCCAAGCCCCACANGGAAAACCCCGCC 

CCCGNGGACMTTTGTTTTNACCANGTTTCCCCCNT^ 

TTTNCCNNCCCCCCCNCGGNGNNCCCNAAGGGGGNNTTTTCCCCNNTTTT^ 
CCCCCCCCCCCCCGG GGGN GGGGNGGNCNCCCCNCTTTTTNNNAANNAN I I I I I I II I NT 
TNNAMAANCCNCNNNTTTTTTANAANGGGGNTCCCCCNNAATNTNGGGGCCCNTTATCC 
CCNGGGGGNTAAAANAMNThnTNNCCAAAMGGGGGGN(XCCCTCCCCCTTTTTNAAAA 
NAGGGGGCCCCCCCCGNGGGGGGNNGNGAANTNATAAAAN I I I I I I I I CNCCCCCCC 
Sequence 1502 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATTCCAGCG 

CAGTCTGGGCCCCAAAGCAGTTTCACCCTTACCTAACTTACCTAACTCTTAATATAGCCT 

AAACTCACTGAMMTMGCTMCTTCATTTCACCTTTNGTAGCATACACGTAGACTCAG 

AGTNTATACTGAAATATAATGGGAGGCTGTTATNAAAAAAAACCNNCCCTCNAAAAAAAA 

AAMTNTNNNCNNCTNAAAAAAAAAAAAAAAAAAAAAAAAANNNGGGGGGGGGGGNNNNG 

GGI Ml I I I rNATNNTAAAAAMAAMTNNGGGGGGGGGGGGGGGGGGGGGGGTTTTTTN 

TNNTNTNTTCCCCTNNANTCCGGGGGGGGGGGGGGGGGGGGTNGTNGGTTTCCTTNTATT 

NTCTNTNNCTANNNCACNCCTCCNANACCCGCNNNNGANNNNAANATNNNTTNNAATTAT 

NNNNATTTNNNAAAAAGNGGGGGGGGGGGGGGGGGGGGGGTTNCTTTCNNTTTNNANTAT 

AANTANNATAGTNTCTNNNGNANNNTTNTNTNNNNANATNTNGGNNTNCNCNCNTNCNTN 
CNTCTTNTTTATAAAANAAAAAAA
Sequence 1503 

CCCTCGCGGTGGCGGGCGAGGTGCATCACCCTGCTGAGGGACATCCAGGACAAGGTCACC 

ACACTCTACAAAGGCAGTCAACTACATGACACATTCCGCTTCTGCX5TGGTCACCAACTTG 

ACGATGGACTCCGTGTTGGTCACTGTCAAGGCATTGTTCTCCTCCAATTTGGACCCCAGC 

CTGGTGGAGCAAGTCTTTCTAGATAAGACCCTGAATGCCTCATTCCATTGGCTGGGCTCC 

ACCTACCAGTTGGTGGACATCCATGTGACAGAAATGGAGTCATCAGTTTATCAACCAACA 

AGCAGCTCCAGCACCCAGCACTTCTACCTGAATTTCACCATCACCAACCTACCATATTCC 

CAGGACAAAGCCCAGCCAGGCACCACCAATTACCAGAGGAACAAAAGGAATATTGAGGAT 

GCGCTCAACCAACTCTTCGAAACAGC 

Sequence 1504 

CCGGGCAGGTACTCTTGACAAAGCCTCCCCAACAAGGTGAAATCACTCTCGGCTCACCAC 

TGGTGGCAGAAATACTCTGGGCCTGTCTGTTCTGAATGGTCAGTTTCACTGTAATCCTCA 

GGCCCTTCCAGTCACCCGTTGCCTTGGCAATGTCATCACCAAC7TTTTTTGGAGACAGAC 

CCAGGGGGCCCGATCTT GGGG GCCAGGGCAGAAGTTGGCACCGACTTTNCCTTCCGGGNN 

CCCCTAAGGTTA AAANA CI 1 1 INI 1 1 1 1 1 1 GGGGGNNNAAAANTTTCGGGCNNTTGGGGG 

GGGGC(XTTTNNTTTTTAAANNAAA(X)CCCNNTTTTNGGGAAAAACCCNAAAGAAAAGTN 

CCCCCTTGGGCCCCCCCCCNNCCCAAAAGGCCGNNAAAAACCCCCGGTTTTTTNGGNGTT 

TTTTAAAAAAAMGGAMNCCCCCCGCNNCGGGAAAAhrmNTNANAANTTTTTTTCCCC 

CCCCCCCCNCGGGGGGGGGGCCCGGCCNCNN 1 1 1 1 I I 1 1 1 1 1 1 1 1 1 1 1 

Sequence 1505 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATGGTGGTAA 

ATGCTTAACTCCACCTATATAAAAATCAAAAACAGGCAAAATGAATCATCAGACTACAGC 

TTACCTTTAGGGGGTGTGAGTAGTGACTAACAGGGGTCAAGATGAAGGTTTAAGGATGTC 

GCCAGGCGCAGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCAGGCGGAT 

CACTGAGGTCAGGAGATCGAGACCATCCTGGCTAACACGGTGAAACCCTGTCTCTACTAA 

AAATACAAAAAATTAGCCGGGCGTGGNGGTAGGCGTCTGTNGTCCCAGCTACTCGGGAGG 

CTGAGGCAGGAGAATGACGTGAACCCAGNAGGTGGAACTTGCAGTGAGCCGA 

Sequence 1506 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCATTTCTGCATTTATTTTAGCCCATGGAATAA 

C TGTG CTGAGAAACCACAGAGTCAATCAGATTCAAAATGTTAAAATCCTTCCTGCTTGGA 

GTTTTCCGTCTTCACATCAMGCATTTCATGCCCGTCAGCMCTTTTTAATGCATTTGCT 

CCTCGTTTGCACAATTTCCATTTAMGACTTTCCTTGGCTGACTTCTCTGATGAGGTTTC 

CTGCTTGCCAGGAGAGCACGCTAATGCAGAAATTACAAAGGGGGCTTCACGTCCCTTTTC 

CGGAGGACCTGATATTTCAGATMTTTCCAGCTTCAGTTTTGGAGAAACGACTGTTCTTT 

GCACCAGGGGAAAATAAACTGATTTTCAGTGTAAGCAACCTTTCTGCAAGTAGAATGGGG 

ACTGTTGGGAATGGAGATGAAGACTTCACTTCATGTTATTCATTTCCTTAC 

Sequence 1507 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTA CI 1 1 1 I I 1 1 1 1 1 1 1 1 1 1 I IG 

GTTAGGTTAACAGAAACAATGGGGTTATTTCCGGTAGTGGCAACAGCACGAACTAGTTTT 

CTTGTGGACTAATGGCCCGAGAGTTCCTGCGATTTGTGGATTATTrCCCTTAGATGCAAT 

CGACCATTTCGTTCCAATAAGAATTAATGCTACCACGGGGCTTACAAAGAACCCCTCAAG 

TCTTCCAAATCTGCCCATGACATCAACCTNTGCTGCGTAATCGGACCTGCACCCAACCCA 

GGTTT 

Sequence 1508 

CCGCGGTGGCGGCCGAGGTACGCGGGAGAACAGCTCAGAAGGAGACCCACAGTGAGCAGC 

TCCCCTGTGTCGGCGGGCAGGTCGTCCCTCAAGTGTTCAGCTCTCAGCAGAGAAAAGGCC 

CTGGAGAGGGTGACTCCTCTCAGCTCTCAGCAGAGAAGCAGCCCTGGAGAAGGTAGCTTC 

TGTTCGCAGGCAGATTGTCCAGAGGTCCTGCTGCTCTCAGACGGGGCCCTGGAGAGGATA 

GCTTCTATCCATAGGCAGG 

Sequence 1509 
CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATAGCA

GTGGAAGCTGCAGAGGGCCGAAACAAAAATGAAGTTTTCTACCAATGTCCAGACCAAATG 

GCTCGAAATCCAGCTGCTATTGACATGTTTATTATAGGTGCTACTTTTACTGACTGGTTT 

ACCTCTTATGTCAAAMTGTTGTATCAGGTGGCTrCCCCATCATCAGAGACCMATTTTC 

AGATATGTTCACGATCCAGAATGTGTAGCAACAACTGGG 

Sequence 1510 

CTCCCGCGGTGGCGGCCGCCGGGCNGGA C I 1 1 1 I 1 1 M I 1 1 1 1 I I I I I 1 1 1 1 1 IA AAGAT 

CCAGACTAAGACACATCAACAAGAAATTTCCAAATACCAGGTCAAGAATACTTCACATGT 

TTCTGGAGGGAAAGAAAACAGTTCATATACAGNGAATCAGGAATTACAATTGCATCAGAC 

TTATCAACAGCATCGGGAAGATAATGNGAAAATACCTTCAAAACTCTNGAGGGACACAAG 

AAGATGCTGCCTGGCCACAGGAGCGAGACTGCTGGCCTCCAATCAGTCTTNTGGGCAGGC 

CTNTGATGCAATTACAGGGCTAAGGAACCACGGTAACCAATGTGCCATTGGCTGTGCTAA 

AACCCAGNGGCCCCAGGAAGAATCCTGTTGACACTGGCTTGGAACATGAGGCTTAA 

Sequence 1511 

CGACTACTATAGGGCNATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAAGGTACTTTT 

M I 1 1 1 1 1 I I I 1 1 1 1 1 1 1 1 1 M I GGCATATAATCCAGTTTATTAAATACAGATGATGGGC 

CAGACATGGTGATAGAGAMTACAGATTAAGAMCCAGATCAAATCCTTTTTAAGGAATT 

ATNTAGNGGAAAATATNTCMCTISTTNTTNTTTACNCTACTATTCAT^ 

AATCTTCACCTTTCCATTTTGACNGNCGCTNTTTTACTTCAGNNTCCTGAAMCATNTTT 

CCAACAGAAGTTACATAAAMTNCTMTCTTCMGGGGCTTTCTAAMTATTTTNATCAC 

GTCATTAAAATCTTCTTTCACTAGGCAANGGTTNTGTCTTTATGGGGGCTG 

Sequence 1512 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGACAGAAATGGAGTCATCAGTTTATCA 

ACCAACAAGCAGCTCCAGCACCCAGCACTTCTACCTGAATTTCACCATCACCAACCTACC 

ATATTCCCAGGACAAAGCCCAGCCAGGCACCACCAATTACCAGAGGAACAAAAGGAATAT 

TGAGGATGCGCTCMCCMCTCTTCCGAMCAGCAGCATCAAGAGTTATTTTTCTGACTG 

TCMGTTTCAACATTCAGGTCTGTCCCAACAGGCACCACACCCGGGGGTGGACTCCCTGT 

GTAACTTCTCGCCACTGGCTCGGAGAGTAGACAGAGTTGCCATCTATGAGGGAATTTCTG 

CGGATGACCGGAAATGGGTACCTCGGCCGCTCTANAACTAGGTGGGATCCCCCCGGGCTG 

NAAGGAATTTCGATATCNAGCTTATCGATACCCGTCCGACCCTCGAGGGGGGGGGCCCGG 

GTCCCAGCTTTTTGGTCC 

Sequence 1513 

NGGGGCCGCCGCCCCCGGGGCCGGNCAAAGCAANGGGGGGCCAACNAAGGCNNGGNNNCC 

CCNGGGGGGNANGGAAAAACGNGGGNAANCCCCGCGGCAACCAAAAGGGGNCCAACAACA 

AAACAAGAACCGGGAGGCCCCGGGGGAGNCCANAAAAAANGGCGGCANAAAAGNCCCNGG 

GGGGGGGGGGGCCCCNAAAAAAGGAGGGNGGGANGCCAAAAACCAGCNACNAANCAAAAN 

GCGGCCCGGGGGGGCCGCCCCCAACCNGGGCCCCAGCCTTAAAGNCCCAAGAANCCGCGG 

GGAAAAAACCCCNGGGNCCGGGGGGCCCCAAGCCCNGGGCCAANNAAAAAANGGGAAAAA 

NNCGGGGCCCCCAACCCGCCCGCCCGNGNGGGGAAAAGGAAGGGGGGCCGGGNNCNCCGC 

CCGGNAANNCCGGGGGGGCCGGCCCNCCGGNGCCCCGCCNNNCCCCNNCGGCGGGCNAAC 

CNGGGAACCNGCCGGCCNGGCCGGCCCNCCCGGGGGGNCCCGGNNNACCGGGANGNGGNC 

CGGGCCGAAGGCCGGGGNANGCAAGCCGNCNAACCNCCAAAAAAGGGGC 

Sequence 1514 

GTNATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTMGTCTTNTAMTCTGGTGTGTTT 

TTTACACTTGCAGCACATCTCAAATGAAATAGCAACATTTCACATGCACAAGACTCACAT 

ATCTTCTGATGGCTATCTTGTCGAATAGCACAATTCTAGGAAGTTCAAAAGGGGGAACAA 

TTGCTTCTCAAATTGATGAAGGTAGAGAAGAATGGTATCAGAGAAACTGATTAAGCAGGG 

ACCTGGGMTCTGAGTTGAGTTCCGGGGCATGTTTGTTCCCTAGCACCATGACCTACATG 

GCAGTGTCTCTTCTCATACTATTNTAAGAGTCCCTGG 

Sequence 1515 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGAGGCCAAGC 
TGGACTGCATAMGATTGGTATGGCCTTAGCTCTTAGCCAAACACCTTCCTGACACCATG

AGGGCCAGCAGCTTCTTGATCGTGGTGGTGTTCCTCATCGCTGGGATGCTGGTTCTAGAG 

GCAGCTGTCACGGGAGTTCCTGTTAAAGGTCAAGACCCTGTCAAAGGCCGTGCTCCATTC 

AATGGACAAGATCGAGCGGCCGCGGGCAGGTACTTCATGAGACAAAATGGAAAAAGGAGA 

ATTAAATTAAAATGCACAACTAATATTTATCTACTACAGACATAATATTTCTCAGTTGTG 

MCTMTTACTATGCTTGGAAMTGCTAGCATCCTCATAAATATTTTGGTTCTATTGGGA 

TACAAMTCTGATTTCGCAMCTTTGCAAAGGCACATTTT 

Sequence 1516 

CCGGGCACGGTACGCGGGGGCATTTATCTTCACTCTGATGAGGGCTCAGACTTGATAACG 

CCCGTGGTGCCCCATCCCTATAGGAGCTGGTGAGATTGCAGCCTGCTGCCTCCCCTCCAT 

CAGCCACAGC TATT GGATTTCCCACCCAGMTCTTTAGGTAAATGAGGTAAGTCCTGATT 

TTTAAMCTTCTTTTGAATCTGGMTCCAMCACCTGAGTGGAMGAGAAGCCTGCTTTA 

AACTGGACAGATGAAACTAGAACAGACTCTTGGAGACGGCTGGCAGGAAGTGAAGCTCAC 

CTTACCTGGGCTTACCTCACTGGGTCAMTCAGAATTTTATTTTGGAGGGCAGGTTGGCT 

ACTTTGGATATTATCTGNGMTTTCCTGCATTGGCTGGACTTCTMTCTCTGNGAATTTA 

AAAGCC 

Sequence 1517 

CCGCGGTGGCGGCCGAGGTACCTTTTAGTTATTGACAAGGTTAAAATAGCACTCTCAGTT 

TTTCAGTATTACAGAGAGCAAATAGTTTCCTCTCCTGCTCTGNGCAGTAGCTTTTCCAGA 

ACTATGGACAAAATTNGATCAGAMGMGATTGATTATTTCCTCATCTTTTT^ 

TTTGAGACAGAGTCTCTCTNTGTCCCCCAGGCTGAAGTGCAGCGGCATGATCTCAGCTCA 

CTGCAACCTCTGCCTCCCGGGTTCAAGTGATTCTCCTGCCTCAGCCTTCTGAGTAGCTGG 

GATTACAGGCGCCGTGCCANCATGCCCGC 

Sequence 1518 

CCGCGGTGGCGGCCGAGGTAC I I I M I I I I I I I I I I I I I I I I CTTTCTTTC I 1 1 I I I I I I 

I M I I I I I I II I I I I I 1 1 I I I 1 1 I I 1 1 I I I I 1 1 1 I I 1 1 I I I I I I IGGCANNAAAAANCNT 

NATTNCCATTNGGCCCAAGGCTTGTTAGGAAAGTAAAAAAAGCCCCCNATTGGCNGGNGG 

GNNNGGCTTAGGCAAAACCCCTANTACTTTGCANGGGGCCCTCNAAAAAGCCCGNGGGCC 

CNAAAAAGNCCTTAAANCCNCCNTGNAANAGGGCCCCTTNAAAAAAAAACCCCCCNNNNN 

NAAATTTTGNGGCCCCCCCCCGGGGGGGGGGGGGGGGGGGGGGNGGCCCCCCCCCCCNTT 

TNTNNAACCIMCCCNCCCCCCCCCCNNAAANGGGGGGGCCCCCCCNAAAAAAAAAANGGGG 

GGGG GGGNNNGGCCCCCCCCCCCCCCCNAAAAAAAMAAMNGGGGGGGNGGGGNTTTTT 

TTTTTNCNCCCCCCCCNCCNNNCNNNNNNNNNGGGGGGG 

Sequence 1519 

CCGGGCNTGGAC I I I I I I I 1 1 1 1 1 I I I I I 1 1 I I NCTTGCTTTTGGTTTTTTCCTTCGATA 

TCTTCAAAATCTGTGTCAGAAGAAAAATGTGTTTCTGACTCCTGTAGTAGATTAAAATCA 

GTTGGTATTTCTGGAGCTGCTATCATTTCTTTATCATCTTCTGGAACACCTCAATGTCAG 

AAATATATTCATATATTTACGTGGGTAAMTGGNGGCTTGCCTTCTTTCTTCTAAGACAA 

TTCCTC ACGG NGGGCCAAGCGCCCTCCGGGTGCCCCTCCAAACTTNTAGAAGCTTCGCCT 

CCGCCATTTTATAACTTACCTCCCCGCGTACCTN 

Sequence 1520 

CCGCGGTGGCGGCCGAGGTACATCACCCTGCTGAGGGACATCCAGGACAAGGTCACCACA 

CTCTACAAAGGCAGTCAACTACATGACACATTCCGCTTCTGCCTGGTCACCAACTTGACG 

ATGGACTCCGTGTTGGTCACTGTCMGGCATTGTTCTCCTCCAATTTGGACCCCAGCCTG 

GTGGAGCAAGTCTTTCTAGATAAGACCCTGAATGCCTCATTCCATTGGCTGGGCTCCACC 

TACCAGTTGGTGGACATCCATGTGACAGAAATGGAGTCATCAGTTTATCAACCAACAAGC 

AGCTCCAGCACCCAGCACTTCTACCTGAATTTCACCATCACCAACCTACCATATTCCCAG 

GACAAAGCCCAGCCAGGCACCACCAATTACCAGAGGAACAAAAGGAATATTGAGGATGCG 

CTCAACCAACTCTTCCGAAACAGC 

Sequence 1521 

CCGCGGTGGCGGCCGCCCGGGCAGGTACATCACCCTGCTGAGGGACATCCAGGACAAGGT 




CACCACACTCTACAAAGGCAGTCAACTACATGACACATTCCGCTTCTGCCTGGTCACCAA 

CTTGACGATGGACTCCGTGTCGGTCACTGTCAAGGCATTGTTCTCCTCCAATTTGGACCC 

CAGCCTGGTGGAGCAAGTCTTTCTAGATAAGACCCTGMTGCCTCATTCCATTGGCTGGG 

CTCCACCTACCAGTTGGTGGACATCCATGTGGCAGAAATGGAGTCATCAGTTTATCAACC 

AACAAGCAGCTCCAGCACCCAGCACTTCTACCTGAATTTCACCATCACCAACCTACCATA 

TTCCCAGGACAAAGCCCAGCCAGGCACCACCAATTACCAGAGGAACAAAAGGAATATTGA 

GGATGCGCTCAACCAACTCTTCCGAAACAGCAGCATCAAGAGT 

Sequence 1522 

TACT ATAGGGCGAATTGGANCTCCCCGCGGTGGCGGCCGAGGTAC I rTTTTTTTTTTTTI 

TTTTGGTGGTMTTGAGCTGGGAAAAATTCAGAATTGGGTCATAATTAATGGTAACTAAA 

CAGATTTGNGMTATGGGACATCTGTGGNCTTGAAMCATCAGTATGATTTGNCCCCATA 

TTTCTTCAGCCTGGACAATAGAAACAGACAGGGGAGGGGGGTAAAGTGCANTAAAGTAGG 

CTGAGTGATGTGGTGCTAGCAGCTGGAGTCX)AGAGAAGTTCTGACAGTGCAGGGAGCAGC 

CCCTTTGTTCTTTGGAGCACTGGAAGGGCTGAGCTGCATCTGAGGTGTTCAAGCCACCAA 

CAGGACAGGGTAGAGGACTAAGTAGCACATGTCCTCCAGAGCAGCTTCCTGTCTTTGTGT 

GGTCACATCACATCGGGGGAA 

Sequence 1523 

CGACTNCTAT AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC I I I I II I I I 1 1 

i i i 1 1 n I I I iGCTTCT GGTA AAMGAAGCATGTTACAATTTTCCCCCCCAATTCACATA 

GGACTCTAAGMCATATTTTAAATCAGTGCTTCCATACAGGMCGAAATCCACTATTTTA 

GMTTCTAAATCTTGTTGAAAAGCAACCTTATCTGAAGAGTAAACAAGAAGATTCAAAGT 

TAAGTATCAGTGCAGTCCAGAGCCCCTAAATGAATAAAACTGAATGTATCTTAAAAATAG 

GATTTGCACACCAGTMGAGACTTGTTTACGATTCTGGGGAGGAGGGAAGAAACTGTAAG 

AGGGAGAGAAAAGGGAAGAAAAACAAGAAGAAAAATAATNGGAAAAAAAAAAAAGAAAGA 

AAGGTTTGTTTAGCTCAAAG 

Sequence 1524 

ACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAATTGAGCTTAG 

TGATATTAACTTGGTAAATTAAGCCAGTTTGTCTCTTTGCTTTATCTGTCTTGTGGAATC 

TGTATCAGTGGAAATTAAATAGTCTTTACATTTGATGAACCCTGTTTAGGTGTTGGAAAT 

ACCCATCTATTTGTTAAAAAGGCAAGGTCCCATGATTTAGTGAATGGGGGATACAGACAG 

CCTTTATTCAAGTAACTGAATAAACAAAAGAATTAGAGAGTGTGATGAGTTTGAATAAAA 

AATATAGNTCATAAAAACCAGAAATGTGATAGAGCATAGTGGCTGGAAGAAAGTTACCCA 

AGTGGCTTGGGTAGTCAATGAAGTTGACTCCAACATGCAGTAGTACGCCGGGTCCTAAGA 

TAGAGATTAAGTCATGGTTTAAATGAGGAACAATCAGTAA 

Sequence 1525 

CCGCGGTGGCGGCCGCCCGGGCAGGGTACACCACTGTGCCTGCTTTGAATCCTTTACGAA 

GAGAAAAAAAAATTAMGAAAGCCTTTAGATTTATCCAATGTTTACTACTGGGATTGCTT 

AAAGTGAGGCCCCTCCAACACCAGGGGGTTAATTCCTGTGATTGTGAAAGGGGCTACTTC 

CAAGGCATCTTCATGCAGGCAGCCCTTTG 

Sequence 1526 

GAGCNCCCCGCGGTGGCGGCCGNTGTAC I I I I I I I I I I I II I I M I GCTTNTTAATTGGA 

TGCCTGGAGACMTTCCATTTCAATTACCTTATTGGCATGACNAGATATACAAGGGCTGC 

CAATGTCAATACATTAAGACTGAGCGTGCTGGAGCAGCAGCCAGGGTTCAGGGCACTGCT 

GTGTCATCTGCGCCACGGTGCACAAAGGCAGNTTCAAAAGCATTTCAGCATGATCGCTTC 

CCTCTCTCCGCTCCTGGGGAGAGAAGGATCCTGCACACCACAGGCAAATCATGCTGAAAT 

TGAGGNGGNGCCTTTGGGACTCCCATCCCATCACAGTCTTGGGAT 

Sequence 1527 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTGTGGATATTGGTTGAACA 
AACAGGTGGGCAAAGTGAGGAAGATAAGAAGTCCATCCGTTCAGTTTCCCCACTGCGGAG 
GGAATAACACTGTCTTTCCACAGGTCACAGACTGGGATGAGCAACGGGCTGAAGGCACGT 
TTCCCGGGAAGATCTGAACTGGCTGCATCTCCCTTTCCTCTGTCCTCCATCCTTCTCCCA 
AGATGGTGAAGGGGGACCTGGTNCTG
Sequence 1528 

AGGTACGCGGGATGGCACATANGACATCAGCTAGGCTTTTGGGMTCGTTTGTGTTCTTT 

GTGGAAATGTCCTTTAGAAGCACCCATGAAGTAGTGTGTTCAGACTGTGCACACAGAAAA 

CAGGCTCTGCCTTCACATGTGAGACGGTGGACTTTTCCTNTGGACAAAATGACAGCATNC 

TGGCGACTCCACAGTGGAGCTGAGCGCCACTCCCTGTAGCCCGATCTGGGACTGAAACGC 

TTACACCTCTGCCTTAGAAGGAGTCCCCCNTGCC 

Sequence 1529 

CCGCGGTGGCGGCCGCCCGGGCAGGGTACGCGGGGGAATTAGTCCGAGTGGAGAGAGCGA 

GCTGAGTGGTTGTGTGGTCGCGTCTCGGAAACCGGGTAGGCGCTTGCAGCATGGCTGACC 

MCCTGACTGAMGAGCAGATTTGCAGMTTTCAAAGAMGCTTTTTTTCACTATTTT 

CAAAAG ATGGG TGGATGNGAACTATTAACCAACCAAAGGGAATTTGGGGAACCTGTAATG 

AGGATCI I I I IGGGCCAGAATCCCCACAAGAAAGCAAGAAGTTTACCAGGGAACATTGAT 

TTMTGGAAAGTMGAATGCCTTGATTGGTTAAATTGGCACCMTTTGGACTTTCCCTTG 

MTTTTCTTGACAMTGGATGGGCCAAGGMAAMTTGGAAANGACCCCAGNACCNGTTG 

GAAGGAAAGGAAAA 

Sequence 1530 

CCGC GGTGGCGGCCGAGGTACG CGGGTGTTCCTTTTTGTTCAAAGTCTATTTTTATTCCT 
TG ATATTTTTC I I 1 1 I I I I I I I I I I 1 1 GNGGATGGGGACTTGTGMTTTTTCTAAAGGTG 
CTAT TTMC ATGGGAGGAGAGC GTGT GCGGNTNCANCCCAACCCGCTGCTNACTTTTCAN 
CCNNTmCACCNGNCTNTGGGTTTT^^ 

AANCCCTNNNCCNAAN NGNG GGGCAAANANCCNGNGGGCCCNCTANAAAANANCTTGCGG 
CCCX^TGTCCCCCGGGGTTTTGAGGACMNTTTNCCCAMGCNNAAAAANAGANGTTTTTT 
CNCCNTCNNCTCCTGGNGGGGGGGGGGNGAAAAAAAAAAAAAAAAAN I I 1 1 1 1 1 l I CCCN 
GNN NNGGG GNGGNNNCTTNAAAAAAAAAAAANAATGTCCCCCCCCCCCGNNNNGTGGGCG 
NNMTTTTTNTTAANAATATTTTTNNCCCCCCCCCCCCCCCCCCGGGGGGGG 
Sequence 1531 

ATAGGGCGAATTGGAGCTCCCCGCGGNGGCGGCCGAGGTACCATTCCGGGTCATCCGCAG 
AAATTCCTCATAGATGGCAACTCTGTCTACTCTCCGAGCCAGTGGCGAGAAGTTACACAG 
GGAGTCCACCCCGGTGTGGTGCCTGTTGGGGACAGACCTGAATGTTGAAACTTGACAGTC 
AG AAAM TAACTCTTGATGCTGCTGTT^ 

CCTTTTGTTCCTCTGGTAATTGGTGGTGCCTGGCTGGGCTTTGTCCTGGGAATATGGTAG 

GTTGGTGATGGTGAAATTCAGGTAGAAGTGCTGGGTGCTGGAGCTGCTTGTTGGTTGATA 

AACTGATGACTCCATTTCTGTCACATGGATGTCCACCAACTGGTAGGTGGAGCCCAGCCA 

ATGGGAATGAGGCATTCAGGGTCTTATCTAGAAAGACTTGCTCCACCAGGCTGGGGTCCA 

AATTGGAG 

Sequence 1532 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCAGMGGTCCCGGCAGCACGCACGGA 
AGAAGACGGACCCCGCGATGAGGGCGGCGGCAAGGAGCACCTTCATGTTGCGTTCGGAGA 
GGCG CA GCATC CACAGGCCCCGCGTACAGACTGGTTTGGTAMTGCTAMCTTTTGTGTC 
TTTTGCCI I I I I AAAGGAATTGTTAACATTGGAATTGAGGGTATGTACCT 
Sequence 1533 

AGGTTTTGGGCGATCGT11TCATACGAMTATTTGAGATGCTTTAGATGTGTGTGCATGT 

CAGCTGCCACCTGAMGAAAGGCCTCATTAAAGATTTTCACTGATTAACTCTTTGATTGT 

TCTTGGGATCTCAGATGGGAATTCACGCTGCTTGCTGCAGAGCTCTTGGGCTAAGTGTAT 

TTTCT TAATT ACTGAGAAATGCGTGTTATCAGTAAGCAGTGAAAAGTCTTGAAAAAACTA 

AGTAATTTTAAAAAAAAAAAAAAAAAAAAAAANGTCCTGCCCG 

Sequence 1534 

NCGACTNCT ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTA CI MINIMI 
TTTTTTTTTTTTTTl I GGNGGNNGTTTTCATTTTCTTCTGGACTGGTAATTTTTACTTCT 
GGCCTTGGTGGTCTTTGAGGGTCTACATCAGCCTNATCTTGTCCACTGCCAGCTCCTTGA 
GACTTTCTTTCCTTGTCTAGGACCAGTTGAAGCTCATTATTAACCTGCTGCAGCTCCTCA
CCATTCTCCTCATCGGNGCCTCCATTTACACGAGCAGGGCGTGCCTGCTGCTCACTGTGA 
TCCAAAATCTCCTNGACTTGCTCAGGGGCATmrGTAGCCTCC 
Sequence 1535 

CACTT^GGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTTTGGTG 

miii iGGAAAAGTTACATTTAGATCTATTCTGAAGCTGTTCATTTTTAACAAATAAAAT 

GTTACAGGTTTCACATGATTTATTCGCAAAAAAAAAAAAAAAAAAAAGTACTGTCTGTGA 

ACAGCAAGGAAAATATGATTACACCTAAGAGATGAAATTGGCATAGGCGAGAAGTCAGAA 

AAATAATCTATACAGCTTGCATGGTTGGGGAGTTAGGAGAGGCCAAGGCCACGTGCACGT 

AGAGCAAGAGGTAGAAGAGGCCCGGGGGCTAGAGCGCACCCTGGTGGATAGTGTGAGAAT 

TTCACACTGGCTCAAGCCTTGAAGACCACCCCAGGGGTGCGCCTTAGCAACGCACTTATG 

CMGACCCCAACAACTGGCCCTTGAAAGGAGCTTTTTACTGGTGGGATGTG 

Sequence 1536 

AGGTACTAATATCCCTTAATGGCAGAATGTGATAATCATGGAATTAATTATTGCTAAAGT 

AGTTTTCAAATAAAAAAAAGAAAAAAGAAAACAAAACAAATTAAACTTGACACAATCTGA 

CCAMCACGTGTCAATTTACAATTTCAAGGTTATTTTACAAAATACCTACATTTACACAA 

TAGGCTCCCGGCAG CATTT CCAGACAMTGTTCTTTTAATTTATCCTGACATGCTATAAA 

TGMTAMTTACACTATTTTAAAMTTATCATCAGTMGTTTTTCCTTTCTCATGGGGGT 

CAANAGCAAAAAAGAAAATNNAGGCNTGCCAAAGGAAGGATTTGGAGAGGGGAAAGCCGC 

CACGCACCACCAGTATAACCTTAAAATAAACAGGGAGGGGGCTGGGGGGAGGGGGTCAAC 

AAATTCACTTGAAACCCTCACAACAAGTGGTGGATGCCTCTTCCAAAGGGGGGNGGAAAA

GGTTCMNCTTGGCTTTTATTAGGCCCANTTTCAAAAAMGGTATTTTTAAAAGNAAAAA 

ATGGTTTTTGGTANGGGGAAAAA 

Sequence 1537 

NCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTCAAACACCTCATGA 

GGCTGGCCAGAGATGACACCACTGCCCCGGATGCAAGGGGGCAGCCTTGCTTTGGACCCC 

GTTCTGAATCTCAGGCTTCATCTATTAGGCACACTGAGCAGAAACACTCCTCCCTGCCAC 

TTCCGAGTTGCAGCAAAAAGGGTCTAACTATGATTTAAAGCAAATAATTTTAAACTTCAA 

ATTTTCATTACTGTCATACTTATGAGGACCAAATAATGTAATGACTAAGACATCAAATTA 

CATGTAATATGAATAAAACCAGTAAGAAAATGATAGTTACTTATAATTGGATCAACATAC 

AAAAAGAACGTCATTTGGCCAAAGTAAAACGGTTAAAAAAATAGTGTCTATAAAATCACC 

ATATAAAATCTTAGTATGTACCTGCCCG 

Sequence 1538 

CCGCGGTGGCGGCCGCCCGGGCATGGTACAAAACAAATTCAAAGGGTTAAGAATTTCAAG 

TTGGTAGCTTCAGAGCATTAAGCCCCAAATGTGGGGTCCCTTCTGAGCCCAGGATTATGT 

GTGACTGCACTGGTCACATTGCCATGAAGCCAGCCCTGGTGGTTGGTAGAGGGGTGGGCA 

CTGGACCTAAGCAGGGCCAATCAGAGCCATTCCTGGAAATGGATATATAAATATTAGGAA 

TGCAAAGTTATrTATTTTGTTAGAGTTTGAACAACNGTGATCTGTAGAAAACCAGTCACA 

TTTCCTTAAGGATGCATNAATAAATATGAGCTAAATGTAATAATTAGAACTTTGGTTTCC 

AAGCAGATTCCATGCTCTAATTCCCTGGATAGGTGAATATAATAAGATGATNTATCCNTG

AATAAGNTNCTTTTNCTTTGCCCAAGGGGACANGATGGTAAATAAGGGCTACTAAATCAA 

GTTGNCCTTTAAAATAA

Sequence 1539 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAAAACAAATTCAAAGGGTTAAGAATTTCAAGT 

TGGTAGCTTCAGAGCATTAAGCCCCAAATGTGGGGTCCCTTCTGAGCCCAGGATTATGTG 

TGACTGCACTGGTCACATTGCCATGAAGCCAGCCCTGGTGGTTGGTAGAGGGGTGGGCAC 

TGGACCTAAGCAGGGCCAATCAGAGCCATTCCTGGAAATGGATATATAAATATTAGGAAT 

GCAAAGT 

Sequence 1540 

GTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCATGGCCCTTCCTT
TGAAATCATTTTTTCTTCCAGGCCCTTGCGCTCCGGGCCTGGGAGACAGAATGAGAGACA 
GCCCC AAGGA TCTGTGATACACTTTCAGGGTCATTTTTTCCTTGTCCXDTGCCAGTCCAGG

CAC ACCTTT TGTGTTCTCTCCTGMCATACTTTCTCATTTTTCACAATATGGAAAGGCTG 

ATAA I I I 1 I CAMCCTTTMGTTCTCTTTCCTTTTGATTGACCATTTTGTCTTTAAATCA 

TTTTTCTTCTCACATTTTATTATTATAMCATTCMGGGAAACCMGCCATTATTTTGC^ 

TAGAAATTTATTCAGCCAGCTGGGGCGGTGGTGGCTCACACCTGTAATCCCAGCACTTTG 

GGAGGCCCGNGGTGGGGTAAGATCACCCTGAGGTCAAGGAGTTTCGAGACCAGCCTGACC 

MTAATNGGTGAMCACTTGNCTTNTACTTAAAAAATACCAAAATTT 

Sequence 1541 

GGAGCTAAACCCCGGNGGCGGCCGAGGTACTTACCCTCMTTTCAATGTTAACAATTTCT 

TTAAAAAGGCAAMGACACAMAGIsrrTAGCATTTACCAAACCAGTCTGTACGCGGGGCCT 

GTGGATGCTGCGCCTCTCCGAACGCAGCATGAAGGTGCTCCTTGCCGCCGCCCTCATCGC 

GGGGTCCGNCTTCTTCCTGCTGCTGCCGGGACCTTCT 

Sequence 1542 

CCGCGGTGGCGGCCGAGGTACACAMCGAGATGCTACCTAGGAGAAGGGTATTCTTTTCA 

CTATTCTTTCAAATTTTCTGTATGTTCGMCATTTTCATAGTAGAAAGTTGGGGGGAAAA 

TCTGTTTCATAAACATTTCCTCAGCAGCAGTCCAGTCTATTGCATTTTAATTGGTTGTGA

TATCATTGTTTTATGCAATACGTTCTCAACAAGTATATCCTCCGGCAAACTGAACAAGGA 

CCAAGTCTGTTCTGCCTACAGCTCTGCTTCCTCATAGCTGCTTTCCAGAACGTGACTCTT 

GCAAATTATCAAGAAAGGGGAACTAATCTAAGGGATCCCAGATCAACAGCCTNATGAAGA 

CCTTAAmATGNTrrCTMNATAAMGATAGGMGTTTTCAAAAAAAGCCCCTGCTTCAC 

ANAGGATCAANANCAGGGGGTGGGCCCTGCTGGGCTTNCACTGGGATTTTTTGAGCATTN 

CTTTCCCNGGNGGCNCGGNAAGGGGNGTGGNGTGAGCCCNAGGGNGGAAAAAAATTT 

Sequence 1543 

CCGCGGTGGCGGCCGAGGTACTCCTTCGTAAACCATGGAGAGCCAGCCCAATGCACAGCA 

GTGGATATCATCTTTCTCAGAGTCCAGTATCACAGAATCACGACTTTGTCCAGCTGCAGG 

TGCCTGCAGGTCACACTGGCTMCTACTTCTGTGATGGGCTCTTCTTTCTGAGGTTCTGC 

CMCTTGTCTACTACATAGGGTTGATCATCCTGTTCAGGAAATATTTCTTTCATTTGCTC 

TGAGCTTAATATTGTAATTTGTATTTGATCTGCTGGGTCTTTGGAGTCAGGACTGGTTTT 

ATCAAGCAGTTTGATCTTCTGAGGTCTGGTATTGTAGTTTGCTGGCCCACAGAACCTTCA 

CGTGTATTCACAGCCTCAATGCCATAAGGAAACTCTTT 

Sequence 1544 

CCGCGGTGGCGGCCGAGGTACTCCTTCGTAAACCATGGAGAGCCAGCCCAATGCACAGCA 

GTGGATATCATCTTTCTCAGAGTCCAGTATCACAGAATCACCGACTTTGTCCAGCTGCAG 

GTGCCTGCAGGTCACACTGGCTMCTACTTCTGTGATGGGCTCTTCTTTCTGAGGTTCTG 

CCAACTTGTCTACTACATAGGG7TGATCATCCTGTTCAGGAAATATTTCTTTCATTTGCT 

CTGAGCTTAATATTGTMTTTGTATTTGATCTGCTGGGTCTTTGGAGTCAGGACTGGTTT 

TATCAAGCAGTTTGATCTTCTGAGGTCTGGTATTGTAGTTTGCTGGCCCACAGAACCTTC 

ACGTGTATTCACAGCCTCMTGCCATMGGAAACTCTTTTAAGAAGTTCTGACAGCTGGT 

CATGTTAGGTATMGMCAGGGTGCCTTATCACTGGTGGATTTCATTTCT 

Sequence 1545 

CGAGGTAC I I I I 1 1 I I I I I I I I I 1 1 I I I I GGGGAGTTTGTTGTTTTTAACCAAATTATNA 

TAGATGGMGCATTAGGCAGCTGAATGTTCATTTGCCTTCANACATCATNTCCTATTTCA 

TTTGCTNGNCTCGCATTAAAMGMTCATTTATCAGCAAAAGCATCATTTATTTGTTTAA 

ATGACAAGGTTTAGCTAGCAGAGNAGAGTTTGCNATGCTTTTAMNAMTMCNTTTGACj. 

TTTTC TTCAA GACACTACAAMCCATTTGTTCAAAAMGGCTGCCCAANGTCATTTANAA 

G MTATT TTTCAMANGTCTATTTCNTATTTTTAAAAMC 

GGTTTTGAATTCCACCTGGGCCTTTTCTTTTTMCCTTGAAAGGGCTTAAA 

Sequence 1546 

GGCGGCGGCCGAGGNACAAGNANCNNTTGGNGGAGGGGGGNGAAACCCAANACCCGAACN 

NGGGACTGNGCAGACAAGCTATATCTTAANCCCNCNCCGGGCCAGACCNCNAGCAAGGGN 

GAGGAAGCAAAAGNCCACAGNNACNGGGGCAGGNAANNGGNANAAANGAGGGNGNGGGGC 
NGGNTGCGGTGNTTNACAGGGGGAANCCCAACACCCCGGGAGGNCGAGGCAGGAGGANCG

CCCGAGCCCAGGAGGNNGAGACCAGCCNGGGCAACANGGNGAAACCCNGGCNCNACAAAA 

AANACAAAANNAGCNGGGCANGGNGGNNNGGGCCAGNGANCCCAGCNANNNGGGGAGGCN 

GAGGNGGGAGAAANGCNNGAGCCCAGGAGGCAGAGGNNGCAAGNGAGCCCAAGAANGCGC 

CACCGGAACNCCAGCCCAGGCAACAGAANGAGAACCCGGNCNCACANNAAAAAAAAAAAG 

AAAANGAAAAAGAAAAAANGNCCCNGGCCGGGCCGGGCCGGNNCNAGAACCANGGGGAAC 

CCCCCGGGCNGCAGGAANNCCGAAANCAAAGCCAACNGANACCGGNCAACCCCGNGGGGG 

GGGCCCNGGNACCCANNNNNNNGGNNCCCNNNAGGGGAGGGGNAAANGGCNCGCCNGGNC 

GNAACAAGGGGCAAAAGCNGGNNNCCNGGGGGGAAANNGGNANNCCNGNCNNCAANNCCC 

NNNAAAAAACNAANCCGGAAGCAAAAGGGNAANNCCGGGGGG 

Sequence 1547 

AACACCACCGCGGNGGCGGCCGNCCGGGCAGGNACTTTNNTTNGGGGGNTAAAAAACCCC 

CCNAAAAACNGGGCCCTAGNAGAAGGCAACNTTCATTNCAAACGAGGGGGCCCCNGCCCC 

GNGAAAAAACGGGGAACACGACGNCNAAGGCAGANCCCCGNAAGNACCTACNNNGGACAG 

CCGGGGCAGGCGNGCAAANNNTTTGGGCNNGGCCNCGCAAAGCACAAAGGGGGACACANA 

ACCCACTGCCCACGGCGCAGGAGAAAAAAAAGAAACAGCAAANCACGAGGGGACAN 

Sequence 1548 

CCGGGCAGGTACATCANTTNTGCTGAGGGACATCCAGGACAAGGTCACCACACTCTACAA 

AGGCAGTCAACTACATGACACATTCCGCTTCTGCCTGGTCACCAACTTGACGATGGACTC 

CGTGTTGGTCACTGTCMGGCATTGTTCTCCTCCAATTTGGACCCCAGCCTGGTGGAGCA 

AGTCTTTCTAGATAAGACCCTGAATGCCTCATTCCATTGGCTGGGCTCCACCTACCAGTT 

GGTGGACATCCATGTGACAGAAATGGAGTCATCAGTTTATCAACCAACAAGCAGCTCCAG 

CACCCAGCACTTCTACCTGAATTTCACCATCACCAACCTACCATATTCCCAGGACAAAGC 

CCAGCCAGGCACCACCAATTACCAGAGGAACAAAAGGAATATTGAGGATGCGGTGAGAAG 

GGGGTGCTCAACCAACTCTTCCGAAACAGCAGCATCAAGAGTTA I » 1 1 I CTGACTGTCAA 

GTTTCAACATTCAGGTCTGTCCCCAACAGGCACCACACCGGGGTGGACTCCCTGTGTAAC 

TTCTCGCCACTGGCTCGGAGAGTAGACAGAGTTGCCATTCTATGAGGAATTTCTGCGGAT 

GACCCGGAATGGTACCTTNGGCCGNTTCTAGAACTAGGNGGATCCCCCGGG 

Sequence 1549 

CCGCGGTGGCGGCCGAGGTACGCGGGGCTTGCATCTCTGGGGCCAAGGAGTGGTGGGTGA 

GATCTTCCATGGCCCTGGCATGGGTGATATAAGCGGGACCGGTAAGGTGGTGGAGCTCTT 

ACCAGACCCTGCANAACCCTCTCCGTGGTGTTGAACTTCCTGGAACCAGGGTGTTGCATG 

TTTTCCTCATAATGCAGGTTGGTGATGGTGAAGTTGAGGGTGAACGGCACCAGGAGAGGG 

CCAGCAGTTGTGGGGCTGGGGAGGGAGGATGGAGTCCCTGACCCAAGGTCCACTGTGGAG 

GTCCCAGGAGCTGAAAAAAGT 

Sequence 1550 

AGGTACTTTACAAACAAGTCTGAAAAAGGAGGGAGTAAAGTATGGAAGAATGATCTCTGG 

ATGTTGCTACTGGCCTCAAAAAAGCAGTGCTACAGATTTCTGTGTGAAGAGAATACGCTG

TTCACACATTTTCCTATTTCCAGGCATGAAAATATTCTATTGGGTAGAAGAAATAGGAAA 

ATCTCTTATGACAAATGAAAGACAGGTGCAAACACACCAATCCCTGTCTAGCAGTATAAA 

GCATATTGGGCTCAGMTTTGTCTGTTGCTAGCACCTGGCTTTCATACTATATCCTTATC 

AAATAATCAGATTGAAAGTCCAAATCATTCTTAAGCAAGCAAAAATCCTCAGTGGCCATA 

CCTCA 

Sequence 1551 

GGCGGCCGCTCTAGAACTAGNGGANCCNTTTTGCGGGGGGAAAAAAACCCCAAGCCCANC 

GANACCGNCGACCNCGAGGGGGNNCCCGGNACCCAGCGNNNGCCCCCNAAAGAGAGGGNN 

AANN GCGC GCNNGGCGNAANCANGGNCANAGCNGNNNCCNGNGNGAAANNGNNANCCGCN 

CACNTTTTNNTTTNNCNGACGAGCCGGGNGCANAACCCCCAAANANA 

Sequence 1552 

CCGGGCAGGTACGCGGGCTGCCTGGGGATGGCAGCCGCGTGCGTTCCCTGTGCTGTCCTG 
TAGGGTTGTGGTGGCTTTCTGATATTCAGGTTGGTCCC7TTTCCTTCTTTTTGGATGGGC 
TTTCGAGGTTGGGAGAGGAAGCTCAGGGATGGAACTGACTAAACATATGTGAGATGTTTC
CTTGTTTGCTGCTCATGGAGGTTTCATrTCTC 

TGTGGATATATGAGGTGACTCTAGMCCCTCATTTTATGGATGAAGAAACTGTGATTCAC 
AGAGGGAGAGTGATTTGTCCAGTGTCACGTGGGGAGCTGGCCGGGGAGCCAGACCTCCTG 
TACCTCGGCC 
Sequence 1553 

TACTATAGGGCNATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATTTCCATG 

GGCCCTOTTCCCATTGATGTATACTGCTTCCTTACTAACAGTGAGGGATGACTTTCATCA 

GTCTTTTATCACCTGAACAGTCTTCCGGCCATAATGATAGTAACTATAAGCTGATGCAGC

TGTGGTGAAAGCTGTAMACACCTTTTATGGAAGAAAAGAAATAAAATGTAGTTGTCAAG 

TCTAAAAAATAGTAGCAACGGGAATCATAATGAATACATGCAATGAATTTAAAATGTAAA 

AATGMTTTA AAMG TAAAMGGGCTCTGTGGTGTMTTTTTCTTAACTACAAGAGTCTA 

AATACACTGCTTTTCTTTMGAGTTCATTTTAATTAGTMCGTCAMCAAMTTATTCTA 

GATAATGAGCCCTACAAA 

Sequence 1554 

ATCGACTNCTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACGC 
GGGTGGCCAGATTMTTTTCCAGCCMTAGAGGAMGTTTTT 

TGCTGGAGAGGGGAAAGGATGCCACACTTCAGAAGCAGGAGGACGTTGCTGTGGCTGCTG 

CAGTCTTGGAGCCCCTGCTCAAGCTGGCCCTGCTGGCCGGCCTGACCATCACTGTTTTTG 

GCTTTGCCTATTCTCAGCTGGCTCTGGATATCTACGGAGGGACCATGCTTAGCTCAGGAT

CCGGTCCTGTTTTGCTGCGTTCCTACTGTCTCTATGTTCTCCTGCTTGCCATCAATGGAG 

TGACAGAAGTGTTTCACATTTGCTGCCATGAGCAAAGAGGAGGTCGACAGGTACCT 

Sequence 1555 

CCGCGGTGGCCCGCCCGGGCAGGTACAGCAAAAAAGAAACTGAGAAGCCCAAACTGCTTT 

CTTGTTAACATCCACTTATCCAACCAATGTGGAAACTTCTTATACTTGGTTCCATTATGA 

AGTTGGACAATTGCTGCTATCACACCTGGCAGGTAAACCAATGCCAAGAGAGTGATGGAA 

ACCATTGGCMGACTTTGTTGATGACCAGGATTGGAATTTTATAAAMTATTGTTGATGG 

GAAGTTGCTAAAGGGTGAATTACTTCCCTCAGAAGAGTGTAAAGAAAAGTCAGAGATGCT 

ATAATAGCAGGTATTTTAATTGGCAAGTGCCACTGTGGAAAGAGTTCCTGTGTGTGCTGA 

AGTTCTGAAGGACAGTCAAATTCATCAGCATGGGGCTGTTTGGTGCAAATGCAAAAGCAC 

AGGTCI I I II AG 

Sequence 1556 

ATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGTCCACATCCCGAAGGCCATAGCGTC 

GAGCAAGGTCACATGGATGGCAGCACCTTACCACTCAGGCTCAGGATATTGGGATCTGTT 

GCCAAAGCCACCACACATTTGCCACTCAATTCTGTGGTTTCCGCAGATGAGAAGGCTGAT 

TTGAACTGCTTCMCACAGGATCCTGCAGGACCTCCTCCTTTGCCATATGCTCCTTCAGC 

AGTTCTGTCTGCACAATCCCCGGCCACAGAGACACACAGCTGACCCCATGGCGCCGCAGC 

TCGTGGGCACAGTCAGCAGCCAGCTTGTCACACGCAGCTTTGCCCACACCATAGGGGACA 

TTGAACATATACTGCAGGCTTCCTGGGGAGGAGATGACCACGATNAGCCCCTGGCCAGCT 

GGTACCT 

Sequence 1557 

CTACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGCAGGTACGCGGGGGGT 

CCTGGTGCTGCGGCCACACCACCCTGGGGTGCTCATTGACAGAGCTGCCATAATGAACTT 

GAAAGGACGGGAATCACCGAGGGAAGCTGGGGCTCCCCTGCCCACAGGAGAGGATCCCCG 

TTCTTCMGCTTCTCTGCTCAGTGTCTACTAACGACCGACATTTGCTAATGTAAATAATA 

GTAAATTATTGAGAATTCTMTTCTTTTACACAGTCTGTTTTTMTCTATTT^ 

TAAAATCTATGACT 

Sequence 1558 

CCGCGGTGGCGGCCGGGGGCCATTGAGACTGCCATGGAAGACTTGAAAGGTCACGTAGCT 

GAGACTTCTGGAGAGACCATTCAAGGCTTCTGGCTCTTGACAAAGATAGACCACTGGAAC 

CAATGAGGAAGGAGAGAATTCTACTGGTCACAGACAAGACTCTCTTGATCTGCAAATACG 
ACTTCATCATGCTGAGTTGTGTGCAGCTGCAGCGGATTCCTCTGAGCGCTGTCTATCGCA

TCTGCCTGGGCAAGTTCACCTTCCCTGGGATGTCCCTGGACAAGAGACAAGGAGAAGGCC 

TTAGGATCTACTGGGGGAGTCCGGAGGAGCAGTCCCTTCTGTCCCGCTGGAACCCATGGT 

CCACTGAAGTTCCTTATGCTACTTTCACTGAGCATCCTATGAAATACACCAGTGAGAAAT 

TCCTTGAAATTTGCAAGTTGTCTGGGTTCATG 

Sequence 1559 

TACT T AGGGCGAA7T GGAGCTCCCCGCGGTGGCGGCCGAGGTACCTCCAGGCACAGCTAA 

TTTTACI I I I IATTTTCTTGAGACAGGTCTCACTCTGTTGCCCAGGCTGGTCTTGAACTC 

TTGGCCTCAAGTGATCCTCCAGCCATGAGCCACTGTAAATGTCTGGGAATGCCAACTTGA 

AGCCCAGCTGGTCAGAAGTTCTAGAGGCCCAGACTTGCAACTGGTGTCTGAAGGGATGGC 

AGTCTTGGGGACTGAGCCCTCAACCTGAGGGATCTGATGCTATTTCCAGGCAGATAGTGT 

CAGAATTAAATTGGAGGATACCGCAGTTAGTGTCTGCTGCAGAACTGATTGCTTGCTTGC 

TGGTAGGGGGGAAATCCCCACATATTGGGGGGGTATTTGAAGTCTTTTGGTGTTGACTCT

TACTGNGTTTNGTTGGTTTTTCATGTGAAGAGCAGAGGGAAAAGCAAGGGA 

Sequence 1560 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTACTGACCCAACAACTGTGACTA 

GCTGGCCACGTCATTCAGGGCTGGTGTGGCATTTATGTGTGTGTGTGTGTGTGTGTGTTT 

TTCCTGTTTGCCCAGCAGTGCATTGTGGGTTCCAAGAGTGGGTAGTGTGTGTATGTGTGT 

GTGTCAGAGGGAGACCTGGCAGGCACCTNTTTGAGAGTAGCTGTGGTCAGAGCTGTTTGG 

TCAGTGCATTATGTTGAATGAGGTCCAGGAACCCAGAGCCACCCAGCAGACACCACTGTG 

GCTTGCCAGCTGCCAAGATGGAGAAGCATGTGCCCCTGTAGAGCGTCTTCCCAGAACCAG 

ACCCCGAGCCACTCGCTTCCTCTGTGCTGNGACAACATTGGTGCCAGGGGAGATNGTTNT

TTTTTCAAAGGGGACCTACTGTAGCCCCTTTTAAT 

Sequence 1561 

CCACTCACTATAGGGCTGAATTGGAGCTCCCCGCGGTGGCGGCCGGAAGAGCAACCGAGA 

TGAAGGTGAAGATGCTGAGCCGGAATCCGGACAATTATGTCCGCGAAACCAAGTTGGACT 

TACAGAGAGTTCCAAGAAACTATGATCCTGCTTTACATCCTTTTGAGGTCCCACTGAGAA 

TATGTAAGAGCTTTAAATGCTACCAAACTGGAACGAGTATTTGCAAAACCATTCCTTGCT 

TCGCTGGATGGTCACCGTGATGGAGTCAATTGCTTGGCAAAGCATCCAGAGAAGCTGGCT 

ACTGTCCTTTCTGGGGCGTGTGATGGAGAGGTTAGAATTTGGAATCTAACTCAGCGGAAT 

TGTATCCGTTACCT 

Sequence 1562 

GGGCGNGTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCTTCCTGCCTTCCCCCATATA 

CTGAAGTTTGAGAGGCTGGGAAGGTGCGGAATGGGAAAAGGAGCAGCTGCTTATGTTCAG 

TTTMCATTCTCTGGGTTTCTCCATCTAGGTCTTGAGTTCATTCTCTTCCTGTCCTTTTG 

GCTTCCTTGTTTAACCTGGTCCCTGTTTCAGGAGAGAAGCCTCATCAGTGCCAAGTCTGT 

GGGGAAGACCTTCTCTCAGAGTGGAAGCAGGAATGTGCATATGAGAAAGCATCACCTGCA 

GCTGGGAGCAGCTTGGGAGTCAAGAGCAGGAGCAAACTGCTGAGCCACTAATGGGGCAGT 

AGTTTGCTTG 

Sequence 1563 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCCTTCGTAAACCATGGAGAGC 

CAGCCCAATGCACAGCAGTGGATATCATCTTTCTCAGAGTCCAGTATCACAGAATCACGA 

CTTTGTCCAGCTGCAGGTGCCTGCAGGTCACACTGGCTAACTACTTTGTGATGGGCTCTT 

CTTTCTGAGGTTCTGCCAACTTGTCTACTACATAGGGTTGATCATCCTGTTCAGGAAATA 

TTTCTTTCATTTGCTCTGAGCTTAATATTGTMTTTGTATTTGATCTGCTGGGTCTTTGG 

AGTCAGGACTGGTTTTATCAGCAGTTTGATCTTCTGAGGTCTGGTATGTAGTTTGCTGGC 

CCACAGAACCTTCACGTGTATTCACAGCCTCAATGCCATAAGGAAACTCTTTTAGAAGTT 

CTGACAGCTGGTCATGTAGGTATAAGACAGGTGCCTTATCACTGTGGATTTCATTTCTTG 
Sequence 1564 

CCGCGGTGGCGGC CGAGG TACAMTTGTCGTTTTTATTCCTCTTATTGGGATATCATTTT 
AAAAACTTTATTGGGTTTTTATTGTTGTTGTTTGATCCCTAACCCTACAAAGAGCCTTCC 





TATTCCCCTCGC5TGTTGGAGCAAACCTTATACCTTACTTCCAGCAAGCAAAGTGCTTTGA 

CTTCTTGCTTCAGTCATCAGCCAGCAAGAGGGAACAAAACTGTTCTTTTGCATTTTGCCG 

CTGAGATATGGCATTGCACTGCTTATATGCCAAGCTAATTTATAGCAAGATATTGATCAA 

ATATAGAAAGTTGATATTCAACCTCACAAGGGCTCTCAAAGTATAATCTTTCTATAGCCA 

ACTGCTMTGCAAATTAAMCATATTTCATTTTAACATGATTTCAAAATCAGTTTTTCAT 

ACTACCCTTTGCTGGAAGAAACTAAAAATATAGCAAATGCAGAACCACAAACAATTCGAA 

TGGGGTAGAMCATTGTAMTATTTACTCTTTGCAAACCCTGGTGGGTATTTTATTTTGG 

CTTCATTTCAATCATTGAAGGTATATTCTTATTGGAAATGTACCTGCCCNGGCCGGCCGC 

TCTAGAACTAGTGGAATCCCCCNGGC 

Sequence 1565 

ACTACTATAGGGGCGAATTGGAGCTCCACCCGCGGTGGCGGCCGCCCGGGCAGGTACTTT

TTGGTGTTTTTTGGAAAAGTTACATTTAGATCTATTCTGAAGCTGTTCATTTTTAACAAA 

TAAAATGTTACAGGTTTCACATGATTTATTCGCAAAAAAAAAAAAAAAAAAAAGTACTGT 

CTGTGAACAGCAAGGAAAATATGATTACACCTAAGAGATGAAATTGGCATAGGCGAGAAG 

TCAGAAAAATAATCTATACAGCTTGCATGGTTGGGGAGTTAGGAGAGGCCAAGGCCACGT 

GCACGTAGAGCAAGAGGTAGAAGAGGCCCGGGGGCTAGAGCGCACCCTGGTGGATAGTGT 

GAGAATTTCACACTGGCTCAAGCCTTGAAGACCACCCCAGGGGTGCGCCTTAGCAACGCA 

CTTATGCAAGACCCCAACAACTGGCCCTTGAAAGGAGCTTTTCACTGGTGGGATGTGGCC 

CTGCTTGTATTCAGAACCATAGTTTTAACAAGCCANCATTAAATCCACAAGTCTTTGCCA 

MGCACTTTAAGCCTNTTGACATTTATTGGAATTAATTTACCTGCAAGGAAAGTTCATAT 

ATCTAGCTTTGGTAACCCTACATTCGGGAAAAATGTTTCCATGANATAACTAAAANCCCC 

ATGAATGATACAATCTTGGACAAAACCCAAAGNNGGCATAATTAGCATAAAACTCCAAAT 

Sequence 1566 

CCCGCGTCCGGCATCTCCCAACGTGACTGACCCGCCGACCACGACCCGCAAAGTGGTCCC 

GACGACGCTCACCACCACCAAGCCGCCAGAAACCTGTGAGAGCTTCAACAGCTGTGTTTC 

CTGTGTCAACGCCACCTTGACTAATAATATTACCTGCGTCTGGCTAGATTGCCATGAAGC 

AAATAAGACCTATTGTTCAAGTGAATTAGTAAGTAATTGTACCCAGAAGACXJAGTACTGA 

CTCCTGTTCTGTAATACCTACCACCCCAGTGCCAACCAATTCTACAGCTAAGCCTACAAC 

TCGGCCTTCCTCTCCTACACCTACTCCCTCAGTTGTCACATCAGCAGGTGCAACAAATAC 

CACTGTGACTCCAACCTCACAGCCTGAGCGGAAGTCCACCTTTGATGCAGCCAGCTTCAT 

TGGAGGAATCGTNCTTGTCTTGGGTGTGCAGGCTGTMTTTTCTTTCTCTATAAATTCTG 

CAAATCTAAAAGAACG 

Sequence 1567 

TCGCCNCGCGTCCGGGCAACTGCAGTTGGAAAAAAAGATTCAACTTCAAAGCAGAGGATT 

TTTGATGAAGAACCAGCTAATGGAGTGAAGATAGAAAGGTTTACAAGGGATGATCCTTGG 

TTATCTTCATGTGAAGAAGTGGATGATTGTAAAGACCAGTTGGAGAAGCAACAGGAAAAA

CAAGAGATACTTTTGCAGGAAGTGGCATTCACTCAAAGGAAAGCAGTTATTCATGAGAGA 

GTCTGCAAAAGTGATGAAACTGGGGAGAAGAGTGGTCTGAATTCCAGTCTATTTTCATCC 

CCAGTTATACCCATAAGAAACCATTTTCATAAACATGTATCACATGCTAAAAAATGGCAT 

CTTAATGCTGCTGTAAACAGTCATCAGAAGATTAATGAGAATGAGACACTATATGAAAAT 

AATGGAATGTGGAAAACCCCCTCAGAGCATTCACCTTATTCAGTTTACAAGAACCTCAAA 

CA 

Sequence 1568 

GCTCCATGCCCTTCTCTGAGACGGGGACCAGGGGATGGCAGNCATGCACCTGACAGCCTG 

GCCCNAGAAGTCGGTGACCTTTGAGGACGTGGCTGTGTACTTCACCCAGGCGGAATGGGA 

TGGCCTGTCCCCTGCACANAGGACCCTGTACAGGGATGNGATGCTGGAGAATTATGGGAA 

TGTGGCCTCCCTGGGATTTCCACTTCTCAAACCTGCTGTGATCTCACAACTGGAGGGAGG 

AAGTGAGCTGGG 

Sequence 1569 

CGCGTCCGTTTCTCCTGGCACCCTGTATTCATGGCCTTGGCGTTCTGCCTCTGCATGGCT 
GAAGCCATCCTACTCTTCTCACCTGAACACTCCCTGTTCTTCTTCTGCTCCCGAAAAGCA 
CGGATCCGGCTCCACTGGGCAGGGCAGACCCTAGCCATCCTCTGTGCAGCTCTGGGCCTG

GGCTTCATCATCTCCAGCAGGACCCGCAGTGAGCTGCCTCATCTGGTGTCCTGGCACAGC 

TGGGTGGGAGCCCTGACACTGCTGGCCACTGCTGTCCAGGCACTGTGTGGGCTCTGCCTC 

CTTTGTCCCCGGGCAGCCAGGGTCTCAAGGGTGGCTCGCCTCAAGCTCTACCATCTGACA 

TGTGGACTGGGTGGTCTACCTGATGGCTACAGTAACGGTGCTTCTGGGCATGTACTCAGT 

ATGGTTCCAGGCCCAGATCAAAGGTGCGGCCTGGTACCTGTGCCTG 

Sequence 1570 

CGTCCGCTAAGTTCCAATATTGAAAAATCTGTAAAAGACCTCCAGCGCTGCACAGTGTCT 

CTTGCACGGTATCGAGTTGTAGTTAAAGAAGAGATGGATGCCTCCATTAAGAAAATGAAA 

CAAGCCTTTGCTGMTTGGAGAGCTGTTTAATGGATCGAGAAGTGGCGTTGCTTGCTGAA 

ATGGACAAAGTGAAAGCTGAAGCAATGGAAATTTTGCTCAGCCGACAAAAGAAGGCTGAA 

CTTCTAAAGAAGATGACTCATGTGGCTGTTCAAATGTCAGAGCAGCAA 

Sequence 1571 

GCAGCCGGCCATGCAGGCCGTCATCCGAACTGACATGGACCAAATTATCACCTCCAAGGA 

GCACCTTGCCAGCAAGATCCGAGCCTTCATCCTCCCCAAGGCAGAGGTGTGCGTGCGGAA 

CCATGTCCAGCCCTACATCCCATCCATCCTGGAGGCCCTGATGGTCCCCACCAGCCAGGG 

CTTCACTGAGGTGCGAGATGTCTTCTTCAAGGAGGTCACGGACATGAACCTGAACGTCAT 

CAACGAGGGCCGGCATTGACAAGCTGGGCGAGTACATGGAGAAGCTGTCCCGGCTTGGCG 

TACCACCCCCTGAAGATGCAGAGCTGCTATGAGAAGATTGGAGTTCGCTGCGACTGGACG 

GGCTGCAGC 

Sequence 1572 

CCGMCAANGTGGCCACCCAGGTTTTTAACCCAAGTCTAGTGGTCATCCTATTCTTTCCA 

CACCAACATGCCAAAAGCCTTACCTNGAAAGAAAATATAATTTGCAAGAAGCATCACAGT 

GCCGGGGTCTATATTCTCGATCAGGTTG^AATTTTCCCATGGGTTTTTTGACTGATAA^ 

GNCATTGATCTGCTTCTGAGCCATTTCCAMTTCTGAMGTTGGTAAGGATGGTTTCGGN 

ACTGTAAAAGTTCTTGGCATCTTCC 

Sequence 1573 

CGCGTCCNGGNCGGAGAAGACAGTAGGGATACTGGATATGGGAGGAGCCTCTCTCCAAAT 

TGCTTATGAAGTTCCTACCTCAACCTCTGTCCTTCCTGCAAAGCAGGAAGAAGCTGCCAA 

GATCCTGCTGGCTGAGTTCAACCTGGGCTGTGATGTGCAACACACTGAACACGTGTACAG 

GGTTTATGTCACMCTTTTCTGGGTTTCGGAGGCAACTTTGCCCGGCAGCGCTACGAAGA 

CCTTGTTCTGAATGAAACTCTTAACAAAAACAGATTGCTTGGTCAGAAGACAGGTCTGAG 

TCCCGACAATCCATTTCTGGATCCCTGCCTGCCAGTGGGACTCACAGATGTGGT 

Sequence 1574 

CGCCGTCCNGTTTACTTGGAGTGTCCAAAACTGCAAGCAGTAGAGAAATAAGACAAGCTT 

TCAAGANNNTGGCATTGAAGTTACATCCTGATAAAAACCCGAATAACCCAAATGCACATG

GCGATTTTTTAAAAATAAATAGAGCATATGAAGTACTCAAAGATGAAGATCTACGGAAAA 

AGTATGACAAATATGGAGAAAAGGGACTTGAGGATAATCAAGGTGGCCNGTATGAAAGCT 

GGMCTATTATCGTTATGATTTTGGTATTTATGATGATGATCCTGAAATCATAACATTGG 

AAAGMGAGMTTTGATGCTGCTGTTAATTCTGGAGMCTGTGGTTTGTAMTTTTTAC 

Sequence 1575 

GAGGCGCTCAACCTACCGAGGCGCCCACAACCTGTCCGGCCTGCTGGGCTTGTCCCTGCG 

CTACAACAGCCTCTCGGAGCTGCGCGCCGGCCAGTTCACGGGGTTAATGCAGCTCACGTG 

GCTCTATCTGGATCACAATCACATCTGCTCCGTGCAGGGGGACGCCTTTCAGAAACTGCG 

CCGAGTTAAGGAACTCACGCTGAGTTCCAACCAGATCACCCAACTGCCCAACACCACCTT 

CCGGCCCATGCCCAACCTGCGCAGCGTGGACCTCTCGTACAACAAGCTGCAGGCGCTCGC 

GCCCGACCTCTTCCACGGGCTGCGGAAGCTCACCACGCTGCATATGCGGGCCAACGCCAT 

CCAGTTTGTGCCCGTGCGCATCTTCCAGGACTGCCGCAGCCTTCAAGTTTCTCGACATCG 

GATACAATCAAGC 

Sequence 1576 

GACCACGCGTCCGCGCACCGCTTCATTGAGGCTGCAAGAGCACACGGGCACCCACGTGCT 
GGTCCACTGCAAGATGGGCGTCAGCCGCTCAGCGGCCACAGTGCTGGCCTATGCCATGAA

GCAGTACGAATGCAGCCTGGAGCAGGCCCTGCGCCACGTGCAGGAGCTCCGGCCCATCGC 

CCGCCCCAACCCTGGCTTCCTGCGCCAGCTGCAGATCTACCAGGGCATCCTGACGGCCAG 

CCGCCAGAGCCATGTCTGGGAGCAGAAAGTGGGTGGGGGTCTCCCCAGAGGAGCACCCAG 

CCCCTGAAGTCTCTACACCATTCCCACCTCTTCCGCXJAGAACCTGAGGG 

Sequence 1577 

CTACACTCAACTTCACCATCTCCAATCTCCAGTATTCACCAGATATGGGCAAGGGCTCAG 

CTACATTCAACTCCACCGAGGGGGTCCTTCAGCACCTGCTCAGACCCTTGTTCCAGAAGA 

GCAGCATGGGCCCC7TCTACTTGGG7TGCCAACTGATCTCCCTCAGGCCTGAGAAGGATG 

GGGCAGCCACTGGTGTGGACACCACCTGCACCTACCACCCTGACCCTGTGGGCCCCGGGC 

TGGACATACAGCAGCTTTACTGGGAGCTGAGTCAGCTGACCCATGGTGTCACCCAACTGG 

GCTTCTATGTCCTGGACAGGGATAGCCTCTTCATCAATGGCTATGCACCCCAGAATTTAT 

CAATCCGGGGCGAGTACCAGATAAATTTCCACATTGTCAACTGGAACCTCAGTAATCCAG 

ACCCCACATNCTCAGAGTACATCACCCTGCTGAGGGACATCCAGGACAAGGTCACCACAC

TTTTACAAAGGCAGTCAAACTACATGACACATTCCGCTTCTGCCTGGTCACCAACTTGAC 

GATGGACTTCCGTGTTGGTCACTGTCAANGCATTGGTCTTCTTCAATTTG 

Sequence 1578 

GCGGCCGCCGGGCAGGTACCTAACCTACCTTTAAGACTGGGATAACTATTGNNNTNCAAT 

AGNTTTATACCGGATATAGTTATTTATCGCATGATGAGTAATAGAAAGGAGCTTCACAGC 

TTCACTTAAAAATGGGGGTGCGGAACATTAGTTAGTTGGTAGGGTAATGGCCTACCAAGA 

CGATGATGTTTAGCCGGGCCGAGAGGCTGTACCT 

Sequence 1579 

CTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCTAACCTACCTTTAAGACTGGGATNTCT 

ATTGNTAACAATAGCTAATACCGGATATAGTTATTTATCGCATGATGAGTAATAGAAAGG 

AGCTTCACAGCTTCACTTAAAAATGGGGGTGCGGAACATTAGTTAGTTGGTAGGGTAATG 

GCCTACCAAGACGATGATGTTTAGCCGGGCCGAGAGGCTGTACCT 

Sequence 1580 

CTCCCCGCGGTGGCGGCCGAGGTACAGCCTCTCGGCCCGGCTAAACATCATCGTCTTGGT 

AGGCCNTTNCCCTACCAACTAACTAATGTTCCGCACCCCCATTTTTAAGTGAAGCTGTGA 

AGCTCCTTTCTATTACTCATCATGCGATAAATAACTATATCCGGTATTAGCTATTGTTTC 

CAATAGTTATCCCAGTCTTAAAGGTAGGTTAGGTACCTGCCCG 

Sequence 1581 

TTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTAACCTACCTTTAAGACTGGGATAACTA 
TTGTTNAACAATNNCTAATACCGGATATAGTTATTTATCGCATGATGAGTAATAGAAAGG 
AGCTTCACAGCTTCACTTAAAAATGGGGGTGCGGAACATTAGTTAGTTGGTAGGGTAATG 
GCCTACCAAGACGATGATGTTTAGCCGGGCCGAGAGGCTGTACCTGCCCG 
Sequence 1582 

AGGTACAAGCCTCTCGGCCCGGCTAAACATCATCGTCTTGNTAGGCCNTTACCCTACCAA 

CTMCTMTGTTCCGCACCCCCATTTTTAAGTGMGCTGTGMGCTCCTTTCTATTACTC 

ATCATGCGATAAATAACTATATCCGGTATTAGCTATTGTTTCCAATAGTTATCCCAGTCT 

TAAAGGTAGGTTAGGTACCTGCCCG 

Sequence 1583 

CCGCGGTGGCGGCCGAGGTACAGCCTCTCGGCCCGGCTAAACATCATCGTCTTTTTNGGC 

CATTACCCTACCMCTAACTMTGTTCCGCACCCCCATTTTTAAGTGAAGCTGTGAAGCT 

CCTTTCTATTACTCATCATGCGATAMTMCTATATCCGGTATTAGCTATTGTTTCCAAT 

AGTTATCCCAGTCTTAAAGGTAGGTTAGGTACCTGCCCG 

Sequence 1584 

T CTTC GANCACGNTTCGGGCGGCTTTTCCCCGGGCAAGGCTTCTAAATCGGGGGGGCTTC 
CTTTTAGGGGGTTCCGAATTTAAGTGGCNTATAACGGGCANCCTTCGAACCCCCAAAAAA 
AACTTTG 
Sequence 1585
AGGTACAGCCTCTCGGCCCGGCTAAACATCATCGTCTNGGTAGGCCATTACCCTACCAAC

TAACTMTGTTCCGCACCCCCATTTTTAAGTGAAGCTGTGAAGCTCCTTTCTATTACTCA 

TCATGCGATAAATAACTATATCCGGTATTAGCTATTGTTTCCAATAGTTATCCCAGTCTT 

AAAGGTAGGTTAGGTACCTGCCCGG 

Sequence 1586 

CGGGCAGGTACCTAACCTACCTTTAAGACTGGGATAACTATTGGAAACAATAGCTAATAC 

CGGATATAGTTATTTATCGCATGATGAGTAATAGAAAGGAGCTTCACAGCTTCACTTAAA 

AATGGGGGTGCGGAACATTAGTTAGTTGGTAGGGTAATGGCCTACCAAGACGATGATGTT 

TAGCCGGGCCGAGAGGCTGTACCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTCGCnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGCCCAACGT 

GMCCGGTTACACTTTGCCMGCGCCCTANCGCCCGGTCCTTTTCGCTTTTCTTTCCTTT 

CCTTTTCTTCGGCCACGGTTCNNCCCGG C I 1 1 1 1 CCCCGTCAAGGCTTNTAAATTCGGGG 

GGCTTTCCTTTTAAGGGNTCaDGATTTAAGTGGCTTTANCNGGNAACCTTNGAACCCCCA 

AAA 

Sequence 1587 

AGGTACAGCCTCTCGGCCCGGCTAAACATCATCGTCTTGGTAGGCCATTACCCTACCAAC 

TAACTMTGTTCCGCACCCCCATTTTTAAGTGAAGCTGTGAAGCTCCTTTCTATTACTCA 

TCATGCGATAAATMCTATATCCGGTATTAGCTATTGTTTCCAATAGTTATCCXJAGTCTT 

AAAGGTAGGTTAGGTACCTGCCCGG 

Sequence 1588 

CGGGCAGGTACCTAACCTACCTTTAAGACTGGGATAACTATTGGAAACAATAGCTAATAC 

CGGATATAGTTATTTATCGCATGATGAGTAATAGAAAGGAGCTTCACAGCTTCACTTAAA 

AATGGGGGTGCGGAACATTAGTTAGTTGGTAGGGTAATGGCCTACCAAGACGATGATGTT 

TAGCCGGGCCGAGAGGCTGTACCT 

Sequence 1589 

TACCNCGCGTCCGGGGCCCGGATGCTGGGGGCCACCAGGGCCCCGGGGATGTGCTGGTCT 

TCATGGATGCCCACTGCGAGTGCCACCCAGGTTGGCTGGAGCCCCTCCTCAGCAGAATAG 

CTGGTGACAGGAGCCGGGTGGTATCTCCGGTGATAGATGTGATTGACTGGAAGACTTTCC 

GGTATTACCCCTCGAAGGACCTGCAGCGTGGGGTGTTGGACTGGAAGCTGGATTTCCATT 

GGGAACCTTTGCCGGAGCATGTGAGGAAGGCCCTCCAGTCCCCAATAAGCCCCATCAGGA 

GCCCTGTGGTGCCCGGAGAGGTGGTGGCCATGGACAGACATTACTTCCAAAACACTGGAG 

CGTATGACCCTCTTATGTCGCTGCGGGGTGGTGAAAACCTCGAACTGTCTTTCAAGGCCT 

GGCTCTGCGGTGGCTCCGTTGAAATCCTTCCCTGCTCTCGGGTAGGGCACATCTACCGAA 

ATCAGGATGCCCCGTCCCCGTTTGACCAGGAGGCCACCTTGAGGAACAAGGTTCGCATTG 

CTGAAGACCTGGCTTGGGGTCA 

Sequence 1590 

GGCGMTTGGAGCT(XCCGCGGTGGCGGCCGAGGTACTGTGATATCCACATATTTTTGAG 

AAAAATTCCCAAGCCAGGCGAATGTGGATTGGAATAAAGACATAGGCAGTGTATACCACC 

ATAGCAATAATGGTTAGTAAGATGGTGTTAAACATAGATCGCTCCCAGGGCTCTAAAACA 

GCACAGCAGCTAATGATTTGGTATTGATAGTAGAGCCAGGAGAAATATTCCTTCACACGC 

CTCAMTCCATGGTTGGCTCCTTCAAGCTGCAGTAAGTTTGTCCTAAGAAAGTCCAGGTC 

TGGTTCTTCAGCCTTGCTCCTTC 

Sequence 1591 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACATTTTGAATATCAATTTCTAAATATTTACCC 
AAAAATGTATATTTAAAG 1 1 1 1 1 AACAACTTCTTTTCAATCTTGAAAAAGTTC CCTCATT 
III I CI 1 1 I AAAATCCCATTAACAATCAATGGTTCTTTT AAAATAGAATAAATATTTTTT 
TGTCCATTTAAAGMGAATATGGATCTTGGAAAATCATTTGAATATTTTTTCTTAAAAAT 
AGATTTTG I 1 1 1 I I I GTTAATTTGTTTCCAGAAATAACTTCATCTAATAAAGTAACTTGA 
CCACCGTAAGATTCATATMTCTMTCAMGATCTACCMCTGGTTGTTTTACCCTGGAT
CCAGATTCACCMTTAMCCTAAMTTTTTCCTTGTTGGATCTAAMTTTTACCGTTATC 
AACAGCTTTGACAACCCAAANTTATTGGAAAAATA I Mill I AAGCTTTGAACTTTTAAG 
GGTGGGTTAACTTGCATTA 
Sequence 1592 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGC

GTTTCCCTGCTATGTTTTGTCACACATCATTGCTGGGAGAGTTCAGTGCTATCCMGTGA 

CTCCCCGGGGAGAGGACAACTGGAAATTTATGCCTGGTGTCTTCCTGGACTCCGACCTAT 

GCACCTTTTTCCTTGGATGATTTTAACCTGTCTCCTTTTGCTGCAATGAACCATAACCAT 

GAGTATATCAGCTTTTCTAATTTTTGTGAAGTCCCTCAAGTGAATTATCAAACCTAAGGG

TAGTTTTGGGGACCCCCATCACAGAAGAGGTCAAACAGGGGGCAGG 

Sequence 1593 

CCGCGGTGGCGGCCCGCCCGGGCAGGTCTCTTGTCTAGTATACTCAAGGCAGCCTAGTAA 
ATTATTA7TTATCTATACAATACTGGAAAMCTTGTAGACAAAAACATGACTTGAATTGC 
TAAAAAAAAAAAAAAAAAAAGAGGGAGAATGAAAACTTCCGGACGCGTGGGTCGAAGCTT 
GACCT 

Sequence 1594 

CCGCGGTGGCGGCCGCCCGGGCAGGTAGGCTGTCTACACTGACATCATCCAGGGCAAGCT 

GGACCAGCGAAACCAGCTGCTGGAAGTGGATTTCTGCATTGGCCGTGACATCCGAAAGAA 

GGATATCAATAATATTGTCAAGACCCTGCATGAATGGTGTGATGGCTGTGAAGCAGTTCT 

ACTGGGCATNGAGCAGCAAGTTCTGAGAGCCAACCAGTACCTT 

Sequence 1595 

CCGCGGTGGCGGCCGCCCGGGCAGG I II I I I ll M I I l l I I CTTTCTGTTCCTTGGACTA 
GATMTCTGAAATCMCTGTCTTCAGTTTTGCAGACTCTTGTGCCAGCTAAAATGTTCTG 
TTGAG CCCCAGAAGCTAATTTTCTTTTCAGTTATTATGATTTTCAGCTTTAG 
TTTTTMTA TMTTTCTA CCTCTTTTTTATATTCTCCA 

TTCTTCCAG I I I I I I 1 1 AGACGTAGTTTCCTTGAGTTCTTTGAGCATATTTAAAATAGTT 
GATTTA AAGTAT TTGTCTAGTTACTCCACCATCTGAGTTTCCTCAGGGAAMTTTCTATT 
GCCTCCTTTTTTCCTGTGTGTGGTCGGNCCATACGGACGCGTGGGTCGAAGACCT 
Sequence 1596 

ACTTNNTN I I I I I I I I I I I I I I I I I 1 1 1 AAGCGCCCGGCATTTTCTAAATAAAATCATTT 
TATTTGGNAAMGGGTTTTMCAGNTATACCTTTCTAGCTAAAAGAAAAGAMTAGCGGG 
ATGTACCT 
Sequence 1597 

AGGTACGAAGAGAAAGGAATCAAAGCCTACTANCTCAAAAAATTGTCAAATTGCAAATGA 

GGACATCTAGAGAGGAAGAAAGGAAAAAAGGAACTAAAAAACAGAAACAATTAACAGTAA 

GTTCTTAACTATCAATAATTATTTTAAAAGTAAATAGATTAAATTATCTAATCAAAAGAC 

ATTGAATGGCTGAATGGATTAAAAAACAAGATCAACTATACATTGCCCATCAGAGATTCA

TTTTAGCTTTAAGGATAAACACTGGTTGAAAGTGAAAAAAGTCAAGGCTGGGCATGGTGG 

CTCATGTCTATAATTCCAGCACTTTGGGAGGCCAAGGTGGGCAGATAATCTGAGGTCAGG 

AGTTTGA 

Sequence 1598 

CCGGGCAGGTACCACCTGAAGACCCTCACACTCAACTTCACCATCTCCAATCTCCAGTAT 

TCACCAGATATGGGCAAGGGCTCAGCTACATTCAACTCCACCGAGGGGGTCCTTCAGCAC 

CTGCTCAGACCCTTGTTCCAGMGAGCAGCATGGGCCCCTTCTACTTGGGTTGCCAACTG 

ATCTCCCTCAGGCCTGAGAAGGATGGGGCAGCCACTGGTGTGGACACCACCGGCACCTAC 

CACCCTGACCCTGTGGGCCCCGGGCTGGACATACAGCAGCTTTACTGGGAGCTGAGTCAG 

CTGACCCATGGTGTCACCCAACTGGGCTTCTATGTCCTGGACAGGGATAGCCTCTTCATC 

AATGGCTATGCACCCCAGAATTTAT 

Sequence 1599 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACACAGGACCAATGCTGCC 
GGATGATTCTGTGGAGAGGCTGTATAACATGCTCGTGGAGACGGGGGAGCTGGAGAATAC

TTACATCATTTACACCGCCGACCATGGTTACCATATTGGGCAGTTTGGACTGGTCAAGGG 

GAAATCCATGCCATATGACTTTGATATTCGTGTGCCTTTTTTTATTCGTGGTCCAAGTGT 

AGAACCAGGATCAATAGTCCCACAGATCGTTCTCAACATTGACTTGGCCCCCACGATCCT 

GGATATTGCTGGGCTCGACACACCTCCTGATGTGGACGGCAAGTCTGTCCTCAAACTTCT 

GGACCCAGAAAAGCCAG 

Sequence 1600 

TCNCCGCGGTGGCGGCCGCCCGGGCAGGTACGTTCACTGTCTCATATAATCNCAGCCTCC 

TGTGTGATAGCTGGTGTCATCTCCACTTACAGATGAGGAAACTGAGGATAAGCAGGGTTG 

AATAACTTGCTCGAGATCACAGAGCCACGGGTGGNGAAACAGGATACAAACCTGGTTCTG 

TTTGACTCTMGACCATTCATNTTTCCTCTGAAACTCAGTATTGCACAGTGTAGAAATGC 

AG 1 1 1 1 1 AAGACCTCCCAAAGTGACGTGCTGNGTCACTGCXJCATCATTAGCTANATTGAG 

TAAATTGCTGCTTAGCCCCANTTGTTTTGACAGAATCAATAGCCCTTGCTGAGGGGCCAN 

CAGCCTACGGACACAGGAGCATGCTTCATGGGCAAGACCACCATGCACACTCAGAGGGGA 

AGCCACAAGGCAACCTCCACGCCACTTAAGATTTGTAGGGCTCTGAACACATCACCAGAT

ACAGACCACCTACTTATTTTTTNCACTGTAATANCAAAGGCAGGAATCTTTTTNCTGTAG 

GGTAAAGTTTGGGGG 

Sequence 1601 

GGCAGGTACAAGGCCCCAAAGAGGAGGAATTCCTTGTAGAGGAGCTTGTAGATGCTTCCC 

CTCCAGCGGAGAAGCAGGCCAGAGAAACCTCCGAAGCGGGCCTCCGCCACTTTGAGAGTG 

TATGAAACCGTCATGGTGCTGGGAGCCTGGGGCAGGAGGTCACAAGAGTTGCCCCCAGGG 

CTGTCGTTTAGTTCTCCAGACAACCTCCCTTCCACTCTGGTCTCCACACCXJCAGCCTTCA 

CCCTGCGTCAAGTGGACAAG 

Sequence 1602 

AGGTACCAGTGGGGACTTCTGAAAGAACNNTACTNGTGTCAGTGGAAAAGCTGGCATTTT 

GGGAAATGCTGGTCTCTCTCAGTCCAGGAGTCAAGGAATATGTTGACTCTCTCTTAATTT 

TTGTAGTCTCAGAGGAAACAGACATTGATGTGGAAACAGTTGTATGCCCCATGGTGGAGG 

TGGTATCCATNGGAGCTGTGGCCTTGG7TTTTTCTGAGTCAGCTAGGACAGAGGATTGTG 

ACCCATGTCCAGAACTGGTGGTTTCCACATTAGTCGCTGCTGTGCTTGTGGAAGGATGCA 

TGGCTTCTATAGCTGTGGTGTCTTCATCTGTTGTCAGTATCTCATGTGAGGNACCTGCCC 

G 

Sequence 1603 

CCGGGCAGGTACTGTGATATCCACATATTTTTGAGAAAAATTCCCAAGCCAGGCGAATGT 

GGATTGGAATAAAGACATAGGCAGTGTATACCACCATAGCAATAATGGTTAGTAAGATGG 

TGTTAAACATAGATCGCTCCCAGGGCTCTAAAACAGCACAGCAGCTAATGATTTGGTATT 

GATAGTAGAGCCAGGAGAAATATTCCTTCACACGCCTCAAATCCATGGTTGGCTCCTTCA 

GGCTGCAGTMGTTTGTCTTAAGAMGTCCAGGTCTGGTTCTTCAGCCTTGCTCCTTCGC 

GAAATGATCCTGTGTGGGTTAGTTCTCCTCTCTGGGTTGCTG7TTCCTCA 

Sequence 1604 

AGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACCACCTGAAGGCCCTCACACTC 

AACTTCACCATCTCCAATCTCCAGTATTCACCAGATATGGGCAAGGGCTCAGCTACATTC 

AACTCCACCGAGGGGGTCCTTCAGCACCTGCTCAGACCCTTGTTCCAGAAGAGCAGCATG 

GGCCCCTTCTACTTGGGTTGCCAACTGATCTCCCTCAGGCCTGAGAAGGATGGGGCAGCC 

ACTGGTGTGGACACCACCTGCACCTACCACCCTGACCCTGTGGGCCCCGGGCTGGACATA 

CAGCAGCTTTACTGGGAGCTGAGTCAGCTGACCCATGGGTGTCACCCAACTGGGCTTCTA 

TTGTCCTGGACAGGGATAGCCTCTTCATCAATGGCTATGCACCCCAAAAT7TATCAATCC 

GGGGGCGAGGTACCTGCCCCGGGCGGGCCGCTTAAAACTAGGNGGGATCCCCCNGGCTTG 

CAGGAATTTCGATATTCAAGCTTATCGATACCCGTCCNACCTTCGAGGGGGGGG 

Sequence 1605 

CCGGGCAGGTACCACNTGAAGACCCTCACACTCAACTTCACCATCTCCAATCTCCAGTAT 
TCACCAGATATGGGCAAGGGCTCAGCTACATTCAACTCCACCGAGGGGGTCCTTCAGCAC

CTGCTCAGACCCTTGTTCCAGAAGAGCAGCATGGGCCCCTTCTACTTGGGTTGCCAACTG 

ATCTCCCTCAGGCCTGAGAAGGATGGGGCAGCCACTGGTGTGGACACCACCTGCACCTAC 

CACCCTGACCCTGTGGGCCCCGGGCTGGACATACAACAGCTTTACTGGGAGCTGAGTCAG 

CTGACCCATGGTGTCACCCAACTGGGCTTCTATGTCCTGGACAGGGATAGCCTCTTCATC 

AATGGCTATGCACCCCAGAATTTATCAATCCGGGGCGAGTACCT 

Sequence 1606 

CGGCCGCCCGGGCAGGAACNNNNTTTTTTGGGGGGGGGAAAACCNAGACGGAGCCNCGCN 

CAANGGCCCAGGCGGGAGTGNAAGGGCACCAGGGGGGGCNCACCACAAANACCGCCGCCC 

GGGNGAAAGCCACNCNCCGGCCNNAGCCNCCGGAGNAACGGGGGGAACAGGGGCAGGCCA

TNi 1 1 1 1 1 1 i i iGNGGGGGGNGNANGGGGNGGANNCNCCAGGNAAAAANCANGCNGGCCA 

GGGGGGGGGGGAACNCCNGACCTNATGANGCACCCGCCNNGGNCNCCCAAAANGCGGGGA 

NNANAGGGGNGAGCCACCGNGCCNAGCNGACGG 

Sequence 1607 

CGAGTTACCAGAAGGAGAGATCACCACCATCGAGATCCACCGCACTAACCCGTACATCCA 

GTTAGGAATCAGCATCGTTGGCGGCAATGAGACGCCACTGATCAACATCGTNATTCAGGA 

AGTNTACCGGGATGGGGCCATCGCCAGAGATGGAAGGCTCCTTGCCGGAGACCAGATTCT 

TNAGGTCAACAACTGTGATATCATGCAACGTGTCCCATAACTACGCCCGGGCTGNCCTTT 

CCCAGCCCTGCAGNACCCTGCACCTGACAGNGCTTCGGGAGCGGCNGCTTNGGCAGTCGT 
GCAAA 

Sequence 1608 

CGAGCCTTTAGATGGCGTCTCCTCAGGGGGGCCAGATTGCGATCGCGATGAGGCTTNGGA 

ACCAGCTCCAGTCAGTGTACAAGATGGACCCGCTACGGAACGAGGAGGAGGTTCGAGTGA 

AGATCAAAGACTTGAATGAACACATTTGTTTGCTGCCTATGCGCCGGCTACTTNGNGGAT 

GCCACCACCATCACAGAGTGTCTTCATACTTTCTGCAAGAGTTGTATTGTGAAGTACCTN 

CAAACTAGCAAGTACTGCCCCATGTGCAACATTAAGATCCACGAGACACAGCCACTGCTC 

AACCTNAAACTGGACCGGGTCATGCNGGACATCGTGTATAAGCTGGTGCCTGGCTT 

Sequence 1609 

GCGTCCGACGTCCCCCAGGAGAATGGTAGACACAGATGAGGAAATTGTGGAGATGGGCAC 

AAACCGCAAGGTGAAGAAAACGAACAAACACCGAGTTGATACGGATAGTCCCCGTTCCCC 

TGAGGGCCGACCCCGTGACTCCCGATGAGCGTCCAGTTGCGCCGGGCATCCCTGGGCCTC 

CCAGCGTCTTTCCCGGAGGTTCATCGCCGCACGGCGGAAAGCGCTCTCGGTTCCGCTTTC 

CGGCCCCAGCCTCCCGGGCGCCCTCGCGCGGCGGCTAACGCTGGTCCTCGGCCGGGCGCG 

CTGACGTCATCGTGCGTCAGAGTGAGCCCGGATGGGGCGGCGGGCTTCGGGAGCGCCCGG 

GCTGATCCGAGCCGAGCGGGCCGTATCTNCTTGTCGGCGCCGCTGATTCCCGGCTCTGCG 

GAGGCCTCTAGGCAGCCGCGCAGCTTNCGTGTTTGCTGCGCCCGCACTGCGATTTACAAC 

CCTGAAGAATCTTCCTATCCCTAT 

Sequence 1610 

CGCGTCCGGCGGCGGCGGCTGAGGAGGGCCCGGCCTGCGAGAGCCTCAGTGGGAGCCGGC 

TCAGCCCTCGGCCACCATGTCGGCGCCGTCGGAGGAGGAGGAGTACTGCGCGGCTGGTGA 

TGGAGGCGCAGCCGGAGTGGCTGCGCGCCNAGGTGAAGCGGNTGTCCCACGAGCTGGCCG 

AGACCACNCGTGAGAAGATCCAGGCGGCCGAGTACGGGCTGGCGGTGCTCGAGGAGAAGC 

ACCAGCTCAAGCTGCAGTTCGAGGAGCTCGAGGTGGACTATGAGGCTATCCGCAGCGAGA 

TGGAGCANCTCAAGGAGGCCTTTGGACAAGCACACACAAACCACAAGAAGGTGG 

Sequence 1611 

CGCGTTCGAGTCTGGAGACGACGTTNCGAAATGGCACCTCGCAAAGGGGAACGGAAAAGA 

AGGAATGAACAGGTCATCAGCCTTGGACCTCAGGTGGCTGAAGGAGAGAATGTATTTGGN 

GTCTGCCACATCTTTGCATTCTTCAATGATACCTTTGTCCATGTTANTGAACTTTCTGGC 

NAGTGAGTACTTCAGAAAGGCATNAAACANGCCTCAAAGGGAC 

Sequence 1612 

CCCCGCGTCCGCCCACGCGTCCGGGCTCGGCTGCACCGGGGGGATCGCGCCTGGCAGACC 
CCAGACCGAGCAGAGGCGACCCAGCGCGCTCGGGAGAGGCTGCACCGCCGCGCCCCCGCC

TAGCCCTTCCGGATCCTGCGCGCANAAAAGTTTCATTTGCTGTATGCCATCCTCGAGAGC 

TGTCTAGGTTAACCGTTCGCACTCTGTGTATATAACCTCGACAGTCTTGGCACCTAACGT 

GCTGTGCGTAGCTGCTCCTTTGGTTGAATCCCCAGGCCCTTGTTGGGGCACAAGGTGGCA 

GGATGTCTCAGTGGTACGAACTTCAGCAGCTTGACTCAAAATTCCTGGAGCAGGTTCACC

AGCTTTATGATGACAGTTTTCNCATGGAAATNNGACAGTACCTGGCACAGTGGTTAGAAA 

AGO 

Sequence 1613 

GTTNAGTNGMGTTCTCTACCATTGAATCAGTGAACTAGAAAGATCTGATTTGGCCTGGG 

ACCAGTGTTCAAGTTGGTTTGGTCTTTATTAAAAATCACAATATTCCGAAAACAAAAAAA 

CCTAGGAGATAMTGTAGAGGTATTGACTTTTCGTATCTTTTATCTTCACACTGAAACAA 

GAGCTATCCTATTTGATTATTAAAGTGAGCTATGTGTTAAGTGCCAGGACATTTCTAGCT 

TTTGTGAGAATGTGTCTACATATGAGTATAATAAACCCACATGTATACACAATTGTCTCT 

TATGTACTCCTACCTGGCAGGAGTCTTTG 

Sequence 1614 

CGCGCCGGTGGTGCGATCTCGGCTACTGCAACTTCAGCCTCCTGGATTCAGGCAACACTC 

CTGCCTCAGCCTCCCACGTGGCTGGGATTACAGGTGCCTGCCCCCATGGCTMTTTTTTG

TAu m I I GTAGAGATGGGGTTTCACCATGTTGGCTGGGCTGGTCTCACTCTCCTGACCT 

CAAGCAATCTGCCTGTCTCAGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCGCC

CCCAGCCTGAGCCI I 1 1 I I I I I I I CTAATGCATCCAAGGTTAAGGGGAAGACGCAAATAA 

CAGGACTATTCTAAAAGGAAACCTGTTTGAACTCTGTGAGATCAAGTCATCAGTCTCAGT 

ATTNCACAGGCACACCTTMTTTCATTGGTAAAAGATATATATATTTTGNCTATTTTTGN

GCTTTTGGGGGCCTATTTTGNGCTTTTTTACCTTTMTGNM 

GTGGAI II I 1 1 IACCA 

Sequence 1615 

TCGCCNCGCGTCCGTAGAAACTCACACTAGACACACGCAGTAGTCATACGTCTTCACACG 
GTTTAGGAGCTACTGGACCAACATTCTTGTTTTTGCTTT^ 

TCTGGAGCTAACTGTGGAGCAGCCAAATAGTAGCTGGCATGTTGATTCAAACCATGGGCT 

GMTTTGCTCATAGGCTGTGCATCAGACMMGCTTGAATATTTGTGTTGTATGCTTGTT

CCAACCACCGCTTGTGTGAGCATTTTTGTGGCTTGTACAGAAAGTACACTTTTAAATTGT

CTCTTGCATCACTAAAATTTTTTTAAAATGAGCATAACAACGAAAGGCATCCAGCTGACT

TTTTGATTCCMGATTATTGATTGGATTGACTTTTTT^ 

TAMTCATATGGCGAGTCAGGGMTAAAAAAGTCAAAMGGAAACAAATAGMGCTTTTT 
(kill I AAAAA 

Sequence 1616 

GGNCGAGCGCGCCTTGCGGGGGCGGTATCCCGGCGCCCTAAGACCCCACGACCNCNNGCA 
CCGGCCGNTGCTGCNAGACCCCGGCCCGNGTCGGTCCGATGTCGCCCCCCGGNCCCGGCG 
GAAACGCCTCCCTNCTGGCCAGGCTGTTCNAGCACCCGCTT

Sequence 1617

TCGCCACGCGTCCGTTNAGATGCAGAMTGMMAAAMCACCTTTGTTTTATAAATATC

AMGTACATGCTTAMGCCMGTTTTTATCTAGTTTATTCTAGTACTTAGCTTGCCTGGA 

ATAGCTAATAMTTATTCATGTATGTGCTTTTGAAMTCCAGAGCCCTATTTTTACACAC 

TTGTGTGMGTTGGCAAACATTTTGAAAAATGGAAAAAAGTTTCTAATAATTGGGAACAA 

TTACATTMTTAATATTTTGTAAAATATTGMGCTTTTAGCCCTATGTCAATTTGTAGAT 

TAAAATAAATTAATTATAGGAAAGGAAGATAACAGTGAGAAACCAAACATTAC 

Sequence 1618 

CACGCGTCCGCCCACGCGTCCGGCGCGGCGGGAGCGGGCGGNGCGAGCGGGAGGCGGCGG 

CTCACAACTGAAGCGCTGNGGCATGGNGCGCGCCTGCCTCCAGGCCGANNNGTACCTNAT 

GTTCGCCTTCMCCTGCTNrrrCTGGCTGGGAGGNNGTGTGNGGGCTGGNTGTCTGGCNTC 

CTGGCTTGGGC 

Sequence 1619 
TCGCCACGCGTCCGGACCTGGATGTGAGGGTGAANGGGCTGTGCTGCTGGGAGCCACATT

CCTCATTGACTACATGTTCTTTGAGAAGCGAGGAGGCAGCTGGGCCCTCTGCCATCACCA 

GTTAGAGGCCACCATGGTGTGAGGAGACCATCACCTCGACCAGAACTCCAGATGGTCACC 

TGCCCTGGCCCCTCCTCTGGGCAGCCCCTTTCCTCCATGTACACTGCAGGGGACAGAAGG 

GGGGCCCCATCCCTACCCTACTCCCTGGCCGCCTGCCCCTGTGGTTCCCAAGGAAGGGGG 

TATTGTATTGAGAGCCGCTCTCCTGCTACCTCCCACCACTGTCCAACAGTCCCTCGGCAC 

ACAGGCATATTAAGCTTTCACACTTTTCCCATGCACTTTTTTCCACCCC 

Sequence 1620 

GGAGTCGACCNCGCGTCCGGGGGCTTGCTGGGATCATGGCGGAGAATCACTGCGAGCTCC 

TGTCGCCGGCCCGGGGCGGCATCGGGGCGGGGCTGGGGGGCGGCCTGTGCCGCCGCTGCA 

GCGCTGGGCTCGGCGCCCTGGCCCAGCGCCCTGGCAGCGTGTCCAAGTGGGTCCGACTCA 

ACGTCGGCGGCACCTACTTCCTCACCACTCGGCAGACCCTGTGCCGGGACCCGAAATCCT 

TCCTGTACCGCTTATGCCAGGCCGATCCCGACCTGGACTCAGACAAGGATGAAACAGGCXa 

CCTATTTAATCGACAGAGACCCCACCTACTTTGGGCCTGTGCTGAACTACCTGAGACACG 

GCAAGCTGGTGATTAACAAAGACCTCNCGGAGGAAGGG 

Sequence 1621 

GTCGCCCCGCGTCCGGGGCCCGCGGGCCTCGCCTCCGCCCTCCGCCACCTCGAGCTGCGG 

TAGCAGCGACTCATGAGAGCGCGGCCGGAGGACAGATTTGATAATGGGCTGCATTAAAAG 

TAAAGAAAACAAAAGTCCAGCCATTAAATACAGACCTGAAAATACTCCAGAGCCTGTCAG 

TACAAGTGTGAGCCATTATGGAGCAGAACCCACTACAGTGTCACCATGTCCGTCATCTTC 

AGCAMGGGAACAGCAGTTMTTTCAGCAGTCTTTCCATGACACCATTTGGAGGATCCTC

AGGGGTAACGCCTTTTGGGAGGTGCATCTTCCTCATTTTCAGTGGTGCCAAGTTCATATC 

CTGCTGGTTTMCAGGGNGGNGGTACTATATTTTGNGGCCTTATATGATTATGAAGCTAG 

MCTCCAGAAAGACCTTTCATTTAAGAAGGGTGAAAGATTTCAAATAATTAACAATACNG 

AAGGAGATTGGTGG 

Sequence 1622 

TTCGGGAGTCGCCCCGCGTCCGCTTTTAGAAAAGGCCAATATACCTATCACACTTTGGAA 

GTAAAAATACACACTTTCGTGTGTACCTAAAAAAAAAATCGTTGAAAATCAAGGCCAAAG

GTAGTGCAATTTTTTCATTMGATTTAAAAAAMGGGAATGATAGTCTTTGAAAGAAAAC 

AGTAGGCATCCAGCACTGGACAAMCATGGGTATCAAAGATGAATAATCTTTGGAGATTC

TGGCAGTGTTTTCCCAGMCMGTCMGTGGAAAGTGGAGAAATTATCTGTATMTTTTG 

GACACATACAATGGCAGTTTATCAMGGGTTTTGTTCTGTGGCCTGAATTTACTGGGGTC 

CTACCTACACATTGAACATGTTTTGGCTGGC 1 1 1 1 M I I I M II I I C AACTTGCCAGTTT 

CACTTTACATGGTTAGTAATAAATGGTTTCCACGGGGTGAGTTGGGATAAMTTNTTNAA 

AACATNTTAAATTCCA 

Sequence 1623 

GGAGTCGACCNCGCGTCCGAGCCGGGCCGGGGCGATGTGGAGCGCGGGCCGCGGCGGGGC 

TGCCTGGCCGGTGCTGTTGGGGCTGCTGCTGGCGCTGTTAGTGCCGGGCGGTGGTGCCGC 

CAAGACCGGTGCGGAGCTCGTGACCTGCGGGTCGGTGCTGAAGCTGCTCAATACGCACCA 

CCGCGTGCGGCTGCACTCGCACGACATCAAATACGGATCCGGCAGCGGCCAGCAATCGGT 

GACCGGCGTAGAGGCCGTCGGACGACGCCAATAGCTACTGGCGGATCCGCGGCGGCTCGG 

AGGGCCGGGTGCCCGCGCGGGTCC 

Sequence 1624 

CGCGTCCGGGGCAGCCGCGCCCGCGGAGTTTTCCGCCCGGCGCTGACGGCTGCTGCGCCC 

GCGGCTCCCCAGTGCCCCGAGTGCCCCGCGGGCCCCGCGAGCGGGAGTGGGACCCAGCCC 

CTAGGCAGAACCCAGGCGCCGCGCCCGGGACGCCCGCGGAGAGAGCCACTCCCGCCCACG 

TCCCATTTCGCCCCTCGCGTCCGGAGTCCCCGTGGCCAGATCTAACCATGAGCTACCCTG 

GCTATCCCCCGCCCCCAGGTGGCTACCCACCAGCTGCACCAGGTGGGTGGTCCCTGGGGG 

AGGTGCTGCCTACCCTCCTCCGCCCAGCATGCCCCCCATCGGGCTGGATTAACGTGGCCA 

CCTATGCGGGGGCAAGTTCAACCAGGGACTATCTTCTCGGGAATGGCGGCCAACATTGTC 

TGGGGACATTTGGAGGGAGCCAACATGCCCAAACCTGGACCCTGGGGCCCCTGGGGGCTG

Sequence 1625

CACGCGTCCGGCGCCGCTCCCGCATCTGCACCCGCAGCCCGGCGGCCTCCCGGCGGGAGC 

GAGCAGATCCAGTCCGGCCCGCAGCGCAACTCGGTCCAGTCGGGGCGGCGGCTGCGGGCG 

CAGAGCGGAGATGCAGCGGCTTGGGGCCACCCTGCTGTGCCTGCTGCTGGCGGCGGCGGT 

CCCCACGGCCCCCGCGCCCGCTCCGACGGCGACCTCGGCTCCAGTCAAGCCCGGCCCGGC 

TCTCAGCTACCCGCAGGAGGAGGCCACCCTCAATGAAGATGTTCCGCGAGGTTGAGGAAC 

TGATGGAGGACACGCAGCACAAATTGCGCGCGCGGTGGAAGAGATGGAGGCAGAAGAAGC 

TGCTGCTAAAGCATCATCAAGAAGTGGAACCTGGCAAACTTAC 

Sequence 1626 

CCACGCGTCCGGCCGGGGGGTGCCCCCGGGACGTAGCGCCGCGGAGAGGAAGCGGCAAAG 

GGGACCATGCGGCGCCTGACTCGTCGGCTGGTTCTGCCAGTCTTCGGGGTGCTCTGGATC 

ACGGTGCTGCTGTTCTTCTGGGTAACCAAGAGGAAGTTGGAGGTGCCCGACGGGACCTGA 

AGTGCAGACCCCTAAGCCTTCGGACGCTGACTGGGACGACCTGTGGGACCAGTTTGATGA 

GCGGCGGTATCTGAATGCCAAAAAGTGGCGCGTTGGTGACGACCCCTATAAAGCTGTATG 

CTTTCAACCAGCGGGAGAGTGAGGCGGGATCTCCAGCAATCGGGCCATCCCGGACACTCG 

CCATCTGGAGATGCACATGGCTTGGTGTATTGGACGGGACCTTCCACCCACTT 

Sequence 1627 

GCCACGCGTCCGCCGCCCGCTTGCCCGTCGGTCGCTAGCTCGCTCGGTGCGCGTCGTCCC 

GCTCCATGGCGCTCTTCGTGCGGCTGCTGGCTCTCGCCCTGGCTCTGGCCCTGGGCCCCG 

CCGCGACCCTGGCGGGTCCCGCCAAGTCGCCCTACCAGCTGGTGCTGCAGCACAAGCAGG 

CTCCGGGGCCGCCAGCACGGCCCCAACGTGTGTGCTGTGCAGAAGGTTATTGGCACTAAT 

AGGAAGTACTTCACCAACTGCAAGCAAGTGGTACCAAAGGAAAATCTGTGGCAAATCAAC 

AGTCATCAGCTACGAGTGCTGTCCTGGATATGAAAAGGTCCCTGGGAAGGANGGGGGCTT 

GTCCAAGCAAGCCCTACCACTCTCAAACCTTTACGAGACCCTGGGNAGTCGNTTGGATCC 

ACCACCACTCAAGCTGTACACCGACCGCACGGAGAAGCTGAGGCTGAATGGGGAGGGGCC 

Sequence 1628 

CCTAAGGGCAACAAGGGCGGTCCTGGCCAGCCGGGCTTTGAGGGAGAGCAGGGGACCAGA 

GGTGCACAGGGCCCAGCTGGTCCTGCTGGTCCTCCAGGGCTGATAGGAGAACAAGGCATT 

TCTGGACCTCGGGGAAGCGGAGGTGCCGCTGGTGCTCCTGGAGAACGAGGCAGAACCCGG 

TCCACTGGGAAGAAAGGGTGAGCCCGGAGAGCCAGGACCAAAAGGAGGAATCGGCAACCG 

GGGCCCTCGTGGGGAGACGGGAGATGACGGGAGAGACCGGAGTTGGCAGTGAAGGACGCA 

GAGGCAAAAAAGGAGAAAGAGGATTCCCTGGATACCCAGGACCAAAGGGTAACCCAGGTN 

AACCTGGGCTAAATGGAACAACAGGGACCCAAAGGCATTNAGAGGCCCGAAGGGGA 

Sequence 1629 

AGTCGCCCCGCGTCCGCTGTGCCTGAAGGAGACTGGTTTTGTCCAGAATGTCGACCAAAG 

CAACGTTCTAGAAGACTCTCCTCTAGACAGAGACCATCCTTGGAAAGTGATGAAGATGTG 

GAAGACAGTATGGGAGGTGAGGATGATGAAGTTGATGGCGATGAAGAAGAAGGTCAAAGT 

GAGGAGGAAGAGTATGAGGTAGAACAAGATGAAGATGACTCTCAAGAAGAGGAAGAAGTC 

AGCCTACCCAAACGAGGAAGACCACAAGTTAGATTGCCAGTTAAAACAAGAGGGAAACTT 

AGCTCTTCTTTCTCAAGTCGTGGCCAACAACAAGGAACCTGGAAGATACCCTTCAAGGAG 

TCAGCAGAGCACACCCAAAACAACTGTTTTCTTCTAAAACTGGGTAGAAGCCTAAGAAAG 

ATAAACTCTGCTCCTCCTACAGAAACAAAATCTT 

Sequence 1630 

TNCGGGCCTGGTGAGCACCGCCGAGGCGCGGGCCAGCTCTTCGAGGTTGTGCGCGGGAGT 

GGCACGGCGGGCCGGGCCGAGCGAGGGGCTAACTTCAGCGGTGGCACCGGGATCGGTTGC 

CTTGAGCCTGAAATCATGACCACCCCAGGAAAAGAGAACTTTCGCCTGAAAAGTTACAAG 

AACAAATCTCTGAATCCCGATGAGATGCGCAGGAGGAGGGAGGAAGAAGGACTGCAGTTA 

CGAAAGCAGAAAAGAGAAGAGCAGTTATTCAAGCGGAGAAATGTTGCTACAGCAGAAGAA 

GAAACAGAAGAAGAAGTTATGTCAGATGGAGGCTTTCATGAGGCTCAAGATTAATAACAT 

GGGAGATGGCCCAGGGTGGGTGTCATCACTTCTGACATGATTGAAATGATATTTTCCAAA 
AGCCCAGAGCAACAAGCTTTCAGCAACACAAGAAATTCAGGAAGCTGCTTTCAAAAAGAA
CCTAA 

Sequence 1631 

CGGAGTCACCACGCGTCCGGGGCTGCCAAGGGAGGAGGAAGATGGCGGCGGGGGCGAGGT 

GAGGTGTTGGCAGTGGAAAGGGGTTCGGGCTCGGGGGGCGGGGGGACGCGGAGCGATGGC 

CCGCGCCGGCCGCAGGGGCGGATAAAAAGCCGTCGCGCTGCGGGAGTGGGCGGGAGGGAG 

AGGGGGTGTCCGAGGGCCACAAGAGTATGACGGGGCTGTACGAGCTGGTGTGGCGGGTGC 

TGCACGCGCTGCTCTGTCTGCACCGCACGCTCACCTCCTGGCTCCGCGTTCGGTTCGGCA 

CCTGGAACTGGGATCTGGCGGCGCTGCTGCCGCGCCGCCTCTGCGCGGTCCTAGCGCCGC 

TCGGCTTCACGCTCCGCAAGCCCCCGGCAGTCGGCAGGAACCGCCGTCACCACCGGCACC 

CGCGCGGGGGGGTCGTGCCTGG 

Sequence 1632 

CGTCCGTTTGTTTAATA I 1 1 1 1 1 I ICTCTCTTGAACAAAACTGAGATAATTTAGAAAACA 

GGTGCTTAATTGCAATAAAATTACTATGAAGTATATTAAAAATCACGACATTGTAAAATC 

TCACTTTAGATCATCAMGAAAACCATTGTTACTATCTGCTTTGAGCTTAGGAAAATGTA 

CAAGAGAACAAATTAAAATTGAAAAATTGATTTCACTTAGAAAAACTTCTAGGAACAGGG

TGAACCACTGATTTTAATTTGCCTAATrATCTTATGACAAGTATCAAATTAAGATGACAC 

TTAAAGGATCCTTAGCATTTAACTTAATGATGGAGAAAGAGTGCTCAATAGGACAGTTCC 

CCAGTTMGGGGTMTGGAGATGCCCATTTTCAGGAGGACCATTCTAAGAAGATATTTTT 

GGATTCATTAAAAACATTTAAATAAAAAGCCCTTCTTCAAGATTGGGAAC 

Sequence 1633 

CGCGTCCGCCGGCCTGGTGCACGCGGCGCACCGAGGCCTCCCGCAACGCCGCCGACAAGG 

AGCGGGCGGCGGGCGGCGGCGCCGGCAGCAGCGAGGACGACGCGCAGAGCCGCCGCGACG 

AGCAGGACGACGACGACAAGGGCGACTCCAAGGAAACGCGGCTGACCCTGATGGAGGAAG 

TGCTCCTGCTGGGCCTCAAGGACCGCGAGGGTTACACATCATTTTGGAATGACTGTATAT 

CATCTGGATTACGTGGCTGTATGTTAATTGAATTAGCATTGAGAGGAAGGTTACAACTAG 

AGGCTTGTGGAATGAGACGTAAAAGTCTATTAACAAG 

Sequence 1634 

CCCCGCGTCCCGGTTGGCCGGGCGGAGGTCTTCGCTGAGGCCCGGGGCGGGGTGGCGCCA 

CCCCTGATTGCGGTGCCACGGACTGCTCCTGCTGGGCGGAGAGGACAGATTTTGCAAAGC 

GGAGGCTTGCGACGGGTCCTGCAGGGGGACAGTGAGGAAAGGGCCCGCCTCGTNTCCGCT 

CCTGGGGGACCCGCAGAAATAAGAATCAAACTCCACAATGACAACCTATTTGGAATTCAT 

TCAACAAAAATGAAGAACGAGAATGGGAGTCCCGATTTAGT 

Sequence 1635 

CCACGCGTCCGGGCGGGGCCATCCAAGCAACGCTGAAGGCCTTTTCCAGCAGCTGGGAGC 

TCCCGGATTGCGTGGCACAGCTGAGGGGCCTCTGTGATGGCTGAGCTCTCTTATGTCCTA 

TACTCACATCAGACATGTGATCATAGTCCCAGAGACAGAGTTGAGGTCTCGAAGAAAAGA 

TCCATGATCGGCTTTCTCCTGGGGCCCCTCCAATTGTTTACTGTTAGAAAGTGGGAATGG 

GGTCCCTAGCAGACTTGCCTGGAAGGAGCCTATTATAGAGGGGTTGGTTTATGTTGGGGA 

GAATTGGGCCTGAATTTCTCCACAGAAATAAGTTGCCATCCTCAGGTTGGCCCTTTCCCA 

AGCACTGTAAGTGAGTGGGGTCAGGCAAAGCCCCAAATGAGGGGTTGGTTTAGATTCTGA 

CAGTTTGCCAGCCAGGCCCCACCTCAGCGTCTGTCGAACAAACAAAGNTNGGGNGGGTTT 

N 

Sequence 1636 

CCNCGCGTCCGCGGACGCGTGGGCGGACGCGTGGGCTTCTGCAGCAAGCTCAGGAGAGCT 

GCTGTCTTCCCTCCCGCCCACCAGCAACGCACCCTCTGACCCTGCCACAACTACTGCAAA 

GGCAGACGCTGCCTCCTCACTCACTGTGGATGTGACGCCCCCCACTGCCAAGGCCCCCAC 

CACCGTTGAGGACAGAGTCGGCGACTCCACCCCAGTCAGCGAGAAGCCTGTTTCTGCGGC 

TGTGGATGCCAATGCTTCTGAGTCACCTTAACTTTGAACCATTCTTTGGAATTGGCGTGG 

TATATTTAACCACGGGAGGCGTGTCTGGAAACGCAAACTATCATTAATTTCATACTAGGT 

TTGTACCGTATCTGTAGGCATTCCTGTAAATAATTCCAAGGGGGAAAACTAAACNGGGAC 
GTGGGGTTGTATCCTGCCAGGTTTGAGTGGGGGCTCACACCTAGGGTGAGAAGTCAGAAA

GCGCTTGTATTTTAAACAACCAAAAAGAATTGAAAGGGTG 

Sequence 1637 

CGTCCNATAGGCTTGCACACTTTTCTAACTACATGTTTAAGTGGCAGAGTCCAGGCTGTC 

GAGTCACGGTTGGGTTTGMTCTGACTCCACCAGTMCTTTGGTTGGAAAAAATCACTTA 

TCCTCTTTMG CTTGA TTTTATTTATTTTATTTTATGTAAGAGTGAGACAGTAGTAGC^ 

MTAGGGTTGCTTTTAAATTAGAGTGAACATGAGGCAT7TA7TCGGTGCCAGACAGATAA 

CTGCCTATAACAGGATGTGATCAGCACAAGTAACAGAAAATTAGCCTGGACGGTGGCTTA 

AGCMTGGGGMTGTTTATCTCACATAGCAAAAAGGTCTGTAAATAGGATGGTTTTAGAG 

TTGGGGTGGGGAAGCCAAAAATGTCATCAGGATTTCCTTGGAACCCGT 

Sequence 1638 

CGCGTCCGGATTAATACAACTCTTAAAAAATATAGTCAATAGGTTACTAAGATATTGCTT

AGCGTTMGTTTTTMCCGTMTTTTMTAGCTTMGATTTTMGAGAAAATATGMGAC 

TTAGAAGAGTAGCATGAGGAAGGGGAAGAAAAGAAGGGGAAGAAGATCAAAGAAGGAAAG 

AAGAAGGGGAAGAAAAGAAGGGGAAGAAGATCAAAACCCACCATGCCCCAGGCTCAGCAG 

GGAGCTGCTGGATGAGAAAGGGCCTGAAGTCTTGCAGGACTCACTGGATAGAAGTTA7TC 

MCTCCTTCAGGTTGTCTTGAACTGACTGACTCATGCCAGCCCTACAGAAGTGCCTTTTA 

CGTATTGGAGCAACAGCCGTGTTGGCTTGGCTGTTGACATGGATGAAATTG 

Sequence 1639 

CGCGTCCGGCTCCCCGCACCCCCTCGCACTCNCTCTGGCCGGNCCAGGGCGGCCTTCAGC 

CCAACCTTGCCCAGCCCCACGGGCGCCACGGAACCCGCTNGATCTCGCCGCCAACTGGTA 

GACA 

Sequence 1640 

GTCGCCACGCGTCCGGCGGCCGGGCCGGGGCAGCCGGGAAGCGGGTGGGGTGGTGTGTTA 
CCCAGTAGCTNCTGGGACATCGNTCGGGTACGCTCCACGCCGTCNCAGCCACTGCTGTGG 
TCGCCGGTC

Sequence 1641

CGTCCGCTCCTCCCGCCTGAGGCGAGTCTGGGCTCAGCCTAGAGCTCTCCGGCGGCGGCG 

CAGCTTCAGGGCAGCGCGGGCTGCAGCGGCGGCGGCGGTTAGGGCTGTGTAGGGCGAGGC 

CTCCCCCTTCCTCCTCGCCA^ 

CTCGCCTTCCTCCTCCTCGTCAGGCTCGACCCAGCTGTGAGCGGCAAAGATGGGCGGCGC 

CCAGGCCGCCGCCTGCCAGGCTGTCGGGGCGTCATGGTGCCGGCGCCCATCCAAGACCTG 

GAGGCCCTGCGCGCGCTCACGGCGCTCTTCAAAGGAGCAGCGGAACCGAGAAACAGCACC 

CAGGACTATCTTCAMAGGAGTTCTGGATATCCTAAAGAAATCTTTCTATGCTGTTGAGC 

TTGCCTGCANGAGATCCATCCCAAGTGGAAAACCT 

Sequence 1642 

ACATTTATCATGGATGCTGACCGGGAGAAAGAAAGAAAGAAACGGGAGGAGCGGGAGCGT 

AAGCGGCGGAAGGAGGAGGAGGTGCAACAGCCAAAGTTGGCAGAGGAGAGACGGCGGCAG 

AATTTACAGGAGGAAAAGGAAAGGAAGTTGGAATGCCTGCCCCCTGAACCTTCCCCTGAT 

GACCCTGAAAGTGTCAAGATCATCTTCAAATTACCTAATGATTCTCGAGTAGAGAGACGA

TTCCACTTTTCACAGTCTCTAACAGTAATCCACGACTTCTTATTCTCCTTGAAGGAAAGC 

CCAGAAMGTTTCMGATTGMGCCMTTTTCCCAGGCGAGTGCTGCCCTGCATCCCTTC 

AGAGGAGTGGCCCAATCCCCCTACGCTACAGGAGGCCGGACTTAGCCACCAGAAGNTCTT 

TTTGTTCAGGACCTMCTGACGMTGACATTTTTTTCTTTCTGTCCCCTCCTACCCCAGT 

CCCTAAAAGAAATGGGGNAAAAAGGAAACAACAGCAGTCNTAAAAAAAAAA 

Sequence 1643 

CGCGTCCGGAGGGGCTAAGMGGTTGTCCTTGCCTAATGCTCTGATCTGTAAGTGAATAG 

GGCAGMCAGTTCAGCCTTGAGGTTAGMTTTAGCAGGAGCTATCCTGACTTAATATCCA 

GTTGTGGGGTTTGCAAAACAAAACAGCTGTATGTAATCATTGCCACTAGTTCCATCTAGA 

ACTCCTTTCTAGTTTGTTATTTTTAAAATGTTTATACATAAAACCACCAAAATACATAGC 

TTCGACMGATGGAAGTTTATTTCTCTCTCCCATAACAGTGCAGTGATAGTCAGCTGGTC 
CAGGCCAGGCAAGGGGCTGGTCCATGATGTCATCAGGCACCCAGGTTCCTACTGGCTTTG

CATGTGGCCACAGTTAGCAACAAANGGAGGCTGTAAATTT 

Sequence 1644 

CGCCACGCGTCCGGTGATGCGGACCCCGGGCGGGCGCAGGGCGCGGGCTCCGGCGCCGCC 

GCTGCGTCCTCCCCGGCCGCGGGCGAGCCGCTGCAGAGGGAGCGTCGCGCCGGGGCGGAG 

TGCGGGCTTGCGCGGCAAGTGCGCGCCGAGGTCACGAAATGGATTGGAGTGAACCGGAGA 

CCCCGAAAACGGAAGCGCAGGGAGAAGGAAGAGGTGTTTGAAAAGCTTCTTCCAGACCAG 

CTGGTCTTGCTTCTGGAGCATCTCTTGGAGCAGAAGACTCTGAGCCCCCGAACTCTGCAA 

AGCCTCCAGAGGACATACCACCTCCAGGATCAGGATGCAAGAGGTTCGCCATCGGTGGTG 

TGAACTCATTGTTAAGCACAAGTTCACGAAAGCCTACAAAAGTGTGGAGAGGTTCCTTCA 

GGGAGGATCAGGCCATGGGGTGTGTACCTCTACGGGGAGCTGATGGTTGAGTGAGGACCC 
CAGAC 

Sequence 1645 

TCGCCACGCGTCCGGGACATCGAGTNCGGGCTGGCTACGAACTCCTCGGGGGCGAAGGTG 

GCGGAGAGGGATGGGTTCCAGGACGTCCTGGCGCCCGGGGAAGGCTCGGCGGGACGGATT 

TGCGGTGCGCAGCCAGTGCCGTTCGTCCCTCAGGTGCTTGGCGTGATGATCGGGGCCGGA 

GTGGCGGTGGTGGTCACGGCCGTGCTCATCCTCCTGGTGGTGCGGAGGCTGCGAGTGCCA 

AAAACCCCAGCCCCGGATGGCCCCCGGTATCGGTTCCGGAAGAGGGACAAAGTGCTCTTC 

TATGGCCGGAAGATTATGCGGAAGGTGTCACAATCCACCTCCTCCCTCGTGGATACCTCT 

GTCTCCGCCACCTCCCGGCCACGCATGAGGAAGAAACTGAAGATGCTCAACATTGCCAAG 

AAGATCCTGCGCATCCAGAAAGAGACGCCCACGCTGCAGCGGAAGGAGCCCCCGCCCGCA 

GTGCTAGGAAGCTGAC 

Sequence 1646 

TCCGCCAAGTCCTGCGATGATGGACTCAACACCTTCCGCGACGAGGGGCCGGGTTCTGCG 

GCGCCTGCCAAACCGCATACCCAGCCTGCGGATGCTCCGGAGCTTCTTCACCGACGGGTC 

CTTGGATAGCTGGGGCACCTCTGAAGATGCTGACGCTCCTTCTAAGCGACACTCAACCTC 

TGACCTCTCAGATGCGACCTTCAGCGATATCAGGAGAGAAGGCTGGTTGTATTATAAGCA 

GATTCTCACCAAGAAGGGGAAGGCTGAGGACCGGGATGACATGCTGGGCTGGATCAGAGC 

GATCCGGGAGAACAGCAGGGCCGAGGGCGAGGACCCCGGCTGTGCCAACCAAGCTCTGAT 

CAGCAAGAAGCTTAATGATTATCGCAAAGTGAGCCATAGCTCTGGGCCCAAAGCTGATTC 

CTCCCCCAAAGGCTCTCGCGGCCTGGGGGGCCTCAAGTCTGAGTTCCTCAAGCAGAGTGC 

GGCCACGTGGCCTCANGACTCAAGACCTGCCCGCAGGGAGCAAGGATGACAGTGCTGCAG 
CCCCCAAAACCCC 

Sequence 1647 

GGTGTCGCCCCGCGTCCGGTTTCTTCCTAATTTATATTTCCGATACATANGTGTAGAACA 

GGAATTTGCAGAAGCCATTTAAGTTATCTTTTGAGGTMNGCTCTGATTTAGCATTTATT 

CTGATAAAATCTAATACATCATGGGATATATATAAAGCAACTTAATTCTTGTGGTGTAGT 

CTTMTAGTTTTGMTGTTGACTGMTGTCTATAAAATTGTGAGTTTGTCTTTGTTACAT 

TCCAGTGTTTCTGCCTCTTGGCATGCTTAAAGCACGGCTTACTTCATCTGCTCCTTACAC 

ACTAAAATGCTGTTAGTGTGCTCAACTACAGAAATAGCCGCTGCTAAGTTGATGTAGATT

TTCTACTTGMTATTTTTATGGTTGTAGGMCCTCAGGAGGGTCAGTGTTTACTGGTTTA

TATATGCCTTCTTTTTCCTGTTTGAGCTTCTCTCTTTGAAGGGATTCTAACAGAACAAAA 

GCTGCTGATCACCCTAAGTTGGAAACAGNAAAGGNGTAATTAATATAACTTAATGC 

Sequence 1648 

TCACCACGCXaTCCGAAAGTCCGTGACATGGTTCCCCGTGGTGGCCCGTGGCAGCCCGTGG 

CATGGCGTGGCTCAGCTGTCTGTTGAAGTTGTTGCAAGGAAAAGAGGAAACATCTCGGGC 

CTAGTTCAAACCTTTGCCTCAAAGCCATCCCCCACCAGACTGCTTAGCGTCTGAGATCCG 

CGTGAAAAGTCCTCTGCCCACGAGAGCAGGGAGTTGGGGCCACGCAGAAATGGCCTCAAG 

GGGACTCTGCTCCACGTGGGGCCAGGCGTGTGACTGACGCTGTCCGACGAAGGCGGCCAC 

GGACGGACGCCAGCACACCGMGTCACGTGCCMGTGCCTTTGATTCGTTCCTTCTTTCT 

AAAGACGACAGTCTTTGTTGTTAGCACTGAATTATTGAAAATGTCAACCAGATTCTAGAA 
ACTGCGGTATTCCAGTTCTTCTGACACCGGATGGGTGCTTGGGAACCGTTTGAGCCTTAT

AGATCATTTACATTCAATTT 

Sequence 1649 

CNCCACGCGTCCGGGGATCCCTGGGGAGAGTAACAGTGGCCCCACATCCCTCCTCCTGGG 

AGACGCTGGTGCAGGGCCTCAGTGGCTTGACTCTCAGCCTAGGCACCAACCAGCCCGGGC 

CTCTGCCTGAAGCGGCACTCCAGCCACAGGAGACAGAGGAGAAGCGCCAGCGAGAGAGGC 

AGCAGGAGAGCAAAATAATGTTTCAGAGGCTGCTCAAGCAGTGGTTAGAGGAAAACTGAG 

ACGTGCACCCCCATGGGATGGAGACCCGAAGGGACTCAGACGGAGCCGCCGTGTTGGCAG

CGCCTGGGtGTGGGCCCATTTTGGGGACCAAACAGCAAGCTGTGGTCGGATGAGTGCCAG 

GACCTGTGTACCGGGACACGTGGGGAGTCCTCCCAGCATGATGCTTGACTGACCCGAGGA 

AGGTCCTCATGTTTCGTGCCTGTCATTCTCGGATGGCTGTGAGGCATTCCTTGGCAAGGG 

ATGCTTGCGTACCAAGCGGTCCTACCGCATCTACATGGCTTCTGTGATGCATGTTGTCGC 

TTTCCCACCCNGGAT 

Sequence 1650 

CGCGTCCGAGCTTTGCAGGGAAGAACAGAGTATGGGTCATCTCAGCCCCTCATGCCTCGG 

AAGGCTACTACCGCCTCATGATGAGCCTGCTGAAGGACGATGTGTACTGTGAGCTGGCGG 

AGAGGCACATCCAACAGATTGTGCTCTTCCACCAGGCAGGTGAGGAAGGAGGCAAGGTGA 

GAAGGATCACCAGCGAGGGCCAGATCCTGGAGCAGCCCCTGGACCCTAGCCTCATCCCTA 

AGCTGATGAGCTTCCTGAAGCTGGAGAAGGGCAAGTTTGGCATGGTGCTGCTGAAGAAGA 

CGCTGCAGGTGGAGGAGCGCTATCCATATCCCGTTAGGCTGGAAGCCATGTACCGAGGTC 

ATNGACCAAGGCCCCATCCGTAGGATCGAGAAGATCAGGCAGAAGGGCTTTGTCCAGAAA 

ATGTAAGGCCTCTTGGTGTAGAGGGCCANNGTTGGTTGNCTGAGGGGGAATTGACCCGTT 

GGAAGGGGGAAGCAATGAAAGGGCCAAAG 

Sequence 1651 

CGCGTCCGGGATGCGtTGGGTCTGAAAGTCGATGAAGGACGCGATTACCTGCGATAAGCT 

TCGTGGAGTTGGAAATAAACTATGATACGGAGATTTCCGAATGGGGTAACCTAACTGAGC 

AAACCTCAGTTGCATTTTGATGAATCCATAGTCAAATTAGCGAGACACGTTGCGAATTGA 

AACATCTTAGTAGCAACAGGAAAAGAAAATAAATAATGATTTCGTCAGTAGTGGCGAGCG 

A AAGCG AAAGAGCCCAAACCTQTAAAAAGGGGTTGTAGGACATtfTTACATTGAGTTACAA 

AATTTTATGATAGTAGAAGAAGTTGGAAAGCTrCAACATAGAAGGTGATATTCCTGTATA 

CCGAAATCATAAAATCTCATAGATGTATCCTGAGTAGGGCGGG 

Sequence 1652 

GTCGCCNCGCGTCCGCAACATTATTGAGATTGTCGTGTATAGTCATCGAATATCAGCCAG

TTCCTGTMTTTTGTGACACGCTCTCTGCCMGCCCACCAAGTATTTCCTTTATAGCTAA 

AAGTTCCATAGTACTAAGGAAATAAAGCAATAAAGACAGTCTCAGCAGCCAGGATTCTGG 

CTGAAGGAAATGATCCGCCACCCTGAGGGTGGTGATGGTAGTTTCTACCCATACCTCAGC 

CTCAGGCGAGTGGCTTATAGCCTCCATTCATGGTGCACTTTATTTATGGTACTAAGATAA 

AGACTGTCAATCCATTGATTTATCTCCTCCTGTCCCCCATCTAAAATACCCATGCTGCTT 

7TCTGGAGTGTTGTGGGGGGGTTACCAGCTTGATCCACTGGTGCTCTTTAAGAAGGCCCA 

AGAAAGGTCTTTGGGGCATTGCCAAAGAAAATCCCGGATTTATGTGGGAAAACCCTCACT 

TTTCTCTTTACNGGCTGGTACCAAGA 

Sequence 1653 

CCGTCCGI 1 1 1 1 1 1 1 1 1 G AACTTACCGTAAAA I 1 1 1 I I I I I AAAAAGTGCTTGTAAAAAT 

TAAAGAGGAATAAAAGGGGGGTGAACAGCCAGTACGATAGTGCATGCCTGAAATTCCAGT 

GCTTTGGGAGGCCGAGGCAGGAGGATCGTTTGAGGCCAGTAGTTGGAGAGCAGTGTGGGC 

AACGTAGCAAGACCCCATCTCTACAAAAAATTTAAAAGTTAGCCGGGCATGGTGATTCAC 

ACCTGGAGTTCCAGCTGCTGTGGAGGCTGAGGTGGGAGGATCGCTTGAGCCCAGGAATTT 

GAGGCTGCAGTGAGCCATGATTGCACCACCGCACTTCAGCCTAGGTGACAGAGCAAGGGT 

CTAGCTCAGAAAAAAAAAAAAAGGAGGAGCAAGCACGTGTTGATGGGTGGAAATCCAGCC 

AGAAATGCTGAGGCTGAAAAGATTGTTCCCAGTTTNCTGGTAGCCAGGGGAAAAGGGGAA 

ATT

Sequence 1654

CGCGTCCGCTGACATTTCCTAGGAAGCTNGGAAAAGGAAAGTGAAGGAATGGTCTAAAGA

MTGACCATTCACACTGATTTTGTCTGGACAGTCTGGCCGCAGGTTATTCATAGATTATT 

CAGCCTTTGCAGGACTTGGATTCAGGGTTTTACTCAGTCCCTTTACCTTAGGTGGAATCT

TCTTAAATTGAMTTTTTGGTMGGAATTCATTTCACAGGTAGTGTTTCAGACTCTGA^ 

GCCCTGACTTGGTTCTTGGCTTCTACTGTACTAGTTACTAGTTACTGGTACTGTTGCCAA 

GCMTCTGCTTGGAATTTGTGGATCCCTGCTGTTCCCTAATCCCACCCCCTGCCCTGAGA 

CAGTGAATGTAGTCCGTGAAGGGAGTGCCTCTCTGGGACCCCCTGTG7TGTTACAGGCTG 

TGCAGTGCAACAATTCCAGCAAAAATACCCTATCCCCGCACTTAGTCATTCGTGGTAACT

AACAATTTTGAAATACTCATATAAAATGAACAGGAAAGTGGTTAGTG 

Sequence 1655 

GACCACGCGTCCGGACCCAGACCCGGCTGACCCACCTACCCGCGATCCTGCCCATGGCTG 

ACGGGCTCTTTCGGCGCAGACCCTGGGGTCTCGAGCAGATTCGCCCGGACCCCGAGTCCG 

AAGGCCTGTTTGACAAGCCTCCCCCGGAAGACCCTCCCGCTGCCCGCGGGCCCAGGTCGG 

CGTCGGCCGCGGGCAAGAAGGCTGGTCGGCGCGCGGGCGGGAGGGCGCAGGGGGGCCGCG 

CCGGGCAGCCCCCGAAGGCCGCATCGCGCCCCCCGCCCAAGAAGGAGGCGCCTCCACTGG 

ACGAGGGCTGCTATCTCGACCATTTTCCGCACCTCTCCATCTTCATCTACGCAGCCATCG 

CCTTCTCCATCACCTCCTGCATCTTTACCTATATCCATTTACAGCTTGCCTGAGTGGCCA 

GCGCGGGACGGGGTGGGCGCAGGACCGAGCGGGGAGGGAAAGGGGAAAACGGGGGCTNGG 

CATTTTGTGTTTTAG 

Sequence 1656 

CNCTAACCCCGMCTCTAGATCGTCTTGCTTGTTTGTCTGAAGAAGGGAATGAAATAGAA

AGTGGAAAMTMTATTTTCAGAGCATCTTCCCTTAAGTAAGCTACAGCAAGGCATAAAA 

TCTGGTACATACCTTCAAGGAACATTTAGAGCTAGCAGGGAAAATTACTTGGAAGCTACA 

GTATGGATTCATGGCGACAGTGAAGAAAATAAAGAGATAATCTTACAGGGACTTAAACAT 

TTAAACAGAGCTGTTCACGAAGATATTGTGGCTGTGGAGCTTCTCCCCAAGAGTCAGTGG 

GTAGCACCATCTTCTGTGGTTTTACATGATGAAGGTCAAAATGAAGAAGATGTGGAGAAA 

GAAGAAGAGACAGAACGAATGCTTAAGACTGCTGTAAGCGAGAAAATGTTGAAGCCTACA 

GGTAGGAGTTGTAGGAATAATAAAAAGGAATTGGA 

Sequence 1657 

CGTCCGCGGACGCGTGGGCGGACGCGTGGGCTGGCTGTATCTATACTTTCCTTGAGAAAA 

ATCCCATAAAGTGGATGGACCTGTGAAGAAAATGTATGCTTATGGCCTAGCCTTCATGTC 

TGGCTGATGTATCCTATAAGGCAGTAAGCCCCTTTTCTAGTCTCTGGTAAGATGCAAGAG

CTCATATCCCCATCACTGACATTTTAGTTTGGAAATAATATTGAGACTGTGCTATGACCA

ACCCCTGATGTTGTTTTTTCTTTTCAMCTTTTGCATATGAGTAGAGG 

GTTAAGTATTTATGTCTGGGGGGATACCTTCAGGTGTCTTATCTGTTITATGCAAGAATT 

TATGTGTTCATCTTTATTCAGTGCAAAGA I I 1 1 I I 1 1 rAAATTTTGTTTATAATTGGAGG 

TAACATTAAGACAACTCTTNCTCCACAAGAAAACCTCTAAAATTAATATTCCTTAAGATT

TGGTTTTCCTTTGCCTTATMTATTACCTTT^ 

AAAAG 

Sequence 1658 

GTCGCCCCGCGTCCGTTTGATATACCACTCTGATAACTCATATAAAAATATCATCATAAA

MGCTTMTTTCATCCCTTTTATGTTGGTTTTAAMGGTAMTGCTTACCATATTTTAT^ 

ATTGAGAACTCTTACATAGTAGMTCCATTCTATMTACATGTGTTGACAAAGCTTTAGA 

GAMGTTTCCTATTCTCTTCCATTTCCCCTGCCCAAAGTGCTGACATAGGCAGTGATGAA

GMTCTTTACCAAGATTTTCAGGGTGTACCTATGAMTTGCTTTAAATGCACTGCTGGTG 

TAMTAATTAGCAAGCAAAAGCGTTTCTGTGACTTCAGGTACCAGCTTAAAGAGCACTAG 

GGATGGGGAACGAATGCCAAATCAGACTCCACCTAGAGCACCAGGAAACAGCTTGTCCCT 

GGTAGGGAAATGGTGTTGCTGAAAG 

Sequence 1659 

CGACCNCGCGTCCGGCTGNTGACCCCATGCTGAGTGGCCNGTGGGGAGCGGCGCCCGGCA 
GGCTCTTCTGGGGTCGTCTGTCCTATCCGTGGATTGTATATACTCTTCTCTGTTAAGGAG

TTTTTCCCMGMGAAMGTATTTAAAAGAAATACCAGTGAGTGCCTTAAAGTTGGAGAA 

GTAACTGCCCATGCCCAGAAATAAGGATGCCAGTGCCCAGAAGCAGTGAGATTAGTCTGT 

GTCCACAAGCAGAGGCCCCCTCGATGGGAGGGAGTGGCAGGCAGGAGAAGGTGGCGCTGC 

CAGGTGCCCGGGTCTATTGGAGGCGCCCCATCTCAGACTTCCTAACACAGCCTGTGTGGA 

AGGCAGAACAAAGAATGCATGCCCAGTCAGAAATCTGTTCTATTCTGCTCCAGGAAAATC 

GGAAACCTGTGAGTCANAGTCAGAGAAACTTACCCAGCCACGTATTCCTGTTTCATGGGT 

NCTGTAGATGTTTTGAGTCAAGGAAGGTA 

Sequence 1660 

TCGACCNCGCGTCCGGTGGGTCCCTGCCX3GCCGGCGGCGGGCGCAGACAGCGGCGGGCGC 

AGGACGTGCACTATGGCTCGGGGCTCGCTGCGCCGGTTGCTGCGGCTCCTCGTGCTGGGG 

CTCTGGCTGGCGTTGCTGCGCTCCGTGGCCGGGGAGCAAGCGCCAGGCACCGCCCCCTGC 

TCCCGCGGCAGCTCCTGGAGCGCGGACCTGGACAAGTGCATGGACTGCGCGTCTTGCAGG 

GCGCGACCGCACAGCGACTTCTGCCTGGGCTGAGCTGCAGCACCTCCTGCCCCCTTCCGG 

CTGCTTTGGCCCATCCTTGGGGGCGCTCTGAGCCTGACCTTCGTGCTGGGGCTGCTTTCT

GGC GGTCTGGAGACGATGCCGCAGGAGAGAAGAAGTTCACCACCCCCATAGAGG A 

GACCGGCGGANAGGGCTGC

Sequence 1661

GGTGTCGACCNCGCGTCCGGCGCCCCGCTCGCATTGTTCGGGCGACTCTCGGAGCGCGCA 

CAGTCGGCTCGCAGCGCGGCACTACAGCGGCCCCGGCCCGGCCCCCGCCCGGCCCCGGCG 

CAGGCAGTTCAGATTAAAGAAGCTAATTGATCAAGAAATCAAGTCTCAGGAGGAGAAGGA 

GCAAGAAAAGGAGAAAAGGGTCACCACCCTGAAAGAGGAGCTGACCAAGCTGAAGTCTTT 

TGCTTTGATGGTGGTGGATGAACAGCAAAGGCTGACGGCACAGCTCACCCTTCAAAGACA 

GAAAATCCAAGAGCTGACJCACAAATGCAAAGGAAACACATACCAAACTAGCCCTTGCTGA 

AGCCAGAGTTCAGGAGGAAGAGCAGAAGGCAACCAGACTAGAGAAGGAACTGNNAACGCA 

GACCACAAAAGTTTCACCAAGACCAAGACACAATTATGGCGAA 

Sequence 1662 

GACCACGCGTCCGGAAGGAAGGGACGGGCTGAGTTCCCCGACGAGAGACACACCCAGATT 

TTCCTGCAGCTTGGGGAGAGGTCCTCCCAGGAGCCTTGGTCCCTCCTGGCCTGCCGGAGT 

CCTTAGCCAGGATGGAGGCTGTTGTGAACTTGTACCAAGAGGTGATGAAGCACGCAGATC 

CCCGGATCCAGGGCTACCCTCTGATGGGGTCCCCCTTGCTAATGACCTCCATTCTNCTGA 

CCTACGTGTACTTCGTTCTCTCACTTGGGCCTCGNATCATGGCTAATCGGAAGCCCTT 

Sequence 1663 

GTCGACCACGCGTCCGGGCTCCATCCGGGCTATCCTGCCGCCTTAGCGGCTGCTTCTCCC 

CAGGATGCGGGCAGGGGGCCTCTCTCCCACTCCCCACACACCGATTTCTGAGTAGCGATA

GGGGCTGGAGGCTTATTTTATGGGGTAGGGGGCCGCTGGTAGGCGAAGATTGTCCGAGGG 

AGAGGGGGAGGATGAAGCCAGTGCGTGGCGGAGACTTGCCAGATGTTGATGCCTAAGAAG

MCCGGATTGCCATTTATGAACTCCTTTTTAAGGAGGGAGTCATGGTGGCCAAGAAGGAT 

GTCCACATGCCTAAGCACCCGGAGCTGGCAGACAAGAATGTGCCX5AACCTTCATGTCATG 

AAGGCCATGCAGTCTCTCAAGTCCCGAGGCTACGTGAAGGAACAGTTTGCCTGGAGACAT 

TTCTACTGGTACCTTACCAATGAGGGTATCCAGTATCTCCGTGATTACCTTCATCTG

Sequence 1664

CCGCGTCCGGGGGTTGGTACCCGAGCGCCTTCCCCTCACCTCAACCAGAGAAGAGCATCC

GGTTGCi 1 1 i lAAAGCTTTTAGCCTGCCCTAGCAAGGACAAAGCATGTTAGATTAGAGAT 

GCTTCTGCTGATCGCAGGGGTTCTTATTTGAAAACATCTATGATGGGGGTGGGGTGGGAG 

GAGACAGGTTGTGGTTATGCAGGAAAATCTTGTCCTAAAAATATATGAGTTTGGGGGTAA 

GGGGTGGGATAGCCMGCAAMTCAGTMTTATTTTAAAATGAACATATGTATTTTTATT

MCTTTTAGTTAAATACAGATTTTACAACGAGGTCAGCATAAGCCTAAATCTATATAGAG 

GGCTMCTCAGGCATTGTCTTGTTTATTTGTAGACTGGATTAAAAACAACCTGTCCTGTT 

TTGTNAGTTCCCAGCTTCTTTCGTTTAGAATAAATTAGACCAAAAGAA 

Sequence 1665 
GCGTNCGCTTAATGTCAATGTGGCCTGGGCTGGAGGTCTGGACCCCCCCATGGGGGATCC

TGAGTA(X)TGGCTGCTTTCAGGATAGTCGTGATGCCCATCGCCCGAGAG7TCTCTCCAGA 

CCTAGT(X)TGGTGTCTGCTGGATTTGATGCTGCTGAGGGTCACCCGGCCCCACTGGGTGG 

CTACCATGTTTCTGCCAAATGTTTTGGATACATGACGCAGCAACTGATGAACCTGGCAGG 

AGGCGCAGTGGTGCTGGCCTTGGAGGGTGGCCATGACCTCACAGCCATCTGTGACGCCTC 

TGAGGCCTGTGTGGCTGCTCTTCTGGGTAACAGGGTGGATCCCCTTTCAGAAGAAGGCTG 

GAAACAGAAACCCAACCTCAATGCCATCCGCTCTCTGGAGGCCGTGATCCGGGTGCACAG 

TAAGTGTGGAGATGGGACACTCGCTGAGCTCAGACTGAAGGATCTTGGT 

Sequence 1666 

CGACCNCGCGTCCGGTGTGATGATCGCTACTGCTGGAGACCGCACAGAGGAGTTCCACGG 

CCACNGCAGTGAACCTCCTGGGGAACTTGCCCCTCAAGTGTCTGGATGTTCTCCTCACCC 

TGGAGCCACATGGAGACTCCACGGAGTTCATGGGAGTGAATATGGATGTGATTCGTGCCC 

TCCTCATCTTCCTAGAGAAGCGTTTGCACAAGACACACAGGCTGAAGGAGAGTGTAGCTC 

CCGTGCTGAGCGTGCTGACTGAATGTGCCCGGATGCACCGCCCAGCCAGGAAGTTCCTGA 

AGGCCCAGGTGCTGCCCCCTCTGCGGGATGTGAGGACACGGCCTGAGGTTGGGGGAGATG 

CTGCGGAACAAGCTTGTCCGCCTCATGACACACCTGGACACAAGATGTGAAGAGGGTGGC 

TGCCGAGTTCTTGTTTG 

Sequence 1667 

NCGCGTCCGACACTATTTAGAGAGCTCCCTTCCCACCTCTCTGCCCAGCCTTGTTACCTC 

ACTTCTGCTCTGGCCATGGCTGTGMGGGCCCAGCCAGCTCCCTGTTTTGATGTTCTGTG 

CAACAGCTCCGGGGTCTTGTGACTGGAGATCCTCAACAGGCCCTGGAGCCAGGACTGGAG 

TCTTGGCAGCTGATGAGCAGCACCTTGCCGGCCAGGAGGAGCTGATGCTGACGATCTCCC 

CAACATCTGAAGGCTTAAAGAACATTGTCGTTCTTCAGCCCTCCTTGCTTCTCTCAATAC 

AATAAGACATTGCAGAAGCAAAAGGGTGGCCTCTGCTCCAGGCAAGGCAGCTGGCTCTGT 

CTGGGGGCGTCGGCCTGGGGCTTGGGTGCCACGTGCTGAGATTGCATAGTCAAAACAAGC 

CATTTTTGCCMCAATAGCTTGTGGCTCCCACATTTTTCTACCCTTGCACTNAANGGCCA 

GACCACTCT^GCATGGACCAANACCATNTTTCCAMCCCATGGGGCTTTTTTTNCC 

Sequence 1668 

CANGAATACTGAAAAATGAAGCCTAAAATGAAGTATTCAACCAACAAAATTTCCACAGCA 

AAGTGGAAGAACACAGCAAGCAAAGCCTTGTGTTTCAAGCTGGGAAAATCCCAACAGAAG 

GCCAMGAAGTTTGCCCCATGTACTTTATGAAGCTCCGCTCTGGCCTTATGATAAAAAAG 

GAGGCCTGTTACTTTAGGAGAGAAACCACCAAAAGGCCTTCACTGAAAACAGGTAGAAAG 

CACAAAAGACATCTGGTACTCGCTGCCTGTCAACAGCAGTCTACTGTGGAGTGCTTTGCC 

TTTGGTATATCAAGGGGTCCAGAAATATACTAGAGCACTTCATGATTCAAGTATCACAGG 

MTTTCACCTATTACAGAGTATCTTGCTTCTCTAAGCACATACAATGGATCAATCCATTA 

CTTTTGCTTTGGAGGATGGAAAGTTATGAGATATATGTTGAAGACTTGAAAAAAG 

Sequence 1669 

GTCGACCNCGCGTCCGCCCGCCCATCACTGCTGTTCCTCCAGGGCCAGCACTCGGGCGAG 

GCAGGGGAGCTGCCTTCGGTACATAATTTGAAGGGGCACTCCCTCTTGGGCACATGCCGG 

CCCTGAGTGCCTCCCTTGCCTCACTCTGATCCTGGCCCCATAATGTCCTCAGTGGAAGGT 

GATGGGGGCCGGTGCTGTGGGGAGAGTAGAAAGAGGGGTTGGCATGACTAAAAATACCAG

TATGTGTATTMGTATTTTGAGAATGAAATGCCAAGGAGTGCCTACTATATGCCAGCTCT 

AGGAATGGAGTAGACAGTGGACACAAGAAGGACTTACGCCCTGAGCACAGGTGCCAATGG 

TGACAAGACTGGCAAGACGTGAGGGCATGAATGGTTCATTCAGGCAGCTGCTGCAGATGT 

GGTCACCTGGTGCCATCTGCTGCTCCCTTTTCCACTTTTCTATGTCCTCCTTCCACCCCA 

A 

Sequence 1670 

CGACCNCGCGTCCGGTCTGAAGGGTCTGGCTGGTGAGCCAGGTTTTAAAGGCAGCCGAGG 
GGACCCTGGGCCCCCAGGACCACCTCCTGTCATCCTGCCAGGAATGAAAGACATTAAAGG 
AGAGAAAGGAGATGAAGGGCCTATGGGGCTGAAAGGATACCTGGGCGCAAAAGGTATCCA 
AGGAATGCCAGGCATCCCAGGGCTGTCAGGAATCCCTGGGCTGCCTGGGAGGCCCGGCCA 
CATCAAAGGAGTCAAGGGAGACATCGGAGTCCCCGGCATCCCCGGTTTGCCAGGATTCCC

TGGGGTGGCTGGCCCCCCTGGAATTACGGGATTCCCAGGATTCATAGGAAGCCGGGGTGA 

CAAAGGTGCCCCAGGGAGAGCAGGCCTGTATGGCGAGATTGGCNCGACTGGTGATTTCGG 

TGACATCGGGGACACTATAAATTTACCAGGAAGACCAGGCCTGAAGGGGGAGCGGNGCAC 

CACTGGAATACCAGGTCTGAAGGGATTCTTTGGAGAGAAG 

Sequence 1671 

GAGTCGACCNCGCGTCCGCAACTGGTGTCCAGCTCGGTGCACTCCAAGCGCCGTTCCCGA 

GCGGACCTCACGGCCGAGATGATCAGCGCCCCGCTGGGCGACTTCCGCCACACCATGCAC 

GTTGGCCGGGCCGGAGACGCCTTTGGGGACACCTCCTTCCTCAATAGCAAGGCTGGCGAG 

CCCGACGGCGAGTCCTTGGACGAACAGCCCTCTTCTTCATCTTCCAAACGCAGTCTCCTG 

TCCAGGAAGTTCCGGGGCAGCAAGCGGTCACAGTCGGTGACCAGGGGGGAGCGGGAGCAG 

CGTGACATGCTGGGCTCCCTGCGGGACTCGGCCXSTGTTTGTCAAGAATGCCATGTCCCTG 

CCCCAGCTCAATGAGAAGGAGGCCGCGGAGAAGGGCACCAGTAAGCTGCCXAAGAGCCTG 

TCATCCAGCCCCGTGAAGAAGGCCAATGACGGGGGAGGGCGGCGATGAGGAGGCGGGCAC 

GGAGGAAGGCAGTGCCCCGTCGGAAT 

Sequence 1672 

CGCGTCCGCTCGCGGCNGGGCATCGNGTACATCCTCAGCAACCATGGGCTACGTGCGCCA 

GCTCTCCCAGGCCCTGGACACATCCAACGTGATGGTGAAGAAGCAGGTGTTTGAGCTACT 

GGCTGCCCTGTGCATCTACTCTCCCGAGGGCCACGTGCTGACCCTGGACGCCCTGGACCA 

CTACMGACGGTGTGCAGCCAGCAGTACCGCTTCAGCATTGTCATGAACGAGCTCTCCGG 

CAGCGACAACGTGCCCTACGTGGTCACCCTGCTTAGCGTGATCAACGCCGTCATCTTGGG 

CCCCGAGGACCTGCGCGCGCGCACCCAGCTGCGGAACGAGTTTATCGGGCTGCAGCTGCT 

GGACGTCCTGGCTCGCCTGCGGTGAGTCCCCACTGTAGCGGTCCTGCCGGNTTNCCCCTC 

CTGCTCCCAAGGCCAGGCCCACCTGCCCTTTGGCTCCCAGCCACCTCACCTAAGCAGCAC 

CTTCCAGATGGCAGGGGAGGTGGC 

Sequence 1673 

GTCGACCACGCGTCCGGCCAGAGCTGAGTGGCAGCCGCCTCCCTTATGCAGGACATGTGC 

TCTCGGCTTCACCAGGGTTCTGACCGGGTCTGCTTCTGCATTCACAGCGCCTCCTGGACC 

TGAAGGCATCTGAGTGTGAGACCCTGTTCTAACTCTTAGAAGTGACATTGTAAGAGGTGG 

TGGGGACCAGCTAATTGGTCCAACCCAGCCTGAGTGCACCACCCTTTGAACAAATGTATC 

AGTGATGAAMTTTGCCTTTGCCCCGGCTTGCCTGTAATCCCAGCACTTTGGGAGGCCGA 

GGTGGGCGGATCACTTGAGGTCGGGAGTTCAGGACCAGCCTGGCCAGCGTGGCGAAACCC 

CGTCTCTACTAAACATAAAAAAATTAGTCAGGTGTGGCGGTGCGTGCCTGTGGTCCCAGC 

TATTCAGGAGGCTGAGGCACCAGAATTGCTTGA 

Sequence 1674 

TGACGGCGGCCCGGCCGACGGGAGCCGGGGCGGGGCGGCGGNCCANCGAAGGAGCGCGCG 

GGCGGTCTGGCCCCGCCCCCTCCCCGCCCGCCTTCCCGGTGACCTTCAGGGGCCCGGGTG 

GCGGGCGCAGGCCCCTGCGGCGGCGGCGGGATGTTCGTGCAGGAGGAGAAGATCTTCGCG 

GGCAAGGTGCTGCGGCTGCACATTTGCGCGTNCGACGGCGCCTAGTGGCTGGAGGAGGCC 

ACCCNGGACACCTACNGTGGANAMNCTCAAGGAGCGCTTGCCTCAAAGCACTGTGCTCA 

TGGGGAGCTTANMGATCCCCAAMGTATAACCCATCATTAAATTTAATCCCACGCTGCC 

TNAANANAAGGGGTGCTTGNGTGATTGCCATGNACCATNCTTGGGAAGGAAGAAACCATT 

CCCAGGACCCAAAAGATGGGCCCTATTTNTTGGATTA 

Sequence 1675 

CACGCGTCCGGGATCCCGTACCCGGGACAGACTCGGCGCCGCTGGCTGGCCTGGCCTGGT 

CGTCGGCCTCTGCACCCCCGCCGCGGGGGTTCAGCGCGATCTCCTGCACCGTCGAGGGGG 

CACCCGCCAGCTTTGGCMGAGCTTCGCGCAGAAATCTGGCTACTTCCTGTGCCTTAGTT

CTCTGGGCAGCCTAGAGAACCCGCAGGAGAACGTGGTGGCCGATATCCAGATCGTGGTGG 

ACAAGAGCCCCCTGCCGCTGGGCTTCTCCCCCGTCTGCGACCCCATGGATTCCAAGGCCT 

CTGTGTCCAAGAAGAAACGCATGTGTGTGAAGCTGTTGCCCCTGGGAGCCACGGACACGG 

CTGTGTTTGATGTCCGGCTGAGTGGGAAGACCAAGACAGTGCCTGGATACCTTCGAATAG 
GGGACATGGGCGGCTTTGCCATTCTGGTGCAAGAAGGCCA

Sequence 1676

TCCCTCTGCTGATGATGGATGCCCCTAACACCTGTGCCTAACACCCCTACTGAACCCCAC

AGCTCCAGCCTTAGTTTTTGGAGTCAAGTGTTAAAGGTTTCTGGCCAGAGGAATTGGGGT 

CTTG CCAT CCCT GCAAT AGCCCTfTTATGGGCTCTGGGAGACAGCTTTAGGGAATAAATG 

GGGATTTTCCCCTTTTTCTACCCACTCCTTTGCTTCCTCCAAGACTTACCCAACTCCTTC 

CCCCTCAGAGAACCAAATAGCCTGAGGAAGCAGGAGAGTTCCTGGTTATGGCAGATTCTT 

GGTGATTTGGGGCTTCAAGACAGTAGGTGAGAGATGCTGTCAGGGACGTATCTTCTTCAT 

ACCAMGTCACTGGTCCTTTCTCAGCCTCTCTCGTGCTTTTCTCCTAATGACCATATTTT 

TGCCAAAAATTGGGAATATGTTATC3TGACAGACCAGAATATTTGAAGGTTTGGGCTG 

Sequence 1677 

GCGCCGCGGATATNCGGATCAACCTATGGTNTCAATATTGTNAGTTATTCAGCATAAACA 

GMTTATTTCCCMNACTTGATCTGAAATATTNNTAATGGTCNTACTNGAAACTTATATT 

CTTNCTGGGAGNGANGTNTTTATCATTTTTCCATGGAGACAGGTTCTAACTCTGTTGCCC 

AGGCTGCANTGCAGTGATGTGATCATAGCTCACTGCAGCCTGAAACTCCTGGGTGTCAAG 

TGATCTCTGGCCTCAGCCTCCCAAGTAGTTGGAACTTCAGATACGTGCCACCACAACCAG

CTMTTTATTTTTTAGAGATGAGGTNTCGCTATGTNGCCCAGTCTGGCCNNCTAGCCNCA 

AGTGATCTGGCCATCTNAGCCTTCAGTTGGAGATGTCTGATTTATGTTAATATAAGAAAG 

CTGTTGATCGTTTATCATAAANGCATTT 

Sequence 1678 

GTCNCCNCGCGTCCX3CTCCTCCGCCGGCATGCAACTCGGCGCCCGCG6TCCATGGACCGG 

AACCTCGGGCCGACGGACGGGAACCCGGGCCGCGATCGCCGCCTCCCCGCCTCAGGCTCC 

TCCTCCTCGCTCTCCGCCGCCTCCGCCGGACTCCCGCAGGCCCTGCACCGCCGCCGCCAG 

GCTAGCGGAGCTGCCCCGGGAAGCTGGGTGACGGGTTCGCGGCTGCCGCCGGACTGCGGC 

CTACTCCGCCGCCTCTCAGTGCTATTGTCCCTGGGCCTGGCCTTGAGCGGGTCCACTGGG 

GAAGGCNCGTGTGCGCCGGCTCCGCGGAAGATGCXJGGACCAAGCCCTACAGCAGATGCTG 

GACAAGAAGTTGCTGGGTTTGTTTTGCTACTGATGAAGATGATAGAACAGCTGAATGGGT 

GAGGACCATGGCAGGTGCAGGAGGATCTACAAAATGGGTTCACCAGGGCCTGTCTACAAC 

GCTGGGTGGATGAAAAGCAAAGAG 

Sequence 1679 

GCGTCCGGGCCCGCGACCGAGCGTGCGGACTGGCCTCCCAAGCGTGGGGCGACAAGCTGC 

CGGAGCTGCAATGGGCCGCGGCTGGGGATTCTTGTTTGGCCTCCTGGGCGCCGfGTGGCT 

GCTCAGCTCGGGCCACGGAGAGGAGCAGCCCCCGGAGACAGCGGCACAGAGGTGCTTCTG 

CCAGGTTAGTGGTTACTTGGATGATTGTACCTGTGATGTTGAAACCATTGATAGATTTAA

TAACTACAGGCTTTTCCCAAGACTACAAAAACTTCTTGAAAGTGACTACTTTAGGTATTA 

CAAGGTAAACCTGAAGAGGCCCGTGTCCTTTCTGGAATGACATCAGCCAGTGTGGAAGAA 

GGGGACTGTGCTGTCAAACCATGTCMTCTGATGAAGTTCCTGATGGAATTAAATCTGCG 

AGCTACAAGTATTCTGAAGAAGCCAATAATCTCATTGAAGGAATGTGAACAAGCTGAACG 

ACTTGGAAGCAGTGGAT 

Sequence 1680 

GTCCGGCGTGGGGAAGGGTGGGGTGAGGGGGCGTGGCCGCAGCTAGGGCGGCGAAACTCT 

CCTCCCCTCGGCCCCACCGCGTGGGACGGCGTGAACGTGGTGTCGGAGGGATGTCAGCCT 

TCTCTGAGGCGGCGCTGGAGAAGAAGCTGTCGGAGTTGAGCAACTCGCAGCAGAGCGTGC 

AGACCTTGTCCCTGTGGCTCATTCACCACCGTAAACACTCGCGGCCCATCGTCACCGTGT 

GGGAGCGGGAGCTGCGGAAAGAGTGGAGGTGCAACAAATGAGAAATTCCAATTGGAGATT 

TTGTCAMCAGGMTTTACTCCAMCCMACAGGAAGCTTACTTTTCTCTACCTAGCCAA 

TGATGTCATACAGAACAGCAAGAGGAAGGGGCCAGAGTTTACAAAAGATTTTGCACCAGT 

TATA 

Sequence 1681 

CCGGCAAAGCAGGGACTCCTGATTTATATGTCCCTCCTCCTGGCAATCCTCTCACCCCAC 
CTCCCCTGAGAACCTCAGTTCTTCCTAAATTGCTAAAGCTGAGGGGAAAGGGATGCTTTG 
CGGCAAAGGCGCTGCTGCCTCAAGCTTGCTTTACATGCCTCTCTAGTTCCTCTCGCACTA

CAGAGTGGTAGCGMCAAAAGCGTTCGCCCTAGAAGCGACCTGAATGGAAAAATCTGCAC 

ACGAACTAATGGGTTTGTCACGGAAAGTAGGGAACCGGGTCTGCAGCATTCCCTGGAGAC

AGACTTTCTGGTTGGTTTTCAAGGGTCCAAGGCAGCCATCAGCCCGGCTGTGCCCTCCCA 

CCCTGCCTCCCACCCAGTTGATTCTCTCTTTGTGTAAGTTTAGCCCCTCTGAGGGTGGTG 

GAGTGAGAGCATCCCATCAGATATATATACGATTCATCAGTCGGCACTTAAAAAG 

Sequence 1682 

TCACCNCGCGTCCGAAAAACGCAGATGATATACCTGCAACATCNGTCATGGCTGCGCCCT 

GTGCTCAGAAGCAACCCGGGTGGAATATTGCTGGTGCAACAGTGGCAGGGCACAGTGCCA 

CTCAGTGCCTGTCAAAAGTTGCAGCGAGCCAAGGTGTTTCAACGGGGGCACCTGCCAGCA 

GGCCCTGTACTTCTCAGATTTCGTGTGCCAGTGCCCCGAAGGATTTGCTGGGAAGTGCTG 

TGAAATAGATACCAGGGCCACGTGCTACGAGGACCAGGGCATCAGCTACAGGGGCACGTG 

GAGCACAGCGGAGAGTGGCGCNCGAGTGCACCAACTGGAACAGCAGCCGCGTTGGCCCAG 

AAGGCCTACAGCGGGCGGAGGCCAGACGCCATCAGGCTGGGCCTGGGGAACCACAACTAC 

TGCAGAAACCCAAGATCGAGACTCAAAGCCCTGGTGCTACGTCTTTAAGGCGGGG 

Sequence 1683 

CCGTCCGCTCCTTGGCAAGAACGAAAGGTGTGATGAAACCTCCCTGCTCGGAAGGGTCTC 

CGTGGAGGTGTCCTCATTTCACATGCTGGGTTTTGCAAGCGAGGAAGCCAGGCAGTGGAG 

GAACTAGAGAGAGGCAGGCGTGTGTGTGGACAAGCGCTGGAGCCGCAGCCCTCAGACTGG 

CACGGGAACGCCAGCGTTGGGTGTTCAGATTCCACGCGTATGTCTGGGCTCACTCACAGC 

ATGGCCGAGTGTCTGCAGTGCTGGTCCTGACCCTTCCAGAGCAGCAGTGGACAGATGAGA 

TAAGACTGTTTCAGAAACAAAGATGGCCACAGCCTTCCTAACAAGCAGGTCATCTGGCCA 

TGTCTGTATTGTAACTGGTAAAAGGCTTCAAGTCAGATTGATGATCAAGAAAANGTCAAA 

ACCCCAGCCCAAGATTGGGAAAGCAGGTTNGTGGI^CCAANGCTTTTTAAAAAAATTATT 

TGAAGCTCTTCATTCCTNTTTCTGTGAGTGTGTCTTTCTCTT 

Sequence 1684 

NCCCATACTGGGGGCCCCCTTCCTGCAGGCCCCATCAGGTGCAGAGCTGTGGGTCTGGTT 

CCAAGACACTGTCACTGATGTGGATAAATCTTGGAAGGAGCTCAGTAATGTCCTCTCAGG 

GATCTTCTGCGCCTCTCTCAACTTCATCGACTCCACCAACACAGTCACTCCCACTGCCTC 

CTTCAAACCCCTGGGTCTGGCCAATGACACTGACCACTACTTTCTGCGCTATGCTGTGCT 

GCCGCGGGAGGTGGTCTGCACCGAAAACCTCACCCCCTGGAAGAAGCTCTTGCCCTGTAG 

TTCCAAGGCAGGCCTCTCTGTGCTGCTGAAGGCAGATCGCTTGTTCCACACCAGCTACCA 

CTCCCAGGCAGTGCATATCCGCCCTGTTTGCAGAAATGCACCGCTGTACTAGCATCTCCT 

GGGAGCTGAGGCAGACCTGGCAAGTTGTATTTGATGCCTTCATCACGGGGCAGGGAAAGA 

A 

Sequence 1685 

CCGCTGGTTATTACCCAGCTGGATGGTTTCCTTTTAGGCAAGAAGGAGGTCATCAGCAGG 

CTCCCAACAATAATGCCGAAGTTAACAATGATGGGCAAAATGCAAACAACTTGGAACTTG 

AAGAAATGGAGCGTCTTATGGATGATGGGCTTGAAGATGAGAGTGGAGAAGATGGAGGTG 

MGATGCCAGTGCMTTCAAAGGCCTGGATTMTGGCTTCAGCTTGGTCTTTCATCACCA 

CCTTCTTTACTTCACTAATACCAGAGGGGCCTCCCCAGGTTGCCAATTGACCTGAAAAAC 

TGTGCCAGCTACMGGA GGGT CTGACTTCAGGAMGTGGTTTAAATAACAGTGCAATTTC 

AAAAMATTTATAACTTTCTTTTGATCATCATGTACAGAGGTG I MINI I CTTTAGGCT 

TCTCAT GCATATGAATATTTTAAGCACGAATGGACTACTAAATATCTGAGI 1 1 I II I I I I 

I I I I I I I I I AAAGAATC 

Sequence 1686 

CGCGCCGGTTTGGCTGCCCTGCATAAGCTGCTACAAATAGAATAAAGAATTTCATACGCC 
TGTATCTATCATTTAGATGCATGGAMAAMTGGGCTTTGCACACAATGGGTTTGGAGCT 
GACTGGGAACAATGGAAAAAATTACATTAGCTGTGGTTGTAMGTTTTTTTGTT^ 
TTTGI I II I | U I ICI I I I I ICI I I H I II I I U I I ACCATCTTGTG AAAGGTTTCTGAA 
ACTCGATAATAAAMGCGGTTGGTGTAMTTATTCTITTTGTGTCACATTTTTAGAAGGAA 
AAACATAAMGMTGTATCCTTAGTACTGGTTCTTAAACAGCCCATAAAAACCCATTGGC
CTGAAGCTTATATCTCAGGCCTATGCCCATCTTATAGTCTTGGAAGACAAAA 
Sequence 1687 

CGCGTCCGTGGGTCTCGCCCTCAACCTGTGCATGTATATGTGTGTCTTTGTGTGTGTATG 

TGTGATCTCTGCCTGCAGGACCAGCCCGGTGGCACCCTGGATCTGACCCTGATCCGTGCC 

CGCCTCCAGGAGAAGTTGTCACCTCCCTACAGCTCCCCACAGGAGTTTGCCCAGGATGTG 

GGCCGCATGTTCAAGCAATTCAACAAGTTAACTGAGGACAAGGCAGACGTGCAGTCCATC 

ATCGGCCTGCAGCGCTTCTTCGAGACGCGCATGAACGAGGCCTTCGGTGACACCAAGTTC 

TCTGCTGTGCTGGTGGAGCCCCCGCCGATGAAGCCTGCCTGGTGCTTGGCCTGAAGTTTC 

CCAAGGAAGCTGTCTGGTGGGCCCCTTGGGTGATGGGCCCCTTGGAGGGCTTGGAAGCCC 

CCCCATGGGCCAAGCCCCAGCCCTGGGCTTCTGGTTCNTCTTGTCCCTGGTCACCCCCAT 

CCCCACTCCCCCTTGGGTGGGCCTTGAACTTNCCACTTCCCTTGGGTGGGGCCC 

Sequence 1688 

AGGAGGNTTGAAGGAGTTGNNGGAGGAGGAGGATGGAGGCGAGGGCGAGCGAGCCCAGCG 

GGGTCCNGGNCGCCCCGCGGGCCAAAGTCGAGCCCTNCCGCCNNTGGGCGAGCGCGCCAG 

CCGCXJCNTTTCANAACAGTTCXSNCGCCACAAAANAAAAGAACGGGGGGGTGCCGAGGTTN 

CCCATTGANCTCTTAAAGTGGTGCAGGTCCCTGTTGAGTGCGCTGCACCGGGCCGTGACC 

CGCGCCCCTGTGCGTCCC 

Sequence 1689 

GGAGTCGACCACGCGTCCGCGCCGGCCGCCGGTGTCCGGACCGCTCGCCCCX5GTTTGGAC 

CCGACTTCGGTTCTTCTGGGGTGTTGATGCTCCTAAAGCCCGAGAGCACGTGTCCAGACC 

CTAGCCTGTACGACGCTGACTCTGCCCGGTCCCAGAACCAAAGCCATGCCGGGGTGTGGC 

CTCTGACCCCACGCGGAGGGGACCTCGCCTTGCGGGACCCCACCTGGAACCCGACCTNCC 

AGNCTCGCAGCCGGCCTGAGCCGCCATGCGCGGGAAGTTGCTGCCGCTGGCCGGCCTATA 

CCTGGTGCAGGGCCTGCCCTACGGGCTCCAGTCCGGCCTCCTGCCAAATGCTGCTTGCGT 

GCCGGCGGCCTCTCGCTGACGCGCGTGGGGCTGGCCAAGGTTCTGTACGCTNCGTGGCTT 

GCTTCAAGCTGGCT 

Sequence 1690 

CNCCCCGCGTCCGCGGACGCGTGGGTGCTTGCTGCTGAACCTGAGCTGCAAGTTGGAATT

GATATAATGAAGACTAGTTTTCCAGGTCGTGGTTCAATTCCAGAATTCTTTCATATTATG 

AAAAGAAAGTTTACCAACAAAGAATGGGAAACAATCAGAAGCTTTAAGGATGAGTGGACT

CAGCTGGATATGTTTTATAGGAATTGGGCACTTAAGGAAAGCTTCATAAAAGCCATTGGT 

GTTGGACTAGGATTTGAATTGCAGCGGCTTGAATTTGATCTATCTCCATTAAACTTGGAT 

ATAGGCCAAGTTTATAAAGAAACACGTTTATTCCTGGATGGAGAGGAAGAAAAAGAATGG 

GCATTTGAGGAAAGCAAAATAGATGAGCACCATTTTGTTGCAGTTGCTCTTAGGAAACCC 

GATGGGATCTAGACATCAGGGGATGTTCCATCTCAGGATGATTCCAAACCAACCCAGAGG 

GCAATTTACTATTCTCAACTTTAATGATTTAA 

Sequence 1691 

GACCACGCGTCCGCCCGTCCAGGAGCCCTAGGAGTGCTACGGGGGGCCGGAGCCTTGCCC 

GGGCCGCTGCCCCGTCCCTGGATTCGGGGCTGGACGCAGCAAGCGGGGCGCTGTGTCCCC 

AAGCTCCCCGTCCTCGGCCAGGCGGGCACCACGGCAGGGGCTGAGCTACCCTCATGGAAG 

GGAGAGGACCGTACCGGATCTACGACCCTGGGGGCAGCGTGCCCTCAGGAGAGGCATCCG

CAGCTTTTGAGCGCCTAGTGAAGGAGAATTCCCGGCTGAAGGAAAAAATGCAAGGGATAA 

AGATGTTAGGGGAGCTTTTGGAAGAGTCCCAGATGGAAGCGACCAGGCTCCGGCAGAAGG 

CAGAGGAGCTAGTGAAGGACAACGAGCTGCTCCCACCACCTTCTCCCTCCTTGGGCTCCT 

TCGACCCCCTGGCTGAGCTCACAGGAAAGGACTCAAATGTCACAGCATCTTCCACAGCCC 
C 

Sequence 1692 

ACAGAATTTAGGGGTGGGTGAAAGCACTTGNGCTTTAGCTNTTTCATATTAAATATATAT 

CTATATTTAAACATTCATGGCATAGATGATGATTTACAGACAATTTAAAAGTTCAAGTCT 

GTACTGTTACAGTTTGAGAATTTGTAGTATTACATCATTACATAAGTCATTTTAGTAACA 
GCCTTTGTGAAATGAACTTGTTTACTATTGGAGATAACCACACTTAATNAAGAAGAGACA

GTGAAAGTACCATCATMTTMCCTAMTTTTTTGTTATAGCAGAGTTTTCTTGTTTAAA 

AAMAATAAMTCATCTNGAAAAGCATTTTGTACAGTTAAAATGTATAATGAAGCTnTG 

CCAACCAGACTGGTGCTTAGC AACCAA A 1 1 1 1 1 1 1 1 1 1 AAATAAAGCTTTTATGGCAGGT 

GGGTAMTMGGTGGCCTTCCAMTATATTGGTGTCTTGATGGAGAGTTNTTTAGTTGAA 

ATGAATGTGGGTCTTTTCT 

Sequence 1693 

CGGTTAACATGGCCGTCACCGACAGCCTCAGCCGGGCTGCGACTGTCTTGGCAACTGTNT 

TGCTCTTGTCCTTCGGCAGCGTGGCCGCTAGTCATATCGAGGATCAAGCANAACAATTCT 

TTATGAAGTGGCCNATCAAACAANCTGGGCCTGTTCTTGGTGTGTACATCCCCGATTCTG 

GTATTAATTATCGACATGTTGCAAATACCCTTTCTGTTTATAGAAGTGTCAAGAGGCTAG 

GTATTCCTGACAGTCACATNTGCCCCTAATGCTTGCAGATGATATGGCCTGTAATCCCTA 

GAAATCCCAAACCAGCTACAGTGTTTTAGTTCACAANCAATNTGGAACTAAATGGTGTAT 

GGGAGAATGATGTGGGAAGGTGNNATTATAGAAGTTTNTTGAGGTAAACNGGTGGGAGAA

NNNTTTTTTTACCNGGGTAATTTAANCTGNGGGAGGGANTCCCCACCCTAAGTANCTTCC 

TTCGGG 

Sequence 1694 

GTCCGCAAGATGGACGCAGCTCTCTGACCTACGACACTCTCCGGTTTGCTGAGTTTGAAG

ATTTTCCTGAGACCTCAGAGCCCGTTTGGATACTGGGTAGAAAATACAGCATTTTCACAG 

AAAAGGACGAGATCTTGTCTGATGTGGCATCTANACTTTTGNTTTACATACAGGAAAAAC 

TTTCCAGCCATTGGGGGGACAGGCCCCACCTCGGACACAGGCTGGGGCTGCATGCTGCGG 

TGTGGACAGATGATCTTTGCCCAAGCCCTGGTGTGCCGGCACCTAGGCCGAGATTGGAGG 

TGGACACAAAGGAAGAGGCAGCCAGACAGCTACTTCAGCGTCCTCAACGCATTCATCGAC 

AGGAAGGACAGTTACTACTCCATTCACCAGATAGCGCAAATGGGAGTTGGCGAAGGCAAG 

TCCATAGGGCCAGGTGGTACGG 

Sequence 1695 

CCCCGCGTCCGCTCGNAGCTGTCCGCGGTCTGTTTGGCCCGAACGGCGGCGGAGGCGCTG 

ATCATGGCGACATTCATCTCGGTGCAGCTGAAAAAGACCTCAGAGGTGGACCTGGCCAAG 

CCGCTGGTGAAGTTCATCCAGCAGACTTACCCAAGCX3GCGGGGGAAGAGCAGGCCCAGTA 

CTGCCGCGCGGCGGAGGAGCTCAGCAAGCTGCGCCGCGCCGCAGTCGGTCGTCCGCTGGA 

CAAGCACGAGGGCGCGCTCGAGACGCTCCTGAGATATTATGATCAGATTTGTTCTATTGA 

ACCCAAATTCCCATTTTCTGAAAATCAGATCTGCTTGACATTTACCTGGAAGGATGCTTT

CGATAAAGGTTCACTTnTGGAGGCTCTGTAAAACTGGCTCTTGCAAGCTTAGGATATGA 

AAAGAGCTGTGTTGTTGTTCAATTGTGCAGCCTTAGCTAGCCAAATTGCAGCAGAACAAG 

AACCTGGATAATGATGAAGGGATTGAAAATCGCT 

Sequence 1696 

TTCGGGAGTCGACCCCGCGTCCGGGCCAGCCGGCTCGCCCGGGGGCCATGGCAGCAGCGG 

CTACTGCAGCCGAGGGGGTCCCCAGTCGGGGGCCTCCCGGGGAAGTCATTCATCTGAATG 

TGGGAGGCAAGAGATTCAGTACCTCTCGCCAGACTCTCACCTGGATCCCAGACTCCTTCT 

TCTCCAGTCTTCTGAGCGGACGCATCTCGACGCTGAAAGATGAGACCGGAGCAATCTTCA 

TCGACAGGGACCCTACAGTCTTCGCCCCCATCCTCAACTTCCTGCGCACCAAAGAGTTGG 

ATCCCAGGGGTGTCCACGGTTCCAGCCTCCTCCATGAAGCCCAGTTCTATGGGCTCACTC 

CTCTGGTTCGTCGCCTGCAGCTTCGAGAGGAGTTGGATTCGATCTTCTTGTGGAAACGTC 

CTCTTCAATGGTTACCTGCCGCCACCAGTGTTCCCAGTGAAGCGGCGGAACCGGCACAGC 

CTAGTGGGGCCTCA 

Sequence 1697 

CGTCCGAAGGAAGGAAGGGACGGGCTGAGTTCCCCGACGAGAGACACACCCAGATTTTCC 

TGCAGCTTGGGGAGAGGTCCTCCCAGGAGCCTTGGTCCCTCCTGGCCTGCCGGAGTCCTT 

AGCCAGGATGGAGGCTGTTGTGAACTTGTACCAAGAGGTGATGAAGCACGCAGATCCCCG 

GATCCAGGGCTACCCTCTGATGGGGTCCCCCTTGCTAATGACCTCCATTCTCCTGACCTA 

CGTGTACTTCGTTCTCTCACTTGGGCCTCGCATCATGGCTAATCGGAAGCCCTTCCAGCT 
CCGTGGCTTCATGATTGTCTACAACTTCTCACTGGTGGCACTCTCCCTCTACATTGTCTA
TGAGTTCCTGATGTCGGGCTGGCTGAGCACCTATACCTGGCGCTGTGACCCTGTGGACTA 
TTCCAACAGCCCTGAGGCACTTAGGATGGTTCGGGTGGCCTGGCTCTTCCTCTTCTCCAA 
GTTC 

Sequence 1698 

CGCGTCCGGCCGCGCCCATGGCCCGCCGCTGCCCGGCCGCGCGGGCGGGCCCGCCACGCC 

GCTGTCGCCCACGCGCCTGTCGCGGCTGCAGGAGAAGGAGGAGCTGCGCGAGCTCAACGA 

CCGCCTGGCGCACTACATCGACCGCGTCCGCGCGCTGGAGCTGGAGAACGACCGGCTCCT 

GCTCAAGATCTCAGAGAAGGAGGAGGTGACCACGCGCGAGGTGAGTGGCATCAAGGCGCT 

GTACGAGTCGGAGCTGGCCGATGCCCGGAGAGTCCTGGATGAGACGGCTCGAGAGCGTGC 

CCGGCTGCAGATAGAGATTGGGAAGCTGAGGGCAGAGTTGGACGAGGTCAACAAGAGCGC 

CAAGAAGAGGGAGGGCGAGCTTACGGTGGCCCAGGGCCGTGTGAAGGACCTGGAGTCCCT 

GTTCCACCGGAGCCGAGGTGGAGCTGGCAGCTGCCCTCAGCGACAAGCGCGGCCTGGAGA 

Sequence 1699 

ACGCGTCCGGAAGAATCTACACTTCTTTGCACCAGAGTATGGAGAAGTCACTAATGTGAC 

AACAGCAGTGGACATCTACTCCTTTGGCATGTGTGCACTGGAGATGGCAGTGCTGGAGAT 

TCAGGGCAATGGAGAGTCCTCATATGTGCCACAGGAAGCCATCAGCAGTGCCATCCAGCT 

TCTAGAAGACCCATTACAGAGGGAGTTCATTCAAAAGTGCCTGCAGTCTGAGCCTGCTCG 

CAGACCAACAGCCAGAGAACTTCTGTTCCACCCAGCATTGTTTGAAGTGCCCTCGCTCAA 

ACTCCTTGCGGCCCACTGCATTGTGGGACACCAACAGATGATCCCAGAGAACGCTCTAGA 

GGAGATCACCAAAAACATGGATACTAGTGCCGTACTGGCTGAAATCCCCCAGGCCCTGAT 

CTGCGCTGTGGCTGTCCCTGGGACGTGCTGCAGCCCTCCTGTCCCTTCCCCCCAGTC 

Sequence 1700 

GGGAGTCGCCCCGCGTCCGGATTTCAGTTGGTGGCGTCATAGTCTCATTACAGTGTCTAT 

CTTGGCATTTACCMTTTACTTGTCATCTTTGTCCCMCTATTAGGGATATCTTTGGTTT 

TATTGGTGCATCTGCAGCTTCTATGTTGATTTTTATTCTTCCTTCTGCCTTCTATATCAA 

GTTGGTGAAGAAAGAACCTATGAAATCTGTACAAAAGATTGGGGCTTTGTTCTTCCTGTT 

AAGTGGTGTACTGGTGATGACCGGAAGCATGGCCTTGATTGTTTTGGATTGGGTACACAA 

TGCACCTGGAGGTGGCCATTAATTGGCACCACTCAAACTCAAACTCAGTCCATCTGATGC 

CAGTGTTGAGTAAACTCMCTACTATGAAATTTCACCTAATGTTTTCAGTTTCACTTCCT 

TTTGMGTGCAGATTCCTCGCTGGTTCTTCTGAGTGCAGMTMGTGAACTTTTTTGTTT 

TGGTTTGN 1 1 1 1 1 1 AAGAAAC 

Sequence 1701 

CCCACCGTCCGCGCGCGGCGCCTCGCCTCGGCCGGCGCCTAGCAGCCGACTTAGAACTGG 

TGCGGACCAGGGGAATCCGACTGATAAATTAAAACAAAGCATCGCGAAGGCCCGCGGCGG 

GTGNTGACGCGANTGCGATNTNCTGCCCANNGCNTCTTGAATGTCAAAGTTGAANAAANC 

CAATGAAGCGCGGGTAAACGGCGGGAAGTAACTAATGACTTCTCATTAAGGGTAGCCAAA 

NGCCCTTCGTCATCTNAATTAAGTTGGACCGCGCANTGAAATNGGATGAAACCNAGANTT 

CCCACNTGTCCCTACCTACNTAATCCAAGGCGGAAAACCACAAGCCAAAGGG 

Sequence 1702 

CGACCACGCGTCCGGACAGATTGATAGCTCTTTCTCGATTCCGTGGGTGGTGGTGCATGG 

CCGTTCTTAGTTGGTGGAGCGATTTGTCTGGTTAATTCCGATAACGAACGAGACTCTGGC 

ATGCTAACTAGTTACGCGACCCCGAGGTGCCTGACCAGTTCTACCGCCTGTGGCTATCCC 

TCTTCCTGCACGCCGGGATCTTGCACTGCCTGGTGTCCATCTGCTTCCAGATGACTGTCC 

TGCGGGACCTGGAGAAGCTGGCAGGCTGGCACCGCATAGCCATCATCTACCTGCTGAGTG 

GTGTCACCGGCAACCTGGCCAGTGCCATCTTCCTGCCATACCGAGCAGAGGTGGGTCCTG 

CTGGCTCCCAGTTCGGCATCCTGGCCTGCCTCTTCGTGGAGCTCTTCCAGAGCTGGCAGA 

TCCTGGCGCGGCCCTTGGCGTGCCTTCTTCAAGCTGCTGGCTTGTGGTGCTCTTCCTCTT 

CACCTTTGGGCTGCTGCCGTGGATTGACAACTTTGC 

Sequence 1703 
GATCGACTTCGCCTGACGGAATCCAGGGGTCGTAATATATGTAAACTCGCGTCCGNGCTG

CGTGCCCAGAGTAGTGGCCGAATACCTTAACGGGGCTGTGCGCGAGGAGAGCATCCACTG 

CAAGTCGGTCGAGGAGATCTTCGACGCTGGTGCAGAANCTGGCCGACCAGTCGGGCTTGG 

ACGTGATCCGCATTCGCAAGCCCTTCCACACANGACAACCACTAGCATTCAGGGCCAGTA 

GGCACCCCTTCACCAACAAGCTTGACCACGTGTCCCGCGGAACTANCGCACCCCCGAGNA 

GGGTTCMNGNATTCCTTGCACCCAGCCCAGATGCCCTGGCTTNTGGGGGGNCAAGTGAC 

CCTTTGTGNMCCCACTCATTTTTTATGGCMGGTGAGCATTNCCTAAAMCCCTTGAAA 

AATGANGGGAANAACTTTCMGGGGTTTTTTACAGGGCCCTTGG i Hi I I 1 1 A AAATCCC 

CAANATTTGGATAATAAATGGAATCCTCAAAAACACAAGTGGAGGAAGGNTCTTGAAAGG 

GC 

Sequence 1704 

TCGACCACGCGTCCGGCCGGAGAAACTTGAGCCGGCTGCCX)CGCCCACGGTGCCCGAAGC 

CCCAAAGGCTGGAATTAGGGGCTAGAAGTCTGGCACCCACCGCCTGGCCAGGTGTTCGGG 

ACGCGACCAGGTGGGCGGTCGCCCCGCCCCGGGAGCGCGGCTTAATAGCTGAGAGCCCGG 

GGGCCAGGCCGNGGCTGCGGCCCAGGCAACGCCCTGAGGGTGGCCACGCTGNCAGGTGTT 

CCACTCCCCCGGGACTATGGGCAAGGGCCCGGGGCGGGGAGGGCGGCAGGTGCTGACACT 

GGAGCTGCGCCGGAGGTCGGGGAACTCGGCCTCCTAAGACTGAGGACACTCGCCTGCTGG 

GCCGGNCGAGCTGTGCGGTGCCCTCCGGGACGCAGGGGGCGCTGCAGCCACGCTGGGTCA 

GGCTCCGAAGGGCCCTCCCAACCCGGGGA 

Sequence 1705 

CGCCACGCGTCCGGAAAGATGGAGGTGTGGGGACAGGAGCTGGGTGTGCTGGGGACTGGC 

CGCGGACCCCTAACCTGTGTCTCCGGTCTCCCTCCGGGAGCGGCTCAACCCAGCCCATCG 

CTCTGGCCCCGTTCTGGCCCTGCAGGGTGGTGGTTGGGACGTTGAAATGAGCGCGCGAGT 

GGTACGTCCTCTCTCCGCGCTCACGCCCCCCTCCTCACCGTGTTTCCCGCCAGGACCATC 

AGCACGTGCCCATCGACATCCAGACCAGCAAGCTGCTCGATTGGCTGGTGGACAGAAGGC 

ACTGCAGCCTGAAATGGCAGAGTCTGGTGCTGACGATCCGCGAGAAGATCAATGCTGCCA 

TCCAGGACATGCCAGAGAGCGAAGAGATCGCCCAGCTGCTGTCTGGGTCCTACATTCACT 

ACTTTCACTGCCTAAGAATCCTGGACCTTCTCAAAGGCACAGAGGCCTTCCACGAAGAAT 

ATTTTTGGC 

Sequence 1706 

TCGCCNCGCGTCCGCTGAAGCAAGAGAATCACTTGAACCCAGGAGGTGGAGGTTGCGTGA 

GCTAAGATCGCGCCACTGCACTCCAGCCTGGGCGACAAGAGTGAAACTCCGTCTTAAAAA 

AGCCCATGGCAGGCTGGGCGCGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCA 

AGGTGGGCGGATCACAAGGTCAGGAGATCGAGACCATCCTGGCGAACACGGTGAAACCCC 

GTCTCTACTAAAAAAAATACAAAAAAATTAGCCAGGCGTGGTCGTGGGCACX^TGTAGTCC 

CAGCTACTCAGGGGGCTGAGGCAGGAGAATGGCGTGAATCCGGGAGGCGGAGCTTGCAGG 

GAGCCGAGATAGTGTCACTGCACTCCAGCCTGGGTGACAGAGCGAGACTCCGTCTCAAAA 

MCAAAMGCCCGTGGCAATTAATGGTAAAGGAAACCCGGCTTTTAGTGTAAAGAGGTAA 

CATAA 

Sequence 1707 

GCGTCCGGCCTCCAGCAAAGCCCATTCAGTCAGCTCTGCAGGCTCATCTTACAAGAATAA 

TCCCTTTGCCAGCTCMTCTCCAAACATGGGGTTTCTTCTGGCAGCTCTTCCTCGGGAGG 

AACACCAGTCCAGAGTTCTGTTTCTGGGAGCCTGGTCCCTGGCATACAGCCTCCCTCCGT 

GGGACAGGCCACCAGCCGACCCGTCCCAAGTTCAGCAGGGAAAAAAATGCCTGTTTCCCA 

GAAGTTGACTCTGGTAGCCCCTCCAGGCGGTCCAAACGGAGATTCCAGTGGTGGGACCCA 

GGGGAGTGGCAAAGTTGCTGACCTCGCCGTCCCTAAAGCCCTCTGCAGTTAGTAGTGTGA 

CATCGTCTACCTCCTTGTCAAAAGGAGGCGAGTGGGGACTGTGCTGCTGGCCGGCTCCTC 

TTTGATGGCTTTACCCTACAAATCCAGCAGCCCAAAGCTGTCTGGGGCCATGAGCTCGAA 

CTTCTTGGGAAATTATAC 

Sequence 1708 

CACGCGTCCGGGAAGCGGTGCGCTCCGTCACCACGGGAGTGCGGGAATCCGCCGTTTGCG 
CTGAGGCAATGGCGGCAGCTGCGCCGGTGGCCGCGGACGACGATGAGCGGCGGCGGCGGC

CGGGGGCTGCACTGGAGGACTCCCGGTCCCAGGAAGGGGCAAATGGTGAGGCCGAGTCAG 

GTGAGCTCAGCCGGCTTCGGGCTGAGCTGGCAGGCGCCCTGGCAGAAATGGAAACX^ATGA 

AGGCTGTGGCAGAGGTGAGCGAGAGCACGAAGGCCGAGGCTGTGGCTGCGGTGCAGCGGC 

AGTGCCAAGAGGAGGTGGCCTCGCTGCAGGCCATCCTGAAAGACTCCATCAGCAGCTATG 

AAGCCCAGATCACCGCCCTGAAGCAGGAGCGACAGCAGCAGCAGCAGGGACTGTGAGGAG 

AAGGAGCGGGAGCTGGGCCGCCTGAAGCAGCTGCTGTCCCGGGCCTACCCCCTGGA 

Sequence 1709 

CACGCGTCCGCGGACGCGTGGGTCCGCGGCGGCGTCCGGGGTCTCCAGTAGGGCTGACGC 

TCCGGTGCTCGCACAATCCCCCGCCTCGGCTGGCAACGGGCGTCCCTCCACTCCCCGAGT 

CCCCGGCAGCCGCCGCCACCCCAGCGCGCCCCGATCTGGCCCCCTGCCCCGCGAAGATGG 

CTGCCGTACGCCGGGCCCGCAGTTATTGCCGCTGCCTGGTGCGCTTCTCCGACCGAGAAC 

TCTGCTAAGCTCCGCTGCAGAGACAGGCAGGAGTAGACACCXJGGACACCCAGCACCCCTC 

CTTCGGGGGGCGGTGCAGAGGGGGCACGGAGAGCCCCTCGAGCGCAGCAGGCCGCCCCGC 

CAGCATGGCAGAAGCTGAGGAAGATTGTCATTCTGATACTGTCAGAGCAGATGATGATGA 

AGAAAATGAA 

Sequence 1710 

ACGCGTCCGGCGAGTGGCCTTCCCGGTTGGCGCGCGCCCGGGGCGGCGGCGCTGGAGGAG 

CTCGAGACGGAGCCTAGTTATGTCTGGGAGGCGAACGCGGTCCGGAGGAGCCGCTCAGCG 

CTCCGGGCCAAGGGCCCCATCTCCTACTAAGCCTCTGCGGAGGTCCCAGCGGAAATCAGG 

CTCTGAACTCCCGAGCATCCTCCCTGAAATCTGGCCGAAGACACCCAGTGCGGCTGCAGT 

CAGAAAGCCCATCGTCTTAAAGAGGATCGTGGCCCATGCTGTAGAGGTCCCAGCTGTCCA 

ATCACCTCGCAGGAGCCCTAGGATTTCCTTTTTCTTGGAGAAAGAAAACGAGCCCCCTGG 

CAGGGAGCTTACTAAGGAGGACCTTTTCAAGACACACAGCGTCCCTGCCACCCCCACCAG 

CACTCCTGTGCCGAACCCTGAGGCCGAGTCCAGCTCCAAGGAAGGAGAGCTGGACGCCAG 

AGACTTGGAAATGTCTAAGAAAGTCAGGCGTTCCTACAGCCGGCTGGAG 

Sequence 1711 

CNCGCGTCCGAAGGCACAGGCGTCTTGCTCTGTTGAAGCAAGTCAGTATCCGAGAAAACT 

GCTGTTCCCTTTGTTGTGATGAGGTAGCAGACACACAATTGAAGCCATGTGGACACAGTG 

ACCTGTGCATGGATTGTGCCTTGCAGCTGGAGACCTGCCCATTGTGTCGTAAAGAAATAG 

TATCTAGAATCAGACAGATTTCTCATATTTCATGACACATGTGAAGAGGCATCGTGGACT

mi I CTACTCAATTCCAGCCAATGTTGAAAAGAAAAAGAAAAAAAAAACTCTAATCAGT 

TGTACACACATTGAMCTTATAGCCATGGCCAGATTTTATGCTAAAAATGGTAGTTTGTC 

MAGACAAAATTCTCTTAGMTCTMTCCMCTTGCCAGCCCTGAGAAMTCCCTTTTAA 

GGCCAAGGGAAAGCTGAATGCTAGCAGCCAGGCCTGTGGTACTTCCATGAGAAACCATAG 

CAGGACAATGCCCTC 

Sequence 1712 

CCACCGTCCGGGCGGCCAGAGGTGCGAGAAGGCCGAGGAGAAGGCCAAGGAGATTGCGAA 

GATGGCAGAGATGCTGGTGGAGCTGGTCCGGCGGATAGAGAAGAGCGAGTCGTCGTGAGC 

GCGGTCGGCGGTTTCCAGCCAATGGATTCTGGTCAACTGGTGGAGATTGGCTGACACCCT 

GGAGMGCCGAAACCAGAGAGCCTTTTGTTTTCTC I I I 1 1 I CCTGTCTATGCTCTGTCTC 

ACTTAACACTACGTTTTCTGCTATGGTCTGTGGTTGATGACCTCMTATGAGTTTCGATT

GTTAACCGTGTTTTTGTTTGGGMGTMTTTTGTTTGAAAATGCTCTCACATACAGGAAT 

TAGGGCC TAGAT TGTMGCTCTTGCAGCAGTCACATTTGTTCCCXjGGCTTTGGTGGTTAT 

TTTCTAMTTTTTGAGGTGCCTTTGCTATTTCTTGTGTGACCTGATAGCTTCCCCTG 

Sequence 1713 

GCGTCCGAGCCTCTGGGGGTGGATCCTGAAAGGTGGTCCAGCCGCCTGGCCCTGCGTGGG 

ACCCTCCACCTGGCAGCAGGGTCTCGCTCTGTCACACAGGCTGGAGTGCAGTGGTGTGAT 

CTTGGCTCATCGTAACCTCCACCTCCCGGGTTCAAGTGATTCTCATGCCTCAGCCTCCCG 

AGTAGCTGGGATTACAGGTGGTGACTTCCAAGAGTGACTCXJGTCGGAGGAAAATGACTCC 

CCAGTCGCTGCTGCAGACGACACTGTTCCTGCTGAGTCTGCTCTTCCTGGTCCAAGGTGC 





CCACGGCAGGGGCCACAGGGAAGACTTTCGCTTCTGCAGCCAGCGGAACCAGACACACAG 

GAGCAGCCTCXJACTACAAACCCACACCAGACCTGCGCATCTNCATCGAGAACTCCGAAGA 

GGCCCTCACAGTCCATGCCCCTTTTCCCTGCAGCCCACCCTGCTTCCCGATCCTTCCCTG 

Sequence 1714 

GTCGCCACGCGTCCGCAGAAGATTGACAAATCTGAGGGCCGCTTCCATGTCCAGAACCTT 

AGCCAGGTGGAGCAGGATGGGCGGACGGGGCATGGACTCCGCAGATCTTCCAAGTTCTGC 

TTGMGGAGCACAMGCCCTCAAGACGTTAGGCATCATCATGGGCACTTTCACCCTCTGC 

TGGCTGCCCTTCTTCATCGTTAACATTGTGCATGTGATCCAGGATAACCTCATCCGTAAG 

GMGTTTACATCCTCCTAAATTGGATAGGCTATGTCAATTCTGGTTTCAATCCCCTTATC 

TACTGCCGGAGCCCAGATTTCAGGATTGCCTTCCAGGAGCTTCTGTGCCTGCGCAGGTCT 

TCTTTGAAGGCCTATGGCAATGGCTACTCCAGCAACGGCAACACAGGGGAGCAGAGTGGA 

TATCACGTGGAACAGGAGAAAGAAAATAAAC 

Sequence 1715 

CCCCCGTCCGCTTTTGTTNATCTAMGGCTTNAGTCCCATTTTT^ 

AAAMCGTTTGAMGGAGTCTTACACCTGTATCATGAAMCTGMTCCTTTTGAAATACX) 

ACTATATGMGAGAGAGATGAAATTTAGTGAACAGAATTGGAAAAGGTGCTCATAATTTC 

ACTATGCAMCTTACCCCAGTCTCTAAAAMGTMTTTAGATTTAMGTTCTTTGATGTA 

TTTGATTTTCTAAATCTTTATGGTTATGATTTGGAATAAMTGTGCCT 

CATTCTGTTCTTAAATCTGAATGCCTTCTCATTTAATTCTGAGGAAATATCACACAAGTG 

TCTTCAT TGAC CTTGMGAMTGTATATACAGTTGCCTTATAAAACAACATAAATTTAGA 

CCATMCTTTTATAGAGAAAGGGTTTTGTCAAATGTTTTCTGAAAATCTGAGTAATTCAA 

AGCATGCCTCTGCCCCCTTTAATA 

Sequence 1716 

NGTCGCCACGCGTCCGGCGCTCTCGGCCGCCGCCGCCTCTGCGTGGGCCGGCCGGGAGGG 

CCTCGGGGGACTGACTGACAGAGTTTCACTCCTGTTACCCAGGCTGGAGGACAATGATGT 

GATCTCGGGTCACCACAACCTCCXaCCTCCCGGATTCAAGCGATTCTCATGCCTCAGCCTC 

CCGAGTAGCTCAGATTACAGGCATGTGCCACCACGCCCGGCTMTTTTGTATTTTCAGTC 

GAGACGGGGTTTCCCCATGTTGGTCAGGCTAGTCTTGAATTCCCGACCTCAGGTGATCTG 

TTCGCCTCGGCCTCCCAAAGTGCTGGGATTACAAGGCGTGAACCACTGCACCCGGCGAGG

CATTTTTTACTGTCTACAGAAACTTATTGTMTTCATTTTTCCTCACTCGAAGTAGTM^ 

AATTATACCAAATTGAAAAGATATGAATGAGTATCCTAAAAAAAGAAAAAGGGA 

Sequence 1717 

CCGAGGCNCTGATAAGCCNTGGTAACGGGAAACACAGCTCTAACCTCACCTCATTCTCCA 

GGTTACAMGGCCATGTGCCCCTTTGAATCTGGCAGAGAAAGTTTCCTCGTTGTAAGTAT 

TTGCATCTACTTCAAGCCAGATTCTTCTGCCTCTTTCTCCTTTCCAGACCCCTACTCTGT 

GCAGTGCTGACCACAGCTAGAGCCACCGCCCCATTGCTCMCCAGTATTTATTTCCCTAA 

ACGACCCTTCCTCATATTCCCTTCCCTCCACCTCTCCTTACCAAGCACCCAAAAGAGGAT 

TTAGMCTAGCAGGGTGGACATCATTCTGGTTGTTTCTACTTTTCTCTGCCTAGCACAAA 

ATTGGGAGAAAACTGGAGCCTCCATCCGCAGTCACACGTGTACAGATCTGGGNGATTTGG 

ATGTAGGCTTTTTCTAACTTCTCTCTCAGAAGCTTCTACA 

Sequence 1718 

CGGACGCGTGGGTGCCGCCGCCGCCGTCGCTGTCGTAGTCGCCGCCGCCGCTGCCGGAGA 

AAGAGCACGAGCGGGGAAGCCCCAGAGTGAAATCTAGCATCCTGCCGGCTGGTCTGCCCG 

CCCCTCCTTCCTTTTCCCCCCGGCCCCNGTCCCCTCCCNCCGCAGGTGCCATCCGTCGCC 

ATNCGCCTCTCTACCCTCNCATCCCCAGGTGAGGGGGGTGAGTTCAGGAAGCGGNNACCC 

CNAGGAACCCANCAGGGTCACCATTTGCAGCGCAACATGGCAGGAGCTGGAGGAGGGAAT

GATATTCAGTGGCGTTTTTTTCAGGTGAAAGGAGCAGTATGATGATGATGTAGCAGNAAG 

CANGATATNATTTCTACAGTANAATTTMTCNTTTCTGGGAGAAATTCTAGCAACANGAA 

GATAAAAGGTGGGTAGAGGTTGTCATTCTTTCAACAGGGAGCAGGGAGAAC 

Sequence 1719 
TCCGGCCGGCCAGCGTCATCACCATCGTCAAGTCCACCCCGGGCTCGGGGTCTGGCCCCG

CCCACGGNCCGGACCCCGNCCACGGCCCGGCACNGCNGCTCCTAGTGCCCGCCCCCGCCA 

TGCCTCANCCACGCCCCACCGGCCATAGNTNCCCTCCCCCAAAGNGCNCTGAGGCGCATG 

NCTTCNTGNGACNCAANGNTCCCATTNTCTATTGCAGGCAACATGGCCATTCCCCGAAAC 

TAAAAAGCAGTTGGGGNGGGNGAAAGTACNANGTGGAAAACCCAGNNATCACCGGNANTG 

GNGGGAAAACMNGGCCNGNAGGGACACATTTCCAACTTTAAGCTGGGCNAGTGNNTGGG 

GAACCAAAAAACCTGGGGNGGNCCCNTCANTGGCANGGCCCNTGAGNNCNCAGCNCATGN 

GCATTTNCAGGGGNNGAACCAGNGGGACAAGGGGGACCTCAANAAAAATGNTGGNTGGGG 

ANGGGCCCANTCNCTGGGCNNGGGGGAAGAACCCGNATTCTACNAGCAAAAGGGGANGGG 

GGGGACNCAGCAAAGGCAATCANGGGAAGAAGGAAGATAAGGGTCNACCANCGGGAATNG 

GCAAAGAAANGGGGGAA 

Sequence 1720 

CTGANGCTCGTTTTCGTGAAATTAAGCTTCAGAGGGAAGCCCGNGAAACACAGGAGAGCG 

AGCGCAAGCCCCCACCATACAAGCACATCAAGGTGAATAAGCCTTACGGGAAAGTCCAGA 

TCTACACAGCGGATATTTCAGAANTCCCTNAGTGCAACTGCAAGCCCACAGATGAGAATC 

CTTGTGGCTTTGATTOSGAGTGTCTGMCAGGATGCTGATGTTTGAGTGCCACCCGCANG 

TGTGTCCCGCGGGCGAGTTCTGCCAGAACCAGTGCTTCACCAAGCGCCAGTNCCCAGAGA 

CCAAGATCATCAAGACAGATGGCAAAGGGTGGGGC 

Sequence 1721 

CATGGCTTCTGCGAGAAAAGTGATTTAGGCAGACGGAGGTTTTTTCTCAATCAGAGGCTT 
TCAGTAACTCTGCTGATGCACAGAGAAGAGACTTCCTCAGCCTGCAGGCTACAAGAGCCA 
ACTGTTAGTGCAAAAAAGGACTTTMTACAAATTTCTTATTCCAGAAATTTTGTTCCAGG 
TCTGGACAAGCTGAGAMTTTATCATTGTTmCGAGTTTTTMG 

GAGAGGTATGGGTGTGTGTGCAAGGCACACACATACAGTCTTTCTGTACATGCATGCATA 

TTTATGCATGTACAGGGMGTATCCAGACACCMATTTTTAATAAAATGAATTCCCCAAA 

GGGGAGTCTTGAC CTGM TTAAGGCTGTTGTTTATAGGGAAGCCAGATATAATTGATGNT 

GMAMNAAACTAATTTTTATACTTAATCACCGGCAGNTANCGGGGGCANGGGGGAAAAA 

GTACAG ANGGGGTGT A I H I I IG G I U I I I I CT 

Sequence 1722 

TCGCCACGCGTCCGCTCTTAACACAGAGTCTGCAGCCCCTAACTGACACCCTGTCCTTCC 

TCCTAGGAAGTGCTGGACTCCCTGGTCAGCAATGTCAACATTGAGCTGCTCAATGCCCTC 

CGCTACCATATGGTGGGCAGGCGAGTCCTGACTGATGAGCTGAAACACGGCATGACCCTC 

ACCTCTATGTACCAGAATTCCAACATCCAGATCCACCACTATCCTAATGGGATTGTAACT 

GTGAACTGTGCCCGGCTGCTGAAAGCCGACCACCATGCAACCAA 

Sequence 1723 

ATCCGGTTCGCCCCNCGTCCGGGCGGCCGAGGCGGAGGCAGCGGCGGCGGGATGGCGGAC 

GCCAACAAGGCCGAGGTGCCCGGGGCCACTGGTGGCGACAGCCCGCACCTGCAGCCCGCA 

GAGCCGCCGGGCGAGCCGCGGCGAGAGCCGCACCCCGCGGAGGCGGAGAAGCAGCAGCCG 

CAGCACAGCAGCAGCTCCAATGGCGTTAAAATGGAGAATGATGAATCAGCAAAAGAAGAG 

AAATCTGACTTAAAGGAAAAATCTACAGGAAGTAAGAAGGCCAATAGATTTCATCCTTAT 

TCAAAAGACAAGAAT 

Sequence 1724 

GTCGCCNCGCGTCCGTGCTTTTTTTC 

TTCTTCTATTTCTTGTGGATATTATGGCTAATAACACAACAAGTTTAGGGAGTCCATGGC

CAGAMACTTTTGGGAGGACCTTATCATGTCCTTCACTGTATCCATGGCAATCGGGCTGG 

TACTTGGAGGATTTATTTGGGCTGTGTTCATTTGTCTGTCTCGAAGAAGAAGAGCCAGTG 

CTCCCATCTCACAGTGGAGTTCAAGCAGGAGATCTAGGTCTTCTTACACCCACGGCCTCA

ACAGAACTGGATTTTACCGCCACAGTGGCTGTGAACGTCGAAGCAACCTCAGCCTGGCCA 

GTCTCACCTTCCAGCGACAAGCTTCCCTGGAACAAGCAAATTCCTTTCCAAGAAAATCAA 

GTTTCAGAGCTTCTACTTTCCATCCC 

Sequence 1725 
MTNTGCCAGCCTTTATTGAGCTTTACMCTMCMCCTGATAGTTGGCAGTTAATTCAC

AGTTACAGATAATGCTTTTATTTACATAMTATACCMGTAGTACCCTCTTATTGTATTC

ACTTCATCTATTTTCTTAGAATACTTGCMTTTCTAATGACCCCTTCCCTTTCCCTCCTG 

CTGCCCTGTCCACCCTCTTTCCCCTTCTAACATCCTTAGAGGGATGAAATCTCAGCATAT 

GTTGCAGGACACCAAAAGGAAGAAAACAATCAAGCAAATAAAATAAACAGTCAAACAAAC 

CAGGAGTTTAAAACAACAACCCCAACAACAGAAGCCTTGGCAAAGAGGAATGAGTGATCA 

GCAAGTGAACACACTCTATGTCAACTCTCCTTTTTATCCAGCTGAGATTTTATGGTAACC 

TTTAATTTAA 

Sequence 1726 

CCNCGCGTCCGGAGCCGAGAGTGTGTGGAGCAGTTACAGCTGGAAGACCGGGTCCTCTGC 

CTCCACAGTAGATGGCTGAATCCTCTATGCGGGACTGGCAAATGGCACTGTGGTCACCTT 

CMCATAMGAACMCAMCGACTTGAGATCTTTGAATGCX^ATGGCCCTCGGGCAGTCAG 

CTGTCTTGCTACAGCTCAGGAAGGTGCCCGAAAACTGCTGGTCGTGGGGTCTTATGACTG 

CACAATTAGTGTACGCGATGCCCGGAATGGACTGCTCCTCAGAACTCTGGAGGGCCATAG 

CAAMCCATTCTTTGCATGAAGGTGGTGAATGATCTCGTGTTCAGTGGCTCCAGTGATCA 

GTCAGTCCATGCTCACAACATTCACACTGGTGAGCTCGTGCGGATCTATAAAGGTCACAA 

TCATGCAGTGACTGGTGGTGAATATCCTAGGAAAAGTGATGGTGACTGCTTGCCTGG 

Sequence 1727 

CNCGCGTCCGGATNAATATTTTCATCCCTGAGGTTAACMTTACCATCAAMTGTTTTGT 

GGAGACTATGTGCAAGGAACCATCTTCCCAGCTCCCAATTTCAATCCCATAATGGATGCC 

CAAATGCTAGGAGGAGCACTCCAAGGATTTGACTGTGACAAAGACATGCTGATCAACATT 

CTGACTCAGCGCTGCAATGCACAAAGGATGATGATTGCAGAGGCATACCAGAGCATGTAT 

GGCCGGGACCTGATTGGGGATATGAGGGAGCAGCTTTCGGATCACTTCAAAGATGTGATG 

GCTGGCCTCATGTACCCACCACCACTGTATGATGCTCATGAGCTCTGGCATGCCATGAAG 

GGAGTAGGCACTGATGAGAATTGCCTCATTGAAATACTAGCTTCAAGAACAAATGGAGAA 

ATTTTCCAG 

Sequence 1728 

TCGACCACGCGTCCGATCCTGGATCTGGAGAGAGAGCTCTCCAAGCAAATCAACGTGTGC 

CTCTGAGCCAGATGACGGGGTGGGACCCCGGTTAGTAAGGACCGGGCGCCCAGTGGCTAA 

GGCGGTGCCCTGGTGACCAAGGAGAGCCAGACCTGTTGCTCAGGCCGAGCTCCTGGTTGC 

CAGCGAGTTACCACGGGACCAGTCGCGTGTATGGCTGAGACTCATTCCCAGTTTCCAGGG 

CCCGGTATTTGGACACTAGTTGCCMGTCTGGGGCCTGGGGATTTTAGGGACCAGCGGTT 

GTGACCATCTTTCCTGAGCACCMGGGCTTCCCCTTTTGTTGCCAAAAAGGTAGTTCTCG 

CGCTTGCTAGGCTGGCCTCTCTTGCCTCCCCTTGGCCGGGGC 

Sequence 1729 

TCCGAMCACCTGTCATTTTACACAMTGCGTTTTGAATGTCTGAAAGACAGCTCCTGCC 

CTTMTTTAGATGTAAACCATTTAGTTTCAAACTAACCACCTGATAAAATCTATCAACAT 

TTTATCATGAACTAGAGCAGATGTCTGTTATTTGATGTCTATGTTATTTGAGT7TACTGT 

TTMTMGTGAATTCATATCMTTAATCCTGCTMCAMTTTGACACTTAAGGTGATTCT 

GMMTCCTTTAMCTTAMGTAGATGGMTCTTMGTATGGGGCCTTTTAGTGTCCGTA 

AAGAAAAACTGCATGCAACAAAATATAGCAGGTCCTCACTTGTTGAGATTCATGGAAATT 

GTGACTTTAAATGAAATGACATGGCTGGGCATGGTGGCTCACACCTGTAATCAGCACTTT 

GGGAGGCCACGGCGGGTGGATCACGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGG 

T 

Sequence 1730 

CTGNGAGTTAMTTGGTCCAGAAACAGTTATGACCCTCTTTATACTGCCAAGAAATACGC 

AGTCCCAGCCTTGGAAGCACACTGTGTAGAATTTCTCACCAAACATCTTAGGGCAGATAA 

TGCCTTTATGTTACTTACTCAGGCTCGATTATTTGATGMCCTCAGCTTGCTAGTCTTTG 

TCTAGATACAATAGACAAAAGCACMTGGATGCAATAAGTGCAGAAGGGTTTACTGATAT 

TGATATAGATACACTCTGTGCAGTTTTAGAGAGAGACACACTCAGTATTCGAGAAAGTCG

ACTTTTTGGAGCTGTTGTACGCTGGGCAGMGCAGAATGTCAGAGACAACAATTACCTGT
GACTTTTGGGMTAMCAAAAAGTTCTAGGAAMGCACTTTCCTTAATCCCGGTTCCCAC

TGATGACAATTGAGG 

Sequence 1731 

ACCCCGCGTCCGAGCAGCCTCCAGTTGCCCCTACTTAGTGGCTTGCCCTCTGCCTGCCTC 

AGCTGCTGCCTGACCGGCTGGGGGAGGCACTGGCGGGAGGCCTCXaGGCTCCCCTGGAAGG 

GCGCTGGGCTGGCGGGTCAGCTGGTGGTTCTTAGGTTTCCTTCTGTTTGTTAAAAGGGAC 

MTGTGGCCACTTCTCTGTGGAAAGGGAGTTGGTTGGGGGGTTGAGATGGCCCGTGTTCA 

TMCTCAGTTTCCTGTTTTGCACGATGTAAAMCCCTGTCTTTTTGCACGATACAGCCAA 

MGTATTGGCTGATTTCTTGCTGAGTGCCCTCTTAGTTGGTGTGTGAGGTCTTGGTGGGC 

TCAGGCCAGCTGTTTGCGAGTGTGGGMCTCATAGGTTCTGTCTTTGTCTCTTCCTTTCA 

CCTCATTCTGGTAGCAGCATAAAGGTTAGGCAATCACTGGGACCC 

Sequence 1732 

GCAAATCATACATTGCATTCCCCAAAGCATCTGAACGTACTTCTAGAAAACAAACCAACC

AAAAGGGAAMTATGCATGCTTTTTGTMTTAACTGGTGCTTTGAAAATCTTTTT^ 

GAGAAAAATCTCMCCAMGTTATGCTCATCCAGACAAGCTGACCTTTGAGTTAATTTCA 

GCACAACTCATTCTTCAGTGCCTCATGACTGAAAACAAAAAACAAAAAAAGAAAGCATCT 

TCNCMTGMGCTTCCANATAGCACCGTTTTGCTAAMGATACATTCTCATTGTTTTCCA 

ACAGNGATGGCTTC CACATM NGGTTAAACAMCTGGGNGCTTGGAAATMTTTNTNACN 

GGTTACTTNTTCGCATTTTTTNGAACNMGGAAANGGGATTCCCTTT^ 

GAATTGNGGNCAAGTTTAAAAAAAAAAAAAAAAAAAAAAAAA 

Sequence 1733 

CGCGTCCGAATTTAGGAAGACCCCGGCGACCTGTTCCTCACCCCCGCTTCGCCCTCACAC 

TTTCGGGATGTCTGCGATTCCTGCTGAGGAGAGCGACCAGCTGCTGATCCGACCCCTTGG 

AGCTGGGCAAGAAGTAGGAAGATCATGTATTATTCTCGAGTTCAAAGGAAGAAAAATAAT 

GCTCGACTGTGGGATCCACCCTGGCCTAGAAGGMTGGATGCTCrrTCCriTATATTGATTT 

MTTGGACCCACCTTGGMTTGNACCTCCNANTM^NANNNCATTTTCATTTTGGAANC 

^fTGGGGGGGC NTNNCCCCNNGGTTTTTCNNAAAAANNCCCCTTNAAANGNNAAAMATTT 

TTTTNTTTTTTNTCCCAN AAAAAA I II I I I 1 1 NAAAANNCCCNCTNNTNTNAGGGGNTNT 

TNAAAAAATTNCCCAAAAAMNNNNCCCCCCTTTNTNTNTTTTNAAAAAAAAAAAAAAAA 

AAAANATNGNCC CCNNAA I I I M I I I I I I'ATATTTTTNANNAAMNAAANNTNTTTTTTN 

AAAAAANTNTATN I III II I I CNNTNNANNNNGTGNNNNCCCNNN I I I I I 1 1 1 HI I 

Sequence 1734 

CCACGCGTCCGCTCCCGCCAGGCGCTTTCTCGGACGCCTTGCCCAGCGGGCCGCCCGACC 

CCCTGCACCATGGACCCCGCTCGCCCCCTGGGGCTGTCGATTCTGCTGCTTTTCCTGACG 

GAGGCTGCACTGGGCGATGCTGCTCAGGAGCCAACAGGAAATAACGCGGAGATCTGTCTC 

CTGCCCCTAGACTACGGACCCTGCCGGGCCCTACTTCTCCGTTACTACTACGACAGGTAC 

ACGCAGAGCTGCCGCCAGTTCCTGTACGGGGGCTGCGAGGGCMCGCCMCMTTTTTAC 

ACCTTGGGMGGTTTGCNACNATTCTTTGTTTGGANGANTAAAAAMGGTTCCCAAAATT 

TTCCCNC TTTNAAAAGANNNTNGNGACCNNNCACTNGGGGGGGGGCCCCAAAAAAAAATT 

TTTTTTTNTTAANNCCCCCCCGGGGNNGNAAAAAN I I 1 1 1 I 1 1 GGGGGGGGNCCCCCCCC 

NNGGGNNAAAANNNTTTTANAAAAA A I I I I I I I I I I I NTTTTTCCCCAAAAAAAMTTTT 

T 

Sequence 1735 

GCGTCCGAAAATACAATACACGGACTCTTTCGAGGCCCTGTAATTGGAATGAGTCCACTT 
TAAATCCTTTAACGAGGATCCATTGGAGGGCACTTCCAGAATACCTCCTCCCCCAGCGCC 
CGCTGCCAGCCCCACACCAGGTGTGAGAACCAAGGTCTGGTGGAGGCAGCTCCAGGCACT 
GCCCAGTCCGACACAACCTGCAAAAATCCATTAGAGCCACTGCCCCCAGAGATG 
Sequence 1736 

CGAACCTCCTGGTTCCAAGTGGGAGACATGGTGCTGTCGGAGCTAGCGGCGCGCCTCAAC 
TGCGCCGAGTACAAGAACTGGGTGAAGGCGGGCCACTGCCTGCTACTGCTTGCGCAGCTG 
NCTGCAGGGTTTCGTCGNCCGCNAGGTGCTCTCCTTTCCACCCGCGGCX^TACTCGCCGCA 
GCCCCCGGC
Sequence 1737 

ATCCTTTTGCCTAAAGATGTAAACAAAACTCAAGACAGAAGGAATCAGGGAATATGTGCT 
NTTGTGNGC ATCTT GTTTACATTTNGGGATCAANTGATGGCAAAAGAAGTAATGAGACCA 
CTNN AAAT TGTTTTNCANTTGNNTTTAAAANACCAGG 

NMGTTTTMCMTTGACCTTCTTMGNGACATTCTCTTTCAAAAAAGANANGTAAANCA 

GGNGAAATGAAGGGTGGNTGGGGAAA 

Sequence 1738 

CCGGCTCATTCCCTGAGGCCGGCCCGCGTCCCGTCAGGGCGCCGCGCGGGGTTAGCGCGG 

GGTCAGCGGAGGTCAGCGGGGGTCAGCAGCAGCGGCTCCGAGGGCGCGGCGGACGCAGGA 

TGTACACGCTGCTGTCGGGCTTGTACAAGTACATGTTTCAGAAGGACGAGTACTGCATCC 

TGATCCTGGGCCTGGACAATGCTGGGAAGACGACCTTCCTGGAGCAGTCGAAAACCCGAT 

TT 

Sequence 1739 

GTCCG NCCAC XaCGTCCGATTTCTTGTGTGCTTTGAAAAAAGTTTCAGCTTGCTGTCTC^ 
TTAGTGTTTTAMGMGTGTTATACAMGCATTGTTTGCAAAATATAGGGAGGATAATGG 
GAGTCC CACTT TMN TTNGG GMNT CNTT GGGNGANCTTNTTNATCCAAGGTTTANTCAA 
GCCT TCNTTT TCCAACTTTTAAAAMTTTTn'GTTA 

AAAAAl l l 1 1 AAAMTATTTTANTGTTCTrGCNACCMATTTGTTCTTN AAAAAATAATA 

AAAACTTGNTGGCAAATTCNTTTGGTCNNTTTAAAAAAAAAAAAAA 

Sequence 1740 

ACGCGTCCGCAGCCATCTTGGGATCTGGGCAAGTGAGCGAGCTCCTTCCTCACCGGGCTG 

ACTAGCCTCTCCTTTCCCTGTCCCCCTCCATCGCTGCTCTGCAGGAAGCCAGCCCCCAGG 

GCCAGTCCCGGAGGGGCTGATCCGCATCTACAGCATGAGGTTCTGCCCCTATTCTCACAG 

GACCCGCCTCGTCCTGAAGGCCAAAGACATCAGACATGAAGTGGTCAACATTAACCTGAG 

AAACAAGCCTGAATGGTACTATACAAAGCACCCTTTTGGCCACATTCCTGTCCTGGAGAC 

CAGCCAATGTCAACTGATCTATGAATCTGTTATTGCTTGTGAGTACCTGGATGATGCTTA 

TCCAGGAAGGAAGCTGTTTCCATATGACCCTTATGAACGAGCTCGCCAAAAGATGTTATT 

GGAGCT 

Sequence 1741 

CANCGCCCCGGCCCGNTCAAGCAGCNTAATAAAGCTCATANAGGCGGACNGGCATCNGGG 

TCTNGGGATCTGCACAGCGGGACTGGCATAGGGCCGTCTGGCACTGAAAACCCTAANCAA 

GAAGGTGTGAAAAGAAACTTNAGCNGANNCGACCNGAGGCATCNNCGCCAGCCAGCTCCG 

AAAGCAGAAGANGGAGGCGGTTCTGGCATGAGAAAGANACAGCTGGGCTGGCNAGGATAG 

GCCCCTCCTTCATCAAGGTACTGGTGGTGCCCCTGCACAGNANAATTNTNCCTGCCAGAG 

GCCATGCANCCTGCTTCAAGATAGGGACACTGGAACAGATACACTTGAATGAATTGGGGA 

AACACCCCAGAACTTTT 

Sequence 1742 

CACGCGTCCGTTTTTTNCMGGGTCTATTGTTTC 

GGTTTCCTCCAT(X) CGAGT MTTCATAGAGTGTTTGGCTTGNCTACCTTCCTCCTGACTG 

MGTCACCTGATACTTTTTGTTTTTTCMGMGGMGAGAGMCCCTGTTGCCTCAGTTA 

CTAGCAATGATACMTTCTCAAMTCTGGTCTTTTTTTGTTTCTTTGAAATAGTT^ 

ATGTTGTGTGACACAGCAGCCCCTGTCTTATCATAGTTGTCTTCCCTCCACCACCTGTAC 

CAGAGATGTTGGATATGTTGGAGGTGMGGTGTGCMGGTTTTTAACTAACTGTTCTAAT 

T AMGGATT CTGCAGGAMGMCATGGGTTTACAAMGAGMGCTTTTGTATTATTAGTA 

A I I I I I I I I CTTTGATGMTTTATGTGCTTAGTTTGGAGAATCGAGAGTTGGCTGGGAAA 

AGATTTCTGAGGAGTTAAGGGACTCTGGTGCTGTTTGGGAA 

Sequence 1743 

GTGATTAGAAGTAAGCGNTGATGAGGCTGAAGAAAAGGAAGACAAAACTGAGTACTTGGA 

GGMCGAAGAGTMTGGGATATCCMTAAACTGAGAATGTTTCTTCACMTCTCCTTTAT 

TCTTCGTTCCTCCMGTACTCAGTTTGGTCTTCTTTCAGGTGTAGGAACGTGATAAAGAA 
GTAAGCGATGATGAGGCTGAAGAAAAGGAAGACAAAGAAGAAGAAAAAAAAAAAAAAAGG
Sequence 1744 

CGATTTCTTTGTTGGACMCCCAGCTGGGGCTAGGMTGGTTCAGAAGGTTTAAGGCCGG 

MNGGGNAATGMGGGGCCCGG(^CTMCCCTCTAGGGACCTGTTTTGCTTCTGTTTAAA 

CCAAATGGGCAGTCTGTCATTACACACACCCTGNGTCTTCATATGTGGCTCGCCAGTATA 

ATGGAATGTGCTTACAAGGGCCAGCAGGAGTGCCTGGTCGAGACGGGAGCCCTGGGGCCA 

ATGGCATTCCGGGTACACCTGGGATCCCAGGTCGGGATGGATTCAAAGGAGAAAAGGGGG 

AATGTCTGAGGGAAAGCTTTNAGGAGTCCTGGACACCCAACTACAAGCAGTGTTCATGGA 

GTTCATTGAATTATGGCATANATCTTGGGAAAANTGCGGAGTGTACATTTACAAANATGC

TTTTCAAATAGTTGCTNTAANANTTTTGTTCAG

Sequence 1745 

GGACGCGTGGGTGGAMTGTAMCMGAATAGACTGTTCATTCCTGATGGCTTTTAGTCT 

ATACTMCATATTGTTTGTCATGGCATCCGAGACTGAAMGACCCATGCTTTACTGCAGA 

CTTGTAGCACTGMTCTCTTATTTCCAGCCTTGGGTCTGGGGGCATTTTGCCTCGTAGCT 

GACAGACTTCTTCAGTTTTCCACAATTCAGCAAMTGACTGGCTTCGTGCTCTCTCAGAT 

AATGCAGTACATTGTGTAATTGGCATGTGGTCATGGGCGGTAGTCACTGGAATCAAGAAG

MG ACTGAC TTTGGAGAAATCATTTTAGCTGGATTTTTAGCCTCTGTTATTGATGTAGAC 

CACi I I I I iCTAGCTGGATCCATGTCTTTAAAGGCTGCTTTGACTCTCCCGCGAAGACCT 

TTCCTTCACTGTTCTACTGTGATTCCCGTTGTGGTTCTGACCCTGAAATTTACTATGCAC

CTTTTCAAGCTCAMGACTCATGGTGCTTTCTTCTGGGATGGTATTTATATCC 

Sequence 1746 

GTGAACAGGNrTATTGACTATGGTAAACTTATTTTATTGAAGTTTTCAACCGGAAAATAGT 
MG TGGAA TATGATACAACTGTTATTTTCAGAACATATTTCTTTAGGGCTATTTAAAATA 
ACCi 1 1 I I AAAGGGCAAAAACTTTCAATTTGAGAGAACAAATTCCTCTCCTCTGTGGGAA 

atattggctgagattttgtatagmtaagagacatgtatgtaacatatatttatattcag 

cataagtctactgcaatcatgtacacatcttagcmgacgagaggattttgtttagtctt 

tgtttatgacttctacagtttcctgtatctagtgttmgttgtaaggaaaaactaaacat 

gcaattta aagg taaacttgatmctatttatggaacataagcatacaccaatggttatt 

tctcacagttttcatgcgcatttgttta^ 

tmtgctggtcacaattagmtgctaatatttggggmgctatgcagaamtattt 

Sequence 1747 

CNTGTGTGCCATGTATACCTAACGGGAGTCCCAGAAGACAGGAGAGAAAAAAAGAAAGAA 

ATAAAAAGAATATTTGMTTTAAMTTGCTTGAAAATGTCTCAAATTTGATGAAAAATAT 

TACTCTGCACATTCAACCCATGAACTATAAGTTGTATAAAATCAAAAAGTTTCACACCAA 

GGCGTGTCATAGCCAAACTGTCAAAAGCCAAAGACACAGAATCTTGAAAGCAGTGAGAGC 

AAAGCAGACAAGGGATCCCCAATAGGATTAACAGCAGATTTCTCATCAGAAGCCATGCAA 

GCCAGAAGGCTATGGGAGACATACTCAAAATGCTGAAATAAAAGACTGTCAACAAACATT 

TCCACATCCAGCAAAAATCAAAAACGAAGGAGAAATCTGTTGCATGTGAGCTGAATAGAA 

TTTGTATTCTGCTGTTGTTGGATTGAAGTATTCTTTAMTGTCMTTANATCAATTTG 

Sequence 1748 

CGATGGCMTACATGTACTCAGATAGTTACATCCCTATATAAMAGTATGTTTACATTTA 

MAAA T TAGT AGATMCTTCCTTTCTTTCAAGTGCACMTTTCATTTTGACTTGAGTCAA 

CTTTTGTTTTGGM CAMT TMGTMGGGAGCTGCCCMTCCTGTCTGATATTTCTTGAG 

GCTGCCCTCTATCATTTTATCTTTCCCATGGGCAGAGATGTTGTMGTGGGATTCTTAAT 

ATCACCATTCTTGGGACTGGTATACATAAGGCAGCCGTGAAACTGGAAAGTCATTTTGAT 

GACTG ATGTG ATACATCCAGAGGTAAMTGCATTTAAACATATTAAAGTATTTGCCAAAG 

ATACAATTTTCTTGCTGACATAAAAATCACACAAACAAGTCCCCCX5CAAACCACAACTGT 

CTCTCAAATAGCTTAAAAAAATTGAAAAACATTTTAGGATTTTCMGTTTTCT 

AAAAAGATGTTCAAGCTATTAGAGGGAATGGTAAA 

Sequence 1749 
GCCCNTCCCGACCCTCCAGCTCATGGTGTCTGGGGCCTGCGGCTAGACTCTTGGAACATT

CTGGAACTCTCTCCTTTCCTGGCTGGGGCTCTGACCACAAACTCCCCTCX3AGGCTGCCCC 

TGGGACATGGTGGTGATGTGGGTGCAGGAGCCAGTGTCTGTTGTCGGGACTCGCAAGTGC 

CCTCATCACAGCCACCCCCACCACGAGTGTCTCCCCAGTGCAGACTCAAGTTATGCTTGA 

AATGAAAAAGTCTATCTGGTAGTGGGTAAACGTAGACCTGGCACTGTTCCACX3CGGGCGC 

CCCAAGCCTGCCACTCCTGTGTCCCTGCCTCCCTGGGCTCCCGAGATAGGCACCACTGTA 

TCCTCCAGCTCCTTCCTTCCTTCCCCCAGGAACACGGAGGCCACCGAGGGGGCTGGGCCT 

ATCAGGAGGACACAGGCTGCAGCCTGGCACCCACCCCTCCATCTTCACCCCACATGGAAG 

ACTTGTCTTCAAC 

Sequence 1750 

GCGTCCGGATTCTAAAMAAAMAAAAAAAAAAAAAAAMTTAATTGGGGTGCCTTTTTG 

TTATAGTTTCTATTTTCTGTTTTGTAGGACAAGCTGCATTTTCTGTAAATATAGGTCTGG 

ACTAAAGGATACATAAAGAATGCACAAAATGTCAACATCAGCAGAGATGCCCAGATCTAT 

TTATCTCTAAGTATATTTGMGTGATTGCTGhiTTATATGTTGTCATTTTAAAATTGTGTG 

TCAGTAAAGCTACCTGTAAAATTTCAGTCCAAAAAAAATAAAGCTCTCAGGGAGACATGA 

ATAAAATCAATGAACATTAGAAAATAAAATATAGATGCTTACCATTAACCTACCAACTCT 

TAATATCCTTAAATTATGTGATATATAAAGAGGACTGTTA CI 1 1 1 I I ACTTTCI 1 1 I U I 

1 1 1 1 1 1 1 1 GGCTTTGCTTTATTTATTTGGAGT 

Sequence 1751 

GGGCATGCTCATAGGCACAGCTGTTGGTCAGTATGCCAATAACATCACACTTTGGATCTT 

TGCAGTCACTGCAGGCATG7TCCTCTATGTAGCCTTGGTGGATATGCTTCCAGAAATGTT 

GCATGGTGATGGTGACAATGAAGAACATGGCTTTTGTCCTGTGGGGCAATTCATCCTTCA 

GMTTTAGGATTGCTCTTTGGATTTGCCATTATGCTGGTGATTGCCCTCTATGAAGATAA 

AATTGTGTTTGACATCCAGTTTTGACCTTTCCCAGTAATCACTGTTGATTACGAGAATGT 

TACCATGCAGCTTTGCATCTGTTCCTTGTACTGTATGCACATTGCTCAAAGGAAAGTCAG 

TGGCTTGCACTACTTACAAGTTTCATAGATTTGAGCCTAACCACAAGAGGCTGGTGCTTA

GTACTGTTTTCCCTGCACGTAGGGGTCTTTTAAAAATATAAAGCTTGTGATAAAGAGAGG 

A 

Sequence 1752 

CTGGTTCAGCAGCCGCCCACCCACCTCTGAGTCTGACCTGGAACCTGCCACAGATGGGCC 

AGCCTCCGAGACCACTACCCTCAGCCCAGAGGCCACCACCTTTAATGACACCAGAATCCC 

TGATGCAGCTGGTGGCACGGCCGGCGTGGGTACCATGCTTCTGTCCTTTGGGATCATCAC 

GGTGATAGGCCTGGCTGTGGCCTTGGTTTTGTACATCAGGAAGAAGAAGAGGCTGGAGAA 

GCTACGCCACCAGCTCATGCCCATGTACAACTTCGACCCCACGGAGGAACAAGATGAGTT 

GGAGCAGGAGCTGCTGGAGCATGGGCGGGACGCCGCCTCTGTACAGGCTGCTACTTCTGT 

GCAGGCCATGCAGGGCAAGACTACTCTGCCCTCCCAGGGCCCACTCCAGAGACCCAGCCG 

GCTTGGTGTTTACCCGATGTGGCCAATGCCATCCATGTGTGAGTGGCCTGGGACAAGC 

Sequence 1753 

GTCGCCCCGCGTCCGGTGCTCTCATGTCATCTCAGAGTTCCAGCTTATCAGAGGCATGTA 

GCAGGGAGGCTTATTCCAGCCATAACTGGGCTCTACCTCCAGCCTCCAGAAGTAATCCCC 

AACCTGCATATCCTTGGGCAACCCGAAGAATGAAAGAAGAAGCTATAAAACCCCCTTTGA 

AAGCrTTCATGAAGCAGAGGAGGATGGGTCTGAACGACTTTATTCAGAAGATTGCCAATA 

ACTCCTATGCATGCAAACACCCTGAAGTTCAGTCCATCTTGAAGATCTCCCAACCTCAGG 

AGCCTGAGCTTATGAATGCCAACCCTTCTCCTCCACCAAGTCCTTCTCAGCAAATCAACC 

TTGGCCCGTCGTCCAATCCTCATGCTAAACCATCTGACTTTCACTTCTTGAAAGTGATCG

GAAAGGGCAGTTTTGGAAAGGTTCTTCTAGCAAGACACAAGGCA 

Sequence 1754 

TCGCCCCGCGTCCGGACTGATCATAAACCATGCTGGTATTGCACCTTCTGGAACTATGGG 
CTTGAGAAAACCCCCAGGATCACTTCTCCTTGGCTTCCTTATTTTCTTGAGGCAGGTCGC 
ACGTTCTACCTGCCCAAGACGTGTGATATCAGCTTCTCAGATCCAGACGACCTCCTCAAC 
TTCAAGCTGGTCATCTTGTCCTGATGAGGGCTTCTACAAGAGTGGGAAGTTTGTGTTCAG 
TTTTAAGGTGGGCCAGGGTTACCCGCATGATCCCCCCAAGGTGAAGTGTGAGACAATGGT

CTATCACCCCAACATTGACCTNGNGGGCAACCGTCTNGCCTCAACATCCTTCAGGAGAGG 

ACTGGA MGCCAGTCCTTACNATANACTCCATAMTTTTATTGGGCCTTGCNGGTATTCT 

TTTTTTNTTTGNGAGCNCCAAACCCCNNGANGGACCCCCACTTGGAACCNAAGGGAGGGC 

NCGCAAGAAGGGTTCCTTGGCAAGAAACAAACCCGG 

Sequence 1755 

TCCGGCCCGCCCACCCGCCGCCAGCTACCATGGATGATGATATCGCCGCGCTCGTNTGTC 

GACAACGGCTCCGGCATGTGCANGGCCGGCTACGCGGGCGANGATGCCCCNCGGNCCGTT 

TTCCCTCATCGTGGGGCGCCCCANGCNCCAGNTGCGTGATGGTGGGCATGGGTCAGAAGG 

ATTTCTATGTGGGCGACGAGGNCCAGAGCAATGAGAGGCATNCTNACCCTGNATTACCCC 

ATCAGAGCACGGCATISTTNTCACCAACTGGGACCGACATTGGAGAAAATCTGGCACCACAC 

CTTTCTACMTGAGCTGCGTGTGGCTCCCGAGGAGCACCCCGTGCTGCTTGACCGAGGCC 

CCCCTGAACCCCCAANGNCAACCCGNCGAGAANGATNACCCCAGATNCATGTTTTGAGGA

CCCTTTTCAACACCCCCAAGCCCATGGTTACCGTTTGCCT 

Sequence 1756 

GCGNNGGGGCCGGGGCTGGGGTCXaGGGCCAGGGGTCGGCCGGNGGCGGNGGACGGGCGTN 

CGGTGCCTGGGCTCTCTCCGCTCCCTGAGGCGCCGCTGGGGAGTCGAACCCTGATGATGG 

GACCTGTGMTTCAGCGGGGTGCCAAGTCGTTTTCTGTGTGGGTTGAGAGACAGGCTGNG 

CAGCCCACTGTTGCATAGGACTAACTACTACAAATCATGCTGAGACCGAGCTATTTTTGC 

TGCTTAGAGGCTTTGCAGCCTTGAGAGGAATTTTGTTGTCAGGAGAATAAAAGGAGGTTG 

TCCATNATTGACTTTMGCAGCAATCAAGTAAMCATTGAGCTCTTNAGCTCXJGCCTTTC 

TTGCTCTGAAMTTGGAAAACCAANAAGGTTTTGNTGTTATTGTGTGACCCCACCTGAAT 

TATAMCCMGATGMCATMCCAMGGTGGTCTNGGTGTTGTTTTTCAGCAAAAC 

Sequence 1757 

GGATCTACCANGTGTCTNGAGCAGCAGGAGAGAGCCTTCACGCTGGGGCTAGCAGGACTT 

CTCGGCGAGGGAGNNTTTAACTTTGGAGAACTNCTCATGCACCCTGTGCTGGAGTTCCTG 

AGGAATACTGACCGGNAGTGGCTGNTTGACACCCTCTATGCCTTCAACAGTGGCAACGTT 

GAGGCTGGTTCCAGACTCTGMGACTGCCTGGGGCCAGCAGCCTGATTTTAGCANCTAAT 

GAM GCCCAGCCTTCTTGAGGAAACATTTCAGTTTGTTTGTTGCCTCATTGGNAGATGAC 

TTTTCACACCGACCCTGGCCNATTCAGMGACANACTTNACTTTTTTGMGGAAAMTT^ 

CCAAAAAAGTTGGCCTAAAAAAATCNACAGGTTGAAATGGAAGGGTGNGAAGNCTTTTCT 

GGGGTGGATTGGAMNGGGCACCCTTTTTCGGNTNGGGGGGCNTGGGT 

Sequence 1758 

GAGCCGGCTGGCTCGAGTGGCCTTCGTCGTCCCTTGGCGCCCTGGGAGAGTCGCTGACGG 

GTGGACTGACGGACCGCCTGAGGACGGCCGGCCAGGGCGGTGAAAGCGCCAGCCCTATGG 

CGCGGGTCGCAGTGAGGCGGAAGGCCGAGGACGGCCGGCGGCGGCGCCCGCCCCGGCGAT 

GCGGGCCCCGCCCGTCGCCTNAGGTGCCATTTGGATTGTACTTTAGTGGCACCGATGTAC 

TCTGAGTGGAGGTCACTGCATTTGGTGATTCAGAATGATCAAGGCCATACCAGTGTGCTG 

CACAGCTATCCAAAGAGCGTTNGACGAGAGGTGGCAAATGCTGTAAGTCCGGCCTCTTGG 

GCAGGTGTTMGTACCCCTTCAGTGGCTGGTAGTGAGAATTTGTTAAAAACTGACAAAAG 

AAGTA 

Sequence 1759 

GTCGACCACGCGTCCGGTCTCGGCGCCCGCTGCCCTCTCACCGCCCCACGCAGGATCCCG 

GCCTGGTCACCGGGCAGTGTGATGCTTCCCGACTGCCGCGGGGACAGCGAGGCACACACA 

GGGCTTGGGCCGCGCCGGAGGCCACACGGCCTGGCTGAGTTGCTCCTGGTCTCCCGCCTC 

TCCCAGGCGACCCGGAGGTAGCATTTCCCAGGAGGCACGGGCCCCCCCAGGGGGATGGGC 

ACAGCCACGCCAGATGGACGAGAAGACCAAGAAAGCAGAGGAAATGGCCCTGAGCCTCAC 

CCGAGCAGTGGCGGGCGGGGATGAACAGGTGGCAATGAAGTGTGCCATCTGGCTGGCAGA 

GCAACGGGTGCCCCTGAGTGTGCAACTGAAGCCTTGAGGTCTCCCCAACGCAGGACATCA 

GATTCCTCATGGTGCAAAATGGCCATTCCAAGCTTCCATCCAGCCATCACATCACAGGAG 

GAAGGGAAGA 
Sequence 1760

CCGATGTCTACAAGCTCGTCAAAGACTAGGGTGCCCTCTGCGCCTCCTTAAGGATGCAGG 

GTGAGCATCTCCTCTCCACACCTGCCTGGCACCCCTGGGGGGGTCCAGGATTGAGGATTC 

ATCTACCTGCCAGGCCTCAGGCCCAGGACCCAGGAGGCCTCCCCACCTACCCCAGCACAC 

ACACTCCCTGCCACTGTTCTGCGCTTTAATTGTGGGAGAAGAGAGGAGAGGAGGGCTCAG 

CGGTGGGGCAGCCTGTCCGGGGCGCTGACCCACCATCACCCTGCTCTGCCCAGCCTCGCG 

TGACCTCAGAGAGGTGGGGATAGGGGACACCTTCAGCCTCCAGCATGTGTGGCCACTTGT 

ACCCCCACCCACCCTTGGGGGAGCATGATGGGCAGGTTGAGGGGCAGGATGGAGACCAAG 

GGGAANTCAGTGAGCAGNAGGCCCTGGGGAAGTGTCCGGTCGGGGTTGGACTTGAGGGAC 

AGAGGGGCCCAACACTTTCTTGCCCCTTTTG 

Sequence 1761 

CCGCCAGGCCCACCATCACCACCGCCTCTGGCCGNCACCCCCATCTTNCACCTGTGCCCT 

CACCACCACACTACACAGCACACCAGCCGCTGNAGGGCTCCCATGGGCTGAGTGGGGAAG 

CAGCTTTNCCCTGGCCTCAGTTCCCAAGCTAACTGCCACGTCCACCCACGCATACACACA 

TGCCCTCCTGGACAAGGCTAACATNCCACTTAGCCAGCACCCTGCACCTGCTGACGTCCC 

CACNTCCCTTGGTTGGTGGGGACATTGCTTCTCTGGGGCTTTTTGGAATTGGGGGGCGCC 

CTTCTCTGCTCCTTTCACTGGTTAGNCTCTGGCTACCNATAAGCANGAGGCCNTGGGAAG 

GGGTTTCTCCCCTGGGCCCTTAAAAAAAGGNGGGCCCCAAAGC 

Sequence 1762 

AGTCGACCCCGCGTCCGGCAAGCGGGACCGGTAGGGGCCGGAGCATGCGGCGGCGGCGCT 

CGGACTGTCCCATCCGCCCCGTATTGAGGCGCTGGGAGCGGCGGGGCGACAGGAAAGCGA 

TGGTGAAAGCGGGGCCGTGAGGGGGGCAGGNGCCGGGAGCCGGACCCGCAGTAGCGGCAG 

CAGCGGCGCCGCCTCCCAGAGTTCAGACCCAGGAAGCGGCCGGGAGGGCAGGAGCGAATC 

GGGCCGCCGCCGCCATGGAGCTGAGAGTCGGGAACAGGTACCGGCTGGGCCGGAAGATCG 

GCAGCGGCTCCTTCGGAGACATCTATCTCGGTACCGGACATTGCTGCAGGAGAAGAGGTT 

GCCA 

Sequence 1763 

CGCGTCCGGGGTGATTTGAGCTCATCTTGTTGAGCAAGGGGAGTGAAACCACAGAAACTG 

TTAM TTGAACGTAMTMCTTTGGAAAACAGTTTTGACACTAGAACAAGGTCGCTTCTC 

TTTTCCTCTTCTCTCCCAACCTGTTAACCTAGTGCTAGATAGTCAGCTGCTGTTGCAGGT 

AGAAMGCTCACCMTGCCACATTTGTTCATTCTGTTCAACCTGTTTTGACCTCCCAGGC 

ATCTGAACAGGATGACATCATCTTATTATATCTTACTTTACACATACGTGTATGTACACA 

CACACACACMTACAAGACATTTTTCCTGTTTTAGAAAATATAGCCCTTGTTTGGATTAC 

TGCTGTCTGAGCACTAGAAATTTCTAATGGAAAGAGGCCTCTGAATGGCTAAGGGAACAT 

CTGGAAAGGMGGAAMTGAGCCTNMGGTTTTTGGTGCTTGGTTT^ 

TTCNTGGTTCC I It 1 1 IGGTTT 

Sequence 1764 

AGTCGACCCCGCGTCCGCGGACGCGTGGGCGGACGCGTGGGGGCCCTCCGGGAAGATGGC 

GGCCGTGCAGGCGGCCGAGGTGAAAGTGGATGGCAGCGAGCCGAAACTGAGCAAGAAGTG 

GTGGTAATCATTAGTTCCAGGGTGCTCTGCCATGTTGACGCAAGCTGCTGTAAGGCTTGT- 

TAGGGGGTCCCTGCGCAAAACCTCCTGGGCAGAGTGGGGTCACAGGGAACTGCGACTGGG 

TCMCTTGCTCCTTTCACAGCGCCTCACMGGACMGTCATTTTCTGATCAAAGAAGTGA 

GCTGAAGAGACGCCTGAAAGCTGAGAAGAAAGTAGCAGAGAAGGAGGCCAAACAGAAAGA 

GCTCAGTGAGAAACAGCTAAGCCAAGCCACTGCTGCTGCCACCAACCACACCACTGATAA 

TGGTGTGGGTCCTGAGGAAGAGAGCCGTGGACCCAAATCAATACTACAAAATCCCGCAGT 

CAAGCAATTCATCAGCTGAAGGTCAATGGGGAA 

Sequence 1765 

TCACCCCGCGTCCGACTTGAATCCCGTCAGCTTAAAACTTGTGTAAGGGAATCCTGACTT 
TTAAMTGTGAGGGTATTTGGATCTGTGTTGAMGTCGTATATTTTTATCTGTGCGGTGC 
TGAGTGCAGGCCACCAGCTCCTAAATAGAGGTTCCCTATATGCGCGTATGACATGGTGAA 
TAAACACAACTCTCTCCACTCAGGACATCCGGAGCGTTATGGACCGTGGTAGGTGGTCGT 



TCTGTGTGCTTGTGAAAGTGTCCAGGCGTGTGCACAGCCAGTGCGCCCACTTCCGGGCTC 

CTTGCTCCCTGCTGTACTGMGTTTTGGATTTTGCATCXDAATCCTGTGTGCCTGCCCTTC 

T 

Sequence 1766 

GTCGACCACGCGTCCGGCTCCCGAGCCTGGAGAGCTCGGACTGCGAGTCCCTGGACAGCA 

GCAACAGTGGCTTCGGGCCGGAGGAAGACACGGCTTACCTGGATGGGGTGTCGTTGCCCG 

ACTTCGAGCTGCTCAGTGACCCTGAGGATGAACACTTGTGTGCCAACCTGATGCAGCTGC 

TGCAGGAGAGCCTGGCCCAGGCGCGGCTGGGCTCTCGACGCCCTGCGCGCCTGCTGATGC 

CTAGCCAGTTGGTAAGCCAGGTGGGCAAAGAACTACTGCGCCTGGCCTACAGCGAGCCGT 

GCGGCCTGCGGGGGGCGCTGCTGGACGTCTGCGTGGAGCAGGGCAAGAGCTGCCACAGCG 

TGGGCCAGCTGGCACTCGACCCCAGCCTGGTTGCCCACCTTCCAGCTGACCCTCGTGCTG 

CGCCTGGACTCACGACTCTGGCCCAAGAT 

Sequence 1767 

CCGGAAGAGTGCTTTATTGTGAAATTATTTAAAACTGTCCTTTAAAAGAAAAAGAGGAAA 

CGATGAACAAAAACTMTCTMTTGCCMGTTAGAATTCATNATTTMTTTACCTCCTAT 

GCAATGATTAATGCTGCAAMTGTATGGTTATGTTACCGTATATTCACAAAAGAAATATT 

ATTACAAGGTTTCAGAGGTAGCCAATTGCATTCTTTTGGAMTTTACTGTACTGTTTCAA 

TGT GTTAAG TGCCTTGTTGTAAAGTAAMTTTTMGTCTAGATTCATTATTTTCCTGACA 

TATATTTTTCATTATGATATCTACTGTATGCTATTGTGATAGTTTTATGAAATGCTTACA 

TTTTMTCAAATATGTAAATTTCAGAAGCTCTTTTTTC 

TTTGGTTTATCACATTGGA7TCAM 

T 

Sequence 1768 

CCGGTGGGATAGTAAAAGAGAAGACGCGGAGAAGAGGAGAGGACCTACAAGAACGGAGGA 

CAGGGGCGCACGATGGTCCCGGGGGGAGCGGAAACAAAGGNACGCAAAACGGAAAAGCGT 

GTGTAGGGGAGCGGAAAAGGAAGGTCACCACCTGTGGCCTGCGACCGAAATGGCGAAAAG

TCTTTTGAAGACAGCCTCTCTGTCTGGAAGGACAAAAATNTGCTACATCAAACAGGATTG 

GTCACTTTTATAGTACATCCCATGGTATGTTTATGNAGGGAAGGAAGTGNAAAANNAACA

CCTTCAGCAGTTTTCCTGGIsnXaGNATTC^ 

G GGCATT TGGGNCATTTCTTTAChTrCTTGMCCMTTCCCAAGTTAGGAATGGAAATGCC 
CI I I 1 1 I CAGGTTGTTAATTGNATGCTACAMCTTCTTGGNAAAAAAGTTAATTTGGAAT 
TTGGGAAAAMTTTGGGAACAANAGGGGGGAAAAAGGCCTCCATTTGTNCCCGGTGGGGG 
GCCAAAAAAATACCTTTTCTTC 
Sequence 1769 

GCGTCCGCGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGATTTG 
TGTACAAATTGAAATGTCTGTACTGATCCTCAACCAATAAAATCTCAGTTATGAAAAAAA 
AAAAAAAAGG 
Sequence 1770 

GTCGCCCCTCAGCCGCTTCCCCTCGCCATGGAGGCGAGGCCGCCGCCGCCGCCGCGGGGC 

TCGGAGCCGCGGGCCGGGCGGCGGCCCTGAGGGCTAGTGGCGGCCCGAAACGCCGCCGCG 

GAGCCGAGGCGGAGCCGCTGTCCTCGTCCCCAGCGGTCCCGCCCAACGCCCGACTCTGTG 

ACACAACTAAAAAAAAACAAAGGTATTTATGGAATTCCACTGAGTGGTAATGGATGATGC 

AGTTCAAATMCTMGATMGCATGGATTCCTGGTTCATTCTTGTTCTGCTCGGCAGTGG 

TCTGATATGTGTCAGTGCCAACAATGCTACCACAGTTGCACCTTCTGTAGGAATTACAAG 

ATTAATTAACTCATCAACGGCAGAACCAGTTAAAGAAGAGGCCAAAACTTCAAATCCAAC 

TTCTTCACTAACTTCTCTTTCTGTGGCACCAACATTCAGC 

Sequence 1771 

GCGTCCGGGCCACATGGCCGCCGCCGCCGCTTGGAGCTGAAGTGCCGCCGCCGCCGGGCA 
GCCACGGGGAATCCGCCCGCATCGCCGCCCTCGCCGGCCGGGCGGCCGTGGGGCCCAGAG 
CGCCGGAGGCCAGGGCTGGGGCGGCACCGCGCAGCGGCCACGGGGTCCCGTTAGAGCAGC 
GCCCGGCGGCTATGCCGAGAGCCCGGAGCGGCCGGAGGAGCAGAGGGGCCGNCGGGAGGG 
AGGAAACCTTTCTGCGAGTACGAGCCAACCGGCAGACCCGACTGAATGCTCGGATTGGGA
AAATGAAACGGAGGAAGCAAGATGAAGGGCANAGGGAAGGCTCCTGCATGGCTGAGGATG 
ATGCTGTGGACATCGAGCATGAGAACAACAACCGCTTTGAGGAGTATGAGTGGTGTGGAC 
AGAAGCGGATACGGGCCACCACTCTCCTGGAAGGTGGCTTCCGAGGCTCTGGCT 
Sequence 1772 

TNGATGTGTATAT GGACT GTTNTGMGGGTTTTTTTCTTTATAGCCCANTTMGTTNTGT 

TTGGCTCGGTGCATTTTTCATTTATTTMTTAGTMTTTAAGTMCNGTGTTTG 

ATCATTGTG MGTT TCAAGATTCATTATGGGANGAGTTGATGGTNCANTNANGCATGATG 

GTTTMCAA ATTTT AACACCAAAAATGTTAAATCCTGCATAAATTCAACTGTANATAATA 

AATANGGTGTTTTCNT GTATAT GATAGNAATGCMTTAGAAGTACCTNTAGTAAANTCTT 

TTGGMTCACCAATNCTTTTTTGGCTTGAAMTTGGGAAAGMTTTCTGTTTATAATNC 

TTTNNAATTAAACCTTGNGNGGGGGGAGGGGGAAAATAAAAAATTGCAGGAAAAACCTGC 

ATGAGNCACCTTMNAACCTTTAAMGTMGGGGGCTTNCCMTCTTTTTANTCCCNGGA 

AAACCCTGGTTGCCTN I I 1 1 IGGCA 

Sequence 1773 

CGTCCGTTTAAAGGCTCTGACTCTTGATCTTGAAAGCCGGACGCGGCACTGGCACTCGGC 

TTCAGTTTCCACTGTGACAGATGGAGGTCTCCTTTCGCCCCAGCCCAGGTGGCCAAGCCC 

ATCCTGGCCTCAGAACATGCTGAGCACATTTTGTAGGGTGGCACCTTTTTATCCAAGTTA 

CTAGCTACACATCAGTGTTTAAAG AG AAAAAAGTG ACCTTTC A i I I Ml 1 1 i I I CTTGAA 

ACTTGAGGAAACAAGATACATACTACTG A I rTTTTTTTTI I CTTAAAACTAAATGCATGA 

CTGCAGAGCGGTAGAGGTGTATATTTTTCATACTGTGGGGCAMGTATTTGTGCTGCTTT 

TTGGAGATGGACTGGAACGTCTGGTTTCTGTCCCCGGGCCCGGCAGCTACCGTCTATTTT 

CTGTAGAAGGTGCCACAGTGAGACCTGGAG 

Sequence 1774 

CCCCGCGTCCGCTTCCGGTTGCTAACGGTTCCCAAACAGCCCCCGAAAACGCTACGTGAG 
CTGGGCCCTGGGCCAGAGGCAGAAAACGGACGGAAGAAAAGGTCTGGCCGGAGATGGGTC 
TCACTCTGTCACCCAGACTGGAGTGCAGTGAGTGGTGCGATCATAGCTTACTGCAGCCTG 
AAACTCCTGGGCTCAAGTGATCTTCTCGCCTCAGCCTCCTGAGTAGCTGGAGCTACAGGT 

GTGAGCTACCCAGCATGGCTCATTTGAGATTTCTGAGTAGAGAAGTAACATGATTAAAC 
Sequence 1775 

GGAACCTCCCCTAGATTTCAGAAGATGTATGGAAACGCCTAGATGCCCTGGCAGAAGTTT 

GCTGCAGGGGCGGGGCTCTCATGAAGAACCTCTGCTAAGGCAGTGTGGAAGGGAAATGTG 

GGGTTGGAGCCCCCAAACAGAGTCCTTAGTGGGGTGCTGCCTAGTGGAGCTGTGAGAAGA 

GGGCCATCATCCTCCAGACCCCAGAATGGTAGATCCACTGACAGCTTGAACTGTGCACCT 

GGAAGAGCCGCAGACACTCAATGCCAGCCCGTGAAAGCAGCCAGAAGGGAGGCTGTACCC 

TGCAAAACCACAGGGGCAGAGCTGCCCAAGACTGTGGGAACCCACCTCATGCTTCAGTGT 

MCCTGGATGTGAGACCTGGAGTCAMGGAGATCATTCTGGAGCTTTAAAGTTTGACTGC 

CATGCAGGATTTCGGACTTGCATGGGCCCTGTMCCCCTTTGTTTTGGCCMTTTCTCCC 

GTTTGGAACGGCTGTAATTACCCAATACGTGTATCCCCATCGTATCTAGGAAGTAACTAG

CTTGCTTTTGTTTTTACAGACTCATAGGTGGAANGGACTTGCCTTGTCTCAAATGAGACT 

TTTGGACTGTGGACTTTT 

Sequence 1776 

GCGTCCGGMCTTTATAAGAATTTATGCCGTTNTACATGMCCGTTMGTTTTGTACTT^ 

ACGTTTCTGTTTATTANGCTAAATTGTTCCTCAGGTGTGTGTNTATATATATACATATAT 

ATATATATATATATATGTATATATATACACATATATACGTATATATACATATATATGTAT 

ATGGAGTCTCGCTCTGTTGCCCAGGCTGGAGTGCAGTGGCACGATCCCAGCTCACTGCAA 

CCTCCGCCTCCCGGGTTCGGGCGATTCTTCTGCCTCAGCCTCCCTGGTAGCTGGGGCTGC 

AGCCATGTGCCACCAAGCCCAGCTMTTTGTATTTTTGGTGGGAGACAGGGTTTCACCAT 

GTTGGTGAGGCTGGTCTGGAACTGACCTCAAGTGATCTGCCCACCTCGGCCTCCCAAGGT 

GTTGG GATTMCAGGTGTGAAGCCACCACGCCTGTCCCAGTATATTGTTTAACAAGTTTA 

TTTTTGGGTGAAAAATTCTCTTTAATGGGAAGA 
Sequence 1777

CCACGCGTCCGGCGCCCCCTCCCGGCCGCCATGTTGGCTGGTGTGTGGGTGTCAAACTGA 

GCCAGACGCGGCGGTGGCGGCGGCTCCGCGGGCTACGGTCGCTCCCGCCTCTCGAGCGCT 

GCCGGTGGCCGCAGCGGCGCACCCACGCCGGCCCGGAGGAGCAGAGTGCTAAGTGCTGGG 

TGCTCACTGGTGATGAGGCAGATGAAGGTTACCAAACTTGTGGACAGGAGCCTCATATCA 

GAGACGTGGACCTCACTGTAGCCTGGTCATGGCTTCCAGCTTTTCGAATCTGAGGCTCCA 

AAGGAGGAMTGACCATTCAGGGATCTTACTCCAGCTTGATTACGGAGACTGAACCTTCA 

TAGGGTGCGCACTTACCMGGACAGGMGGTTTCTCTGTTTGAAGGGCTTTAAACTTATA 

ACAAAGAAAATAA 

Sequence 1778 

CGCGTCCGAGACAGTTCATACTGGAGACAAACCCTACAAATGTAATGAATGTGGCAAAAC 

CTTTAAACGGAACTCAAGCCTCACTGCACATCATATAATCCATGCAGGAAAGAAACCATA 

TACATGTGATGTATGTGGCMGGTCTTTTATCAGAATTCACAACTTGTAAGGCACCAGAT 

AATTCATACTGGAGAGACACCTTACAAATGTAATGAATGTGGCAAGGTCTTCTTTCAACG 

TTCACGTCTTGCAGGGCACCGGAGAATTCATACTGGAGAGAAACCCTACAAATGTAATGA 

ATGTGGCAAGGTCTTCAGTCAACATTCACATCTTGCAGTGCATCAGAGAGTTCATACTGG 

AGAGAAACCTTACAAATGTAATGAATGTGGCAAAGCCTTTAATTG 

Sequence 1779 

NCCCCGCGTCCGCTGTTGGAGCAGTAGACTTCTCACATCTTTTTGTCACTTCATCGTTTG 

ACTGGACAGTAAAGCTTTGGACAACTMGMTMCMGCCTTTGTATTCATTTGAAGATA 

ATGCAGACTATGTTTATGATGTTATGTGGTCACCTACCCACCCAGCCCTGTTTGCCTGTG 

TGGATGGCATGGGGAGATTGGATTTGTGGAATCTCAATAATGACACAGAGGTACCAACTG 

CCAGCATTTCTGTGGAGGGTAATCCTGCTCTTAATCGTGTGAGATGGACCCATTCTGGCA 

GAGAGATTGCTGTGGGTGATTCTGAAGGACAGATTGTTATATACCGATGTGGGAGAGATT 

GCTGTTCCCCGCAATGATGAATGGGCACGGTTTGGCCGAACACTTGCAGAAATTAATGCA 

AACCGAGCTGATGCAGAGGAGGAAGCAGCTACCCGAATCCTGCTTAGTTCCTGAAAAGGG 

GAGTGTAACTAGTGGATTGGGAAAGGGTC 

Sequence 1780 

AGTAGGAACCAAGMAACTTCTTTTGCCAACTTTACAGGATATCTGGTAAACTATTACAT 

NGTCAGGCCAAACATGCTCCTTGCATTTTTGTTGGCTGAATNTGGGTACAGAGTGGTTCT 

ATACGATGGTAATAACCAACTTGNAATCAAAGGAAGNATTCCAACAGAAACAGATAGGAN 

AANGTCTTGAGAAGATATATNAAGGAATNTGTCACTTGTCACACNATGCCCGATCACCGG 

GACACAMTTCCTGCMGAAGGGACCACACCGGACTCTATTTTCCTACAAGNGCAGGAAA 

CCTTGTTCATTCTAAGCATGGTTTCTGNTTGCCCAGGTATTCAAAAAA 

Sequence 1781 

ACCCACGCGTCCGGCTGCGTTGGGCTTGCGTGCGGCTCGCTAAGACTATGGCGTCCGGGC 
CTCATTCGACAGCTACTGCTGCCGCAGCCGNCTCATCGGCCGCNNCAAGCGCGGGCGGCT 
CCAGCTCCGGGACGACGACCACGACNACNACCACGACGGGAGGGATC 
Sequence 1782 

CCGCGTNCGTTTGTGTTGMTGGNTTGTATACTTCTTTACACMCCTATCCATTACTTAA 

GGMTCTGCTCTTATTCTTCTACAAACTGNTCNGGAACAAANTGATATCAGAAATTNGAT 

AAMGMCTTCGMATGNTTGAAGGAGTNGAGGMGNTCATTGAATTNCATGTTTGGCAA 

CTTNGCTGGAAGCAGAATCATTTGCCACTG 

Sequence 1783 

GTCGACCCCGCGTCCGGGCCCGTCTACAAGGNTTGTAGATAAAATAGAAACATACCTTCC 

TTGAAMTGCAGMTAAATTTTTTAMGGCAGGMGGAAGTGTTTGAACCATGTGTCAAC 

AAGCTTTACTGTCAMGCAGGCTTTTGGTATGGGAAGAAAAATACTTATAAATACTNGTN 

TTMTATTTGCTTTATTAAAATACATTTAAAATACAGCATTTTTAMTCTCTMGCTCAA 

CTTGMGATATAAGMCAGTAAATTTGATAMMTGAGAMTTACATTCCCATTTCTTTA 

ACMTTTGTAAATTCCAATTATCCTGAACATTTAACACCATTTACATATTTTATTAATCA 

CAI I I I CI rAAACATTTGATAAGAGATTTAATATTTTGATCCAACTACCA 
Sequence 1784

GCGTeCGCGGACGCGTGGGTGTGGAGCCTGTTACCTCCGCATTTGACACGAGTCTGCTGT 

GAACCCCGCAACCTCCTCCCCACCTCCCATCTCTCCTTCCAGGCCCATCCCTGGCCCAGA 

GCAGGAGGGAGGGAGGGACAGATGGCGGTGGGTTTTTGTATCTGMTTTGCTGTCTTGAA 

CATAAAGAATCTATCTGCTGTTGAAAAAAAAAAAAAAAAAAAGGnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnTCACAAATAMGCAnnnnnnnnnnnnnnnnnnnnnnnnnGTTTGTCCAAACTCATC 

MTGGTATCTTATCMTGTCTNGGATCCCCCGGGTACCCGANGCTCGAATTTMTTTCCT 

TTTTTC 

Sequence 1785 

ACGCGTCCGGCTTCGTTCTCCCGTCCCTCCCCGCGCCTTGGCGCGGGGGGTCGACTAGCC 

MGTGAGGCGGGAGGCGACTCGGACCTTTCCCTGCATTTCGTTTCGGCCAGTGCCGGGGG 

CTACCCGCCCTGGGGCCTGGGATCCTTGGGGCCCGTGAGGCCCACTCTTAGCGGCCGGGG 

CCTACCGCGGCCCGCCGCTGGCCCTCATGAGGCATAGCCTGACCAAGCTGCTGGCAGCCT 

CGGGCAGCAACTCCCCAACCCGCAGTGAGAGCCCGGAGCCGGCTGCAACTTGTTCGCTGC 

CCTCTGACCTGACCCGGGCTGCAGCGGGGGAGGAGGAGACGGCGGCGGCNCGGATCTCCC 

GGCCGCAAGCAGCAGTTTGGCGACGAAGGAGAGTTGGAAGCCGGGAGGGGGGGAAGCCCG 

CGGCGGCGTGGCCGTGCGCGCGCCCTCCCCCGAGGAGATGGAGGAGGAGGCGATCGCCAG 

CCTTCCGGGGGMGAGACGGAAGGATATGGACTTrTTTTCTGGGCTGGA 

Sequence 1786 

ACGCGTCCGGCCAGGCAGTGATGGAATCCCGGGGTCNGCAGGAGAGAAGGGTGAACCAGG 

TCTACCAGGAAGAGGATTCCCAGGGTTTCCAGGGGCCAAAGGAGACAAAGGTTCAAAGGG 

TGAGGTGGGTTTCCCAGGATTAGCCGGGAGCCCAGGAATTCCTGGATCCAAAGGAGAGCA 

AGGATTCATGGGTCCTCCGGGGCCCCAGGGACAGCCGGGGTTACCGGGATCCCCAGGCCA 

TGCCACGGAGGGGCX)CAAAGGAGACCGCGGACCTCAGGGCCAGCCTGGCCTGCCAGGACT 

TCCGGGACCCATGGGGCCTCCAGGGCTTCCTGGGATTGATGGAGTTAAAGGTGACAAAGG 

AAATCCAGGCTGGCCAGGAGCACCCGGTGTCCCAGGGCCCAAGGGAGACCCTGGATTCCA 

GGGCATGCCTGGTATTGGTGGCTCTCCAGGAATCACAGGCTCTAAGGGTGATATGGGGCC 

TCCMGGAGTTCCAGGATTTCAAGGTCCAAAAGGTCTTCCTGGCCTCCAGGGAATTAAAG 

GTGATCAAGGCGATCAAAGG 

Sequence 1787 

AAGCTATTGTCACACCTTTGAAACCAGTTGACAACACTTACTACAAAGAGGCAGAAAAAG 

AAMTCTTGTGGAACAAGTCCATTCCGTCAAATGNTNGTTCTTCCCTGGAAGTTGAGGCA 

GNCATATCAAGAAAAACTCCANCCCAGCCTCAGAGANAGATCTCTTAANGCTTTCTGCTC 

AGAAGGATTTTGGAACAGAANAGAAAAGCATCATGGTAANAAATGAAAGCCAAGAGATGT 

GCCCACTCCTGTTAATTCATTCGATTGAAAATTCATACCCCTTCTAAGAAAAAATGAAAA

GTTTTNTAAACAACAAAAAAGAAAGCCCNNNAGGCAAGGAAAGGGCANGTGCTTCATNAA 

GNATACCTGNTGNAAAANAGGAAATGGCATCTTCCNCCAAGAAGAAAAAGCCCAANGGGT 

TTGTACCATACCTGGTGCNCTTTGTATGCNCACCCTGCCAMGGCATAAAGGTTTTCTAA 

AAAAAGTTACCTTGAGNGAAGCCAANGAAGCCTGGGAGAAAANAGTANTGNAAAAAATGC 

AAGCNAAAGAAGGNTGGGTGGGAAG 

Sequence 1788 

CCCCGCGTCCGAGCAAACATAAGAAACCTGAGTCATTTTGTCATTTAGAGTATTCTGATA 

AAATC TCTTGAAMTACTGAMTCAAMGGTTMTGATTTTTTGTTCATTCTGATTTGTC 

ATTTTATTATCTGTCTAGCAGAAAAATCAAATGGGTAAATTAGCACTTTAGACAGCCAAC 

ATAGTGAAACCCTCATCTCTACTAAAAGTTGGCAATTAAATCTGAATTTACAGATACAGA 

TAACAGTTTATCAGAMTCATATTTTTCCTGMGAAMTTTAAAATTAGGAGTTGTGGGC 

CTGGTGCGGTGGCTTACGCCTGTAATCCCAGCACTTTGGGAGGCTGAGGTGAGCAGATCA 

CTTGAGGTCAGGAGTTCGAGACCAGCATAGCCAACATAGTGAAACCCTCATCTCTACTAA 

AAGTACAAAAATTAGCCAGGTGTGGTGGCCGTGTGCCTGTAGTCCCAGCTACTCGGGAGG 
CTGAGGCGGGAGAATCGTTTGGAACCCG
Sequence 1789 

GACCACGCGTCCGGTCGGCGTCTGGGCCTCGTCCCCTTCTCTCTGTCTCCCTTGCCTCCC 

CCATCACGTCCCCTGACACCGACACCCCATTGCTCCCACAGTCTCCCCAGNCTCCACTTT 

GGTCCCCAGCGCTGTCTGCCCGAGGATTTGCCTGAAGGCTGCCCCCAACTCTGCACCCGC 

CCCCCGAGGGCCACCGAGGACCATGACTAAGACAGATCCTGCCCCGATGGCCCCGCCACC 

CCGAGGAGAGGAGGAAGAAGAGGAGGAGGAGGATGAACCCGTCCCCGAGGCCCCCAGCCC 

CACCCAGGAGCGCCGGCAGAAGCCTGTTGTGCACCCCTCGGCACCTGCCCCCCTCCCTAA 

GGACTACGCTTTCACCTTCTTCGATCCCMTGACCCGGCGTGCCAGGAGATCCTGTTTGA 

CCCTCAGACCACCATCCCCGAGCTGTTTGCCATTGTGCGCCAGTGGGTGCCCCAA 

Sequence 1790 

CGGGGTCTTCTTGCTGTGAGGTCGCGTTCCCCAGTGTTACGGAGGGTCCTTGAGGCAAGG 

AGTGAAAATTGGGTCTGGGGGTTAGTCCTGGGGTGGAGGTCTGGGCACGCCGGGTCGGAC 

CCCCTNCATCTTCGGNTTTGCACACCCCGCTTTCCAGCGCGGAGTCCGCGGCGGGTAGGG 

CNGGCGTCNCGTGCGTGACGTCATCCAGCGGCGCCTNGCGAGGCTNCAGTGGCCTTGACC 

TCCCGCGGNGTGGGAGGCTGCGCGGCGATGCTGCANTTTCGTCCGGGCCGGGGCNGCGGG 

CCTGGCTT 

Sequence 1791 

GGGTTATGAGAAGAACGCTCAGAGCAGAGCACCGAAAGTGGCCACTACCAGCATGAAGAG 

CCCAACMTTCAAACTGGNAGMGTGAGAAAAACAGAATGCAGCTTTCAAGGTTCGTTTC 

AAGCAGTTGGCTTGTGGGACTCTGAGAGATGCTGCTGNCCATGACATGCGGGAATTATCA 

TGATCMCTAC(X}AGCTTGGATTTCACCCAGTGGCCAACTAGCTTTGTGTGGGAGACX3GC 

MGGGTTGGATTTTTCAAAAGAGTAAACCAGACCCGTGACCAAGGTGTNAACTAAGAAGT 

GGAGTCATGCTTCACACGGNCNTATCNTGCTGGCAGCCATTCCTGGGTCTGGCTGTGGTG 

TTAATCTTCATGGGATCC 

Sequence 1792 

GTGCTGGTTTNTCTTGCAGATGCTGCTGCTAGGGGTGGTGGGAAGCAGCCGTGGGACGCG 

TGGCCGGNAGCGGNGGTGACAGCCTGGGANNNCGGGGGCTTNTCTTCCTTGTCCTCCTCC 

TNTCCTGTCTATTCCCAGNGNGGGGCGTGGCTGACACTAAAGACTNTGTANNCATCAACC 

CGAGTGCANNTTCNATGGAAAATGAAGGTTGCACGTTTTCAAAAAATACCTAATGGTGAA 

MTGAGACAATGATTTGCTGTATTGNCANCCAAAAAAAGCAAAGGGNATTNCCANTCATT 

GAAGNTGCAAGCATTCTNCMGCTGATCTTCNAAATGGGTCTAAACAAAATGTNAAGNTA 

AGTTCNTNAGGCNAGCCCTTTCATGGTTTNGAMTGAGNTTTCGNTT^ 

TGGAGGCCNCCNTGTNGGGAAAGAAANGNGTTTNTTTTCNCCAGGTTTTNAAAA 

Sequence 1793 

GTCCGTTTTACAACCTAGTAATAATGTGGATAAATGTATCTACATGACACATGTCAAGAC 

CAAAATAACTGTGAATGACACACCTTGCTGTAAATGAACTGTGCTAACCCTGACTGTGGG 

CTTGAGAACAAAGATGAACTCTAGAACTCTAGCAGCCTAACTGCTGCTTCTCAAATAACT 

GTGTGMCAGTGAGATATTACTGTTTGTTTCTAAAMTCCTACTGTGCCCAGTTTCCTTC 

ACTACATGCCCTGCATTTTTTATTTAMTATTTAGCTGTAGCGCCATCAGATATGGATGC 

CTTCTAACAATTGCTGTTTGTAAAATAAATCAGGATGGTAGAAAGTGATTATATGGAAAA 

TTGGMCCTGGATGAGACCTTTTCGTTGAATTCTGAAGAGTAATGATGTGAAAATTGATA 

CAGGGCAAGAGATGATTCTTTTGGGTTTTCTTCTACTTCATGTCCAGAAGAGTAAGAGGG 

GAAAA 

Sequence 1794 

TNGTTGCCTGCMTACTACACTTTACAMCAATGTTAACACTGTGATTCCTTCATTGTTT 

TAAGAAGTTAACCTAGGGCCGGGCATGGTGGCTCATACCTGTAATCCTAGCACTCTGGGA 

GGCCGAGGCAGGAGGATCCCTTTAGCCCAGGAGTTAAAGACCAGCCTGGGCAACATAGGG 

AGACCCTGTCI I I I I I I I GGGCAGCGTGGTGGGGGATAAATAAATAAAAGGAAAAAAAAC 

TTAGCCTAGAATTAGAATTMTTTMTTGAATTCATCTAAAGATGTCTCTGGTGATTTTT 

ATATGTTCCGCTATATAATTGATGCTTTATAGTTTTATCATAATCX)AACAACTTCAGTTA 
TATTTMTTATTGGTMNGGAGTTTMGACTNGAMGACTAGAGTGCTTTCTAGTCCAAA
TAGAGGGTCANGTGAMCCAGCTT1TTGACATCMGATTTTCATTTTGAGAAGGGANAAG 
CCTGTGGGACTGGGCTTAAA 
Sequence 1795 

GTCCGGAGAAAACTGCAACTTGCCCAGNTCCTCTTCTCCACCAATAGATGCAGCATCCGC 

AGAGCCCTATGGCTTCCGTGCCTCAGTGTTATTCGATACAATGCAACACCATCTAGCCTT 

GAATAGAGATTTGTCCACACCTGGCCTGGAGAAGGACAGTGGAGGGAAGACACCTGGTGA 

CTTTAGCTATGCCTATCAAAAGCCTGAGGAAACAACCAGGTCCCCAGATGAAGAAGATTA 

TGACTTGAGTTATGAGAAGACCACCCGGACCTCAGATGTGGGTGGCTATTACTATAGAAA 

ATAGAGAGAACCACAAAATCTCCAAGTGACAGTGGCTACTCCTATGAGACCATTGGGGAA 

AACTACCAAGACCCCTGAAGGATGGTGACTATTCCTATGAA 

Sequence 1796 

CCGGCNTAGGCGGGGGGAACACGCCGCCTGCGCTCTCTTGGGACCCTAGATTTGGGGGAG 

GAGGTAACGAGAGGCGGAGAGGGTGGCTCCTCAAATATACACCCCTCCTGTCCTCCGCCA 

CCCCACCTTTGATTTCTCTTCCCTCMCCCAGCACTCCAGCCCCACCCCAGGGTCAATTT 

TTGCCCCCTTCCATCTGAGCAGTGTTACCAGGCCCCAGGGGGACCGGAGGATCGGGGGCC 

GGGTGGGGGGTCCCATGGAGTACTCCAGCACACGCAGGGGCTCCCTGCAGACAGGGGGGG 

CCTTCGCCCTGGAAGCCTGGACGCCGAGATAGACTTTCTGAGCAGCACGCTGGCCGAGCT 

GAATGGGGGGTCGGGGTCATGCCGTCACGGCGACCAGACCGACAGGCATATGAGCC 

Sequence 1797 

TCCGATTNNGCCNAGGGTTGCAGTTTGTAGACCCCTGATCTAGACCCTTAAGTAGCCTTG 
TTTGTGCCTGAAGTTTACAGATGATCCCCMCTTAT^ 

AGTCTCTCTCTGGAGCCCAGGTTGGAGTACAGTGGCAAGATCTTGGCTCACTGCAACCTC 

CACATTCCGGGTTCAAGTAATTCTCCCGCCTCAGCTTCCTGAGTAGCTGGGATTACAGGC 

GTGTGCCACCATGCCAGCTTTTTTGTATTTTGTATGTTAGCCATGTTGGCAAGGCTGGCC 

TCAACCCCTGACCTCAAGTGATCTGCCCACCTCAGCCTCCCAAAGTGCTGGGATTACAGG 

AGTGAGCCACCATGCCCGAACCCCMCTTATGTTTGACTTACAATGTTTGGACTTTATGG 

ATGGTGCAAATGTTATATGCATTTCAGNAGAAACTGGGCTTCAAAATTTTGMTTTTTG 

ATCTTTTTATTTT 

Sequence 1798 

TCCGCTGCCGAAGTCAGTTCCTTGTGGAGCCGGAGCTGGGCGCGGATTCGCCGAGGCACC 

GAGGCACTCAGAGGAGGTGAGAGAGCGGCGGCAGACNACAGGGGACCCCGGGCCGGCGGC 

CCAGAGCCGAGCCAAGCGTGCCCGCGTGTGTCCCTGCGTGTCCGCGAGGATGCGTGTTCG 

CGGGTGTGTGCTGCGTTCACAGGTGTTTCTGCGGCAGGCGCCATGTCAGAACCGGCTGGG 

GATGTCCGTCAGAACCCATGCGGCAGCAAGGCCTGCCGCCGCCTCTTCGGCCCAGTGGAC 

AGCGAGCAGCTGAGCCGCGACTGTGATGCGCTAATGGCGGGCTGCATCCANGAGGCCCCG 

TGAGCGA TGGA ACTTCGACTTTGTCACCGAGACACCACTGGGAGGGTGMCTTTTNCCTT 

GGAANCGTTTTGCGGGGGCCTTGGCCTGCCCAAGCTNTACCTTTTCCAACGGGGCCCCGG 
CGAG 

Sequence 1799 

GGCGNAGCTCGNCTTCCTCCNCGCCCAAGTTCCGGCGCCGCTCTTGCGGGAGCGTGCCGC 
ATCACCCCGGGGGCCCCTACGCGAGGATCTCCGGGGCCGTTGGCAGCAGCCTG 
Sequence 1800 

TCACCCCGCGTCCGGGCGGGCGTGGGGCGGTGGGAGGTAGTGAAGAAGGGTCGGCGGCCT 

GGGGTCGGCGCCGGCGCCGGCGGCCGAGGAGGCGGTAGGAACCGCAGGGCGCTCGGGGAA 

GCAAACNGAGTGTGGAAATACNACCTGACCCCTGCAATCCAGACCACAAGCACCCTTTAT 

NAGCGGNGCTTTNAGAATATCATNAAGNGGNTTTAATAAGGAGCAGGTCCCACCCCCTGC 

TGTGGAA 

Sequence 1801 

GTCGACCCACGCGTCCGGGAGCAGAGTCGACTGGGAGCGACCGAGCGGGCCGCCGCCGCC 
GCCATGM(X)CCGAATATGACTACCTGTTTAAGCTGCTTTTGATTGGCGACTCAGGCGTG 
GGCAAGTCATGCCTGCTCCTGCGGTTTGCTGATGACACGTACACAGAGAGCTACATCAGC

ACCATCGGGGTGGACTTCAAGATCCGAACCATCGAGCTGGATGGCAAAACTATCAAACTT 

CAGATCTGGGACACAGCGGGCCAGGAACGGTTCCGGACCATCACTTCCAGCTACTACCGG 

GGGGCTCATGGCATCATCGTGGTGTATGACGTCACTGACCAGGAATCCTACGCCAACGTG 

AAAGCAGTGGCTGCAGGAGATTGACCGCTATGCCAGCCGAGAACGTCAATAAAGCTCCTG 

GTGGGGCAACAAAGAGCGGACCCTCACCACCAAAGAAGGTNNGTGGGACCAACACCACAA 

GCCAAGGGAGGTTTGGCAAGACTTCTTTTGGGGCATTC 

Sequence 1802 

NCCCCGCGTCCGCGGACGCGTGGGCGGAGCTGCTGTGCAGTGGAACGCGCTGGGCCGCGG 

GCAGCGTCGCCTCACGCGGAGCAGAGCTGAGCTGAAGCGGGACCCGGAGCCCGAGCAGCC 

GCCGCCATGGCAATCAAATTTCTGGAAGTCATCAAGCCCTTCTGTGTCATCCTGCCGGAA 

ATTCAGAAGCCAGAGAGGAAGATTCAGTTTAAGGAGAAAGTGCTGTGGACCGCTATCACC 

CTCTTTATCTTCTTAGTGTGCTGCCAGATTCCCCTGTTTGGGATCATGTCTTCAGATTCA 

GCTGACCCTTTCTATTGGATGAGAGTGATTCTAGCCTCTAACAGAGGCACATTGATGGAG 

CTAGGGATCTCTCCTATTGTCACGTCTGGCCTTATAATGCAACTCTTGGCTGGCGCCAAG 

ATAATTGAAGTTGGTGACACCCCAAAAGACCGAGCTCTTCTTCAACGGAGCCCAAAAGTT 

ATTTGGCATGATCA 

Sequence 1803 

CGCGTCCGCGCTTCTGTTACGGCCAGTGCAACTCTTTCTACATCCCCAGGCACATCCGGA 

AGGAGGAAGGTTCCTTTCAGTCCTGCTCCTTCTGCAAGCCCAAGAAATTCACTACCATGA 

TGGTCACACTCAACTGCCCTGAACTACAGCCACCTACCAAGAAGAAGAGAGTCACACGTG 

TGAAGCAGTGTCGTTGCATATCCATCGATTTGGATTAAGCCAAATCCAGGTGCACCCAGC 

ATGTCCTAGGAATGCAGCCCCAGGAAGTCCCAGACCTAAAACAACCAGATTCTTACTTGG 

CTTAAACCTAGAGGCCAGAAGAACCCCCAGCTGCCTCCTGGCAGGAGCCTGCTTGTGCCG 

TAGTTCGTGTGCATGAAGTGTGGATGGGTGCCTGTGGGGGTGGTTTTTAGGACACCAGAA 

GAAMCACAGTCTCTTGCTAGAGAGCACTCCCTATTTTGTAAACATATCTGCTTTAAGGG 

GGATGTACCAGAAACCCACCTTACCCC 

Sequence 1804 

CCCGCGTCCGTAGATTAAATTATGCAAGTTGCAAGAATGTAGTAACTCTGATCAGCTACA 

AGGAAAGGAGGAAAGAGTAAATGAAGAAAGTCATCTAACTGAAAAGGAATATATAGAACA 

TTGTAACACCCCTACAACTGATTCTGATTCATCTATAGCAGTTAAAGCACTACAAATAGA 

TAGCTTTGGTTTAGTTACATGCTTTCAACAAGAGTCTCTTGATGTTTCTCAMTGATACT 

TGGAAAATCTCAGCAACCTGAGTCAAAAATGCAATCTGAATTTATAAAAGAAAAAAGTGC 

TACTTGTTCAAATGAGGAAAAAGGTAACTTAAACGAGGTCAGTAATAACTGAAGAGAAAG 

AAACAGATGGGAGATCACCCTATCTTCATTACTGAACCAAAACTACCNGTTCACAATATA 

CCTGGATTCGACAGCATAAAAGAAACC 

Sequence 1805 

GCGTCCGGCAGCTGAAAGGGGATTTGGGCCCGGAAGATCCGAGtCCATCCGCGGCGGGGA 

GAGGGCAAGCGGGACCGGTAGGGGCCGGAGCAGCGGCGGCGGCGCTCGGACTGTCCCATC 

CGCCCCGTATTGAGGCGCTGGGAGCGGCGGGGCGACAGGAAAGCGATGGTGAAAGCGGGG 

CCGTGAGGGGGGCGGAGCCGGGAGCCGGACCCGCAGTAGCGGCAGCAGCGGCGCCGCCTC 

CCAGAGTTCAGACCCAGGAAGCGGCCGGGAGGGCAGGAGCGAATCGGGCCGCCGCCGCCA 

TGGAGCTGAGAGTCGGGAACAGGTACCCGGCTGGGCCGGAAGATCGGCAGCGGCTC 

Sequence 1806 

GTCGCCCCGCGTCCGCACAGTTGATTCTGMTTTTTAAGGCTTTCCTAATAGGCTGATCA 

CAGAGAATAATCCATTTTGAAGGTATAAAACTGCACTGTATGTCTGTCACTTGTAGCTGA 

ACTGATTCACATTTTGACAAAAGAGAGAAAATACAAAAATGAGTTTTGCAAATGTAATAA

CTTTTTCTGCATATAGAACTAAATAATTGAAAAATATGGGCTATAGTTCTCAAAGGTAGA 

TAGTAMATCACTGGCTTTTTCCAGCTGTATGTTTTTCCACTGTGCGTGTACACACACAC 

TGGAAMTMTTAGGCTGATTTTGCAGGTCTTCATTGTTAGAGATTCTGAAGTATTTACT 

GTCMTTCATAGGTTTCAGTTTATTTAGGAAATTAGTGTTTGACAGCI I 1 1 1 1 IAAATTA 
TTTCACTGAAGCTGAGATTATTAGTGATCAAAGTTAAMTTTCAATATTTAATTTCTCTA

TATATTATTAATATTAAAATGGTTT 

Sequence 1807 

GTCGACCCCGCGTCCGGTACTACATCCCCTGAAGAAACATGTGCCCAGGTGATTCGTGAA 

GCTAAGAGAACAGCACCAAGTATAGTGTATGTTCCTCATATCCACGTGTGGTGGGAAATA 

GTTG GACC GACACTTAMGCCACATTTACCACATTATTACAGAATATTCCTTCATTTGCT 

CCAGTTTTACTACTTGCAACTTCTGACAAACCCCATTCCGCTTTGCCAGAAGAGGTGCAA 

GMTTGTTTATCCCTGATTATGGAGAGATTTTTMTGTCCAGTTACCGGATAAAGAAGAA 

CGGACAAAATTTTTrGAAGATTTAATTCTAAAACAAGCTGCTAAGCCTCCTATATCAAAA 

AAGAAAGCAGT7TTGCAGGCTTTGGGAGGTACTCCCAGTAGCACCACCACCTG 

Sequence 1808 

CCCCGCGTCCGGCCTTTAAAAGAAGACTTGAATTCCTATGGAACAATAAAGACACAGCAG 

AAAACAGGGATTCTCCTGTTTCAGAGGAAATAAAAATGACCTGTCAACAATrTATCCATT 

ATCACCGTGACCTCTGTATCCGAAACATTGTCAAAGAAAGAAGGTGTGGTGCAAAGACTT 

CTGCTGGAACTTTCTGTGGCTGTGACCTGGTGAGCTGGCTAATTGAAGTCGGCCTTGCCT 

CCGACCGTGGTGAAGCTGTGATATACGGAGACAGGCTGGTACAAGGGGGAGTCATCCAAC 

ATATTACCAACGAGTATGAATTCCGGGATGAGTACTTGTnTACAGATTTCTTCAAAAGA 

GTCCTGAACAGAGTCCTCCTGCTATTAATGCAAACACTCTCCAACAGGAAAGATATAAAG 

AAATTGAGCATTCATCCCCACCCTCACATTCCCCTAAGACCTAAATTATGCAGGGGAGAA 

CCCTACATGGAATCAT 

Sequence 1809 

CGCGTCCGCTGGAGTGCTGCTGAGGAGCGANGGGCCCATCTGGGGTCTCTGGAAGTCGGT 

GCCCAGGCCTGMGGATAGCCCCCCTTGCGCTTCCCTGGGCTGCGGCCGGCCTTCTCAGA 

ACGMGGGCAGTCCTTCCACCtXJGCGGCGCAGGTGACCGCTGCCATGGCTTTTCCCCATC 

GGCCGGACGCCCCTGAGCTGCCTGACTTCTCCATGCTGAAGAGGCTGGCTCGAGACCAGC 

TCATCTATCTGCTGGAGCAGCTTCCTGGAAAAAAGGATTTATTCATTGAGGCAGATCTCA 

TGAGCCCTTTGGATCGAATTGCCAATGTCTCCATCCTGAAGCAACACGAAGTAGACAAGC 

TATACAAGGTGGAGAACAAGCCAGCCCTCAGCTCCAATGAACAATTGTGCTTCTTGGTCA 

GACCCCGCATCAAGAATATGCGATACATTGCCAGTCTTGTCAATGCTGACAAATTGGCTG 

GCCGAA 

Sequence 1810 

CGCGTCCGGTGCATCTGAGGACTGGTGGGAAGGCANGGCACAACGGGATTGACGGGCTGG 

TGCCTCACCAGTATATAGTGGTGCAGGATATGGATGATACGTTTTCAGACACTCTGAGCC 

AAAAAGCCGACAGTGAGGCCAGCAGTGGGCCAGTCACGGAAGACAAGTCCTCATCCAAGG 

ACATGAACTCCCCGACAGACCGTCATCCTGACGGCTATTTAGCCAGGCAACGAAAAAGAG 

GAGAGCCACCCCCTCCAGTAAGGCGTCCTGGCAGGACCAGTGAT 

Sequence 1811 

TCAGGAGTCGACCCCGCGTCCGGAAGGCCGATGCTGTGGGGGTGGGCGTGGAGAGAATTC 

TTCTGTGGGTCCTCTGGTGTTGAGTGGTCGGCTTGGTGTGGTGTGCGGAGGAGCTCCAGG 

CCCGTCGGCGCGGAGTGGTCTCACGTGTGAAACATGGCTACAGATTGGCTGGGAAGTATT 

GTGTCCATCAATTGTGGAGATAGCTTGGGTGTCTATCAGGGAAGAGTGTCAGCTGTGGAT 

CAGGTCAGCCAGACCATTTCTCTCACCCGGCCTTTCCATAATGGAGTGAAGTGTCTTGTT 

CCAGAAGTCACCTTCAGGGCAGGTGACATTACGGAGTTAAAAATTCTGGAGATACCAGGA 

CCTGGAGACMCCMCATTTTGGAGACCTTCATCAAACAGAATTAGGCCCCTCTGGTGCT 

GGCTGCCAAGTGGGCATCAATCAGAATGGCACAGGCAAGTTTGTCAAG 

Sequence 1812 

CCGCGTCCGCCCAGTCCNAGTGCTGGCTTTCCCTGTATCTGCCTCTGCCAGGCAACACTT 
ATCATGGCTCCCAATCAGCAGGAGCCTCCATGCTCCACTTTGAACAGCCTCTATGCTCCA 
GCAATGGGGCATTTGTGAAGAGTGACTTGATTAACTTTTCTGACCATGGGTATAATACAG 
TTGCTTCAGAGGGC AGTGGTT CTGGGTGTG AI I 1 1 I A CACTGTAACATTGTATACAGTGT 
CATGGATAATTACTATTTTTTTCTGGTCATTAACACTCACCTACTCTAGTACTAGGATTT 
CAGACCMGGTCCTCATGACGCCTGGATATTTTAGTATCTATATCCAATAATCTTTTCTC

TCCTACTGMTATCCAGGCAMGATGAAATCGTTTTCTTTAAAACTGTCAAATTCTGTAA 

AACTCAGGAGCCAGTTCAAGGGAACAAGCATCTTCACAATAGATGGAATCAAGAGTTAAA 

TGTTATAGTGGCAAGCTTGTCTACTGGGCAACAGAC 

Sequence 1813 

CGCGTCCGCCGCCGCGTCCGCTCCCGGTCCCTGGCCCCTCAGCGGCATGGCGTGCGGGGC 

GACGCTGAAGCGGCCCATGGAGTTCGAGGCGGCGCTGCTGAGCCCCGGCTCCCCGAAGCG 

GCGGCGCTGCGCCCCTCTGCCCGGCCCCACTCCGGGCCTCAGGCCCCCGGACGCCGAGCC 

GCCGCCGCCGTTTCAGACGCAGACCCCACCGCAGAGTCTGCAGCAGCCCGCCCCGCCCGG 

CAGCGAGCGGCGCCTTCCMCTCCGGAGCAAATTTTTCAGAACATAAAACAAGAATATAG 

TCGTTATCMGAGGTGGAGACATTTAGAAGTTGTTCTTAATCAGAGTGAAGGCTTGTGCT 

TCGGGAAAGTCAACCTCACTCCTCAGCACTCACAGCACCTAGCTCTTCCAGGTTCCTCAT 

GGATGAAGAAGGACCAGCCCACATTTACC 

Sequence 1814 

CGCGTCCGTTGMGAATAATATTGTATGTGCATTTTATCCATTMTGTTTCATACTTTCT 

GAGAGTAT MT ACCCT TTTAAAAGATATTTGGTATACCMTACTTTTCCTGGATTGAAAA 

CI I 1 1 I I I AMCTTTTTAAMTTTGGGCCACTCTGTATGCATATGTTTGGTCTTGTTAAA 

GAGGMGAAAGGATGTGTGTTATACTGTACCTGTGAATGTTGATACAGTTACAATTTATT 

TGACAAGGTTGTAATTCTAGAATATGCTTAATAAAATGAAAACTGGCCATGACTACAGCC 

AGAACTGTTATGAGATTMCATTTCTATTGAGMGCTTTTGAGTAAAGTACTGTATTTGT 

TCATGMGATGACTGAGATGGTAACACTTTCGTGTAGCTTAAGGAAATGGGGCAGAATTT 

CGTAMTGCCTGTTGTGCAGATGTGTTTTCCCTGAATGCTTTCGTATTAGTGGCGACCAG 
T 

Sequence 1815 

GTCGCCCCGCGTCCGATTTAAACTGGGTCTTTATAAAAGTAAMTGGCCMCATTTAATT

ATTTTGCMAGCMCCTMGAGCTAAAGATGTMTTTTTCTTGCNATTGTAMTCTTTTG 

TGTCTCCTGAAGACTTCCCTTAAAATTAGCTCTGAGTGAAAAATCAAAAGAGACAAANGA 

CNT IsrrrC GANTCCANNTTTMGGCTGGGGGAMNTGGGTTTTTTAGCNCMCCNTTNCM 

AAGTTTTNTTTNGGGGATTCATAACANCNCCACCNAATTGNTTTTTTGT^ 

CATTTCAATACTAGTTATATTNANNAGGAGTNGGTAGAGAGGAAACATTTGACTTATCTG 

GNAAAAGCAAACTTGTACTTAAGAATAAGAATAACATGGNCCATT 

Sequence 1816 

TCGACCCCGCGTCCGCTCTGCTCCTTGTCTCCNTCTNCCTTTTTCTGTCTTTGCCGGGTC 

TCTGGGTCTCTGACCCCCATCCGGCCCTCATGGCTTTGGGTCNGGAGCTNTTGAAGCAAT 

GTTCATCAT 

Sequence 1817 

CCACGCGTCCGGGGGAGCCGGACGCCAGAGTCCCCTCTCCACGCCGTGCAGCTGCGCTGG 

GGCCCCCGGCGCCGACCCCCGCTGCTGCCGCTGCTGTTGCTGCTGCTGCCGCCGCCACCC 

AGGGTCGGGGGCTTCAACTTAGACGCGGAGGCCCCAGCAGTACTCTCGGGGCCCCCGGGC 

TCCTTCTTCGGATTCTCAGTGGAGTTTTACCGGCCGGGAACAGACTGGGGTCAGTGTGCT 

GGTGGGAGCACCCAAGGCTAATACCAGCCAGCCAGGAGTGCTGCAGGGTGGTGCTGTCTA 

CCTCTGTCCTTTGGGGGTGCCAGCCCCACACAGTGCACCCCGATTGAATTTGACAGCAAA 

GGCTCTCGGCTCCTGGAGTCCTTACTGTCCAGCTCAGAGGGAGAGGAGCCTGTGGAGTAC 

AAGTCTTTGCAAGTGGTTTCGGGGCCAACAGTTCGAGC 

Sequence 1818 

TCGACCCACGCGTCCGGTGAAACACAAAACCAAGGAGTACATTAAGAAGTACATGCAGAA 

GTTTGGGGCTGTTTACAAACCCAAAGAGGACACTGAATTAGAGNGACTGTTGGGCCAGGG 

TGGGAGGATGGGTGGTCAGGTAANGACAAGACTCTAGGGNAGAAGGAAANCCTGTGGGCC 

TTTNTGTCCCACCCCTGTCAGCACTGGTGCTACTGATTGATACATNACCCTGGGGGGNAA 

TTNAACCCTGCCAGNATGTCAACNTGGAANGGCCACAAAGAAGTGGAACCTCCCATCTAC 

AAMNNGAGTTTACNCTTANATTTTGTAGAAGCCTNGTTTGGCCATTGTTGCNNNTAGAN 
AGTCCATNNANTAGGNGGCAAGGGGGCCTNTANTAAAAATGAACCCCTGGNACAGAACCT
TGNACTTNCACATTCTTTAAAANCCCTGGGAGATGNTTTGCTTTCTNTGGGNCGNGTTGN 
TTTGNTTCAAGCTGCTACNAAGTAACTCTCAATGGCCGGCAATTATCCCCAACTCNCACN
AAACTCCNTTTTTAACCCCTGGCANGGAATCCTTGCAAATTTAATAATTTTTTAAATGGG

Sequence 1819 

GTCCGGCTTTAGTGMTTCTTAATAGATNGTATATATAAAAGTACATTTTAATAGAAAGC

CAGGGTTTTMGGMTTTCACATGTATAAGGTGGCTCCATAGCTTTATTTGTAAGTAGGC 

TGGATAAATGGTGCTTAAATGGTAATGTACTCCACTTCTTCCTATTGGAAGATTAACATT 

ATTTACCAAGAAGGACTTAAGGGAGTAGGGGGCGCAGATTAGCATTGCTCAAGAGTATGT 

AAAAAAAAAAAAAAAAAAAANGAACCAAACCACTGGAAATAATCAAATGCAAAAAGGTAA 

CAAATTCATAACTGGAAAGCAAAGAGAAGAACAAGTATGATTTGGATGATAAAGCATTGT 

TTTAATGGTGAAAACTTCACCAGATCACTTAATGTTTCTAGGAGGTTAACTTCAAGTGGG

CAANTGGGGGTTTTTAGGTAGGTCAGTGGCCCTAAGTTCCTAAAGCCCACAGATTAGGGA 

TCTGTTAMCTGMTGGTCTGTTGGAAAGGTTTTGTTTTAACTGCTTGGGAGGCTTTCCT 

TTTAAG 

Sequence 1820 

GCGTCCGGGAAAAGTTTGCCTTCCANGCCGAAGTTAACAGAATGATGAAACTTATNATCA 

ATTCATTGTATAAAAATAAAGAGATTTTCCTGAGAGNACTGATTTCAAATGCTTCTGATG 

CTTTAGATMGATMGGCTAATATCACTGGACTGATGAAAAATTGCTCTTTNTTGGAAAT 

GGAGGAACTAACAAGTCAAAAATTAAAGTGTGATAAGNGAGAAGAACCCTGCTGCATGTC 

ACAGACACCTGGTTGTCTGGAAATGACCAGTAGAAGAGTTGGGTNTAAAAAACCTNTGGT 

ACNCATTAGTCCAAATCTGGGACMGNCGGAGTTTTTTAAAACAAAAATTGACTTGAAAG 

CCACCAGGGAAAGATGGCTCAGTTCAMCTTTTTTGGAATNTGGATTGGGCCCAGTTATG 

GTGGTCCGGTTTTNCTATTTCCCNCCTTTCCCTTGTAGCAGATTNAAGGTTTATTTNGTC 

ACTTTCAAAAAACAACAAACAAACNGATTACCCCAAGCAACATCTTGGGGGAGNTCTTGA 

CTTCCAAATG 

Sequence 1821 

CGTCCGCGGGTAAAATGTTATGGTMGCATGCACANGTTTGCAGTCTACAGTTTTTTTAT 

GTAGCACAAAATAGGTGTACCTTTATMGTACATTCAATTTTATGATTTACATTTATCAT

GTAATTTTTAAAAAAATCCATCTATCTAGGATATGTTGGATACAAAGTCTGCTTTTGCTA

TTCTTTTTGCTTAMTACTCCTATCATTTTCTGAATTACTTGGTATTTAGAACTCX5TAGC 

ACCACGGGGAAGAATAGAGGTATCATCAAACGTGGCAAATTTTCTTTCAGGAATAATAAA

GAGCATGATTCCACAGCTTTTCTGGGGATGTTTGAGATTCTTTTTTAGTACTAAGCAAAA 

TTCTCATCACAGGAATGTAGCCCAGGCCAATTTATAACTAAATCTCTATTTTGTTCGGAT 

GATGCTTCTAAAACAGCATTGATAGGTTAMGAAGCTTGGGTATTTTTAATTTACTTCAA 

TGATTAGCTCAATTGCTTCTGGGAGTTTTAATCCTGTGGATATGTCAT 

Sequence 1822 

GGGCGCCCCGCGTCCGCTGATCTCGGGCTCCTATTTCATTTACATTGTGTGCACACCAAC 

GTAACCAGTGGGAAAACTTTAGAGGGTACTTAAACCCCAGAAAATTCTGAAACCGGGCTC 

TTGAGCCGCTATCCTCGGGCCTGCTCCCACCCTGTGGAGTGCACTTTCGTTTTCAATAAA

TCTCTGCTTTTGTTGCTTCATTCTTTCCTTGCTTTGTTTGTGTGTTTGTCCAGTTCTTTG 

TTCAACACGCCAAGAACCTGGACACTCTTCACTGGTAACATATTTTGGCAAGCCAACCAG 

GAGAAAAGAATTTCTGCTTGGACACTGCATAGCTGCTGGGAAAATGAACATCAGTGTTGA 

TTTGGAAACGAATTATGCCGAGNTTGGTTCTAGATGTGGGAAGAGTCACTCTTGGAGAGA 

ACAGTANGAAAAAAATGAAGGATTGTAAACTGAGAAAAAAGCAGATGAAAGTGTCTCACG 

AGCTATGTGTGCTCTGCTCAATTCTGGG 

Sequence 1823 

CCCCGCGTCCGGTCCTTTGGCACTGTCATNTGTGTCCCTCGAGTGAGCNTCACCAGAGCT 
GCAGTAANNGNGCACCTACTACNGGCTCTGNGCTGAGTCCTTCCAGTGNGCCTCTCACTG 
AATNCTCACCCCACTGNCATGAGGTTTNCCCCATTTGACTGATGAGGGTGNAGAGCCAGG 
GAGCCTTGNTCACTGGTTCATTGANTACATTTACAMTATTATTNACAGAGTGGGAGAAG
AGCCGTATAGGGNCTTAATGCCATGGTNGGGACTTTGGAATTTAATTCAAGTGATGTGGG 
AGTACCTGCCAGATGGATGGAGGGTGAAATAATGCTTAAGCCCTCAGCC 
Sequence 1824 

CGCCCACGCGTCCGCACCGGCCCGATGAGCTGCAGCGGCTCCGGCGCGGACCCCGAGGCG 

GCTGCCGGCCTCCGCCGCCTCGGCCCCGGGCCCCGCGCCCCCGGACTNGGCTCCCGCCGC 

GCTGCCCTCCAGCACCGCCGCGGAGAACAAAGGNCAGCCCCCGNGGGGACAGGNNGGGGA 

GGACCTGGGNGCTNGGAGCAAGCATGCTGGGGGGCACAGGGACCCTTTGGCGGGAAGCCG 

GGCCGGTAGAGCNCAGCNTGAAGCAGGNTNTGAGGAGCTGGNTCCCTGNGNATNGGNNGA 

GGATGGGCCGCNGGANGCCCTCTCCGNGAAGGGGGGNGCTCAATAATCTTAACCGGGGGG 

TTTNAA 

Sequence 1825 

CGCGTCCGGCTGAAGGCTCCCCTGGGNTTNCTGGCCTCCTGGGGCAGAANGGGAGAGAAA 

GGCGATGCTGGCAACTCCATTGGAGGAGGCAGAGGGGNACCTGGCCCTCCAGGGCTCCCT 

GGGCCCCCAGGGCCAAAGGGAGAAGCAGGTGTCCGATGGCCAGGTTGGCCCCCCAGGGCA 

GCCAGGAGACAAGGGGGAGCGTGGAGCAGCTGGAGAACAGGGACCAGATGGCCCCAAGGG 

CTCCAAGGGAGAACCAGGGAAAGGAGAGATGGTGGATTACAATGGAAACATCAATGAGGC 

TCTCCAGGAGATCCGGACGCTGGCCTTGATGGGGCCTCCTGGTCTTCCTGGGCAAATTGG 

CCCACCTGGAGCTCCAGGGATTNCAGGCCAGAAGGGGGAGATTTGACTGCCANGGCCTTT 

CAGGACACGATGGGGAAA 

Sequence 1826 

GTCGACCACGCGTCCGG 1 1 1 1 1 1 1 1 1 1 1 1 1 I GAGACAGAGTTTTGCTCTTGTTGCCCAGG 

CTGGAGTGTGATGGCTCGATCTTGGCTCACCACAACCTCTGCCTCCTGGGTTCAAGCAAT 

TCTCCTGCCTCAGCCTCTTGAGTAGCTTGGTTTATAGGCGCATGCCACCATGCCTGGCTA

ATTTTGTGTTTTTAGTAGAGACAGGGTTTCTCCATGTTGGTCAGGCTGGTCTCAAACTCC 

CAACCTCAGGTGATCTGCCCTCCTTGGCCTCCCAGAGTGCTGGGATTACAGGTGTGAGCC 

ACTGTGCCGGGCCCGTCCCCTCCTTTTTTAGGCCTGAATACAAAGTAGAAGATCACTTTC 

CTTCACTGTGCTGAGAATTTCTAGATACTACAGTTCTTACTCCTCTCTTCCCTTTGTTAT 

TCAAGTGTGACCAGGATGGCGGGAGGGGGATCTGTGTCACTGTAGGTACTGTGCCCAGGA 

AGGC 

Sequence 1827 

CGACCNCGCGTCCGGCACTCTGTTCTTCCGCCGCTCCGCCGTCGCGTTTCTCTGCCGGTG 

AGCGCCCCGCCCCGGGGCCTGAGCTGGACGTCGCAGGCCTGCGCCCCCCGACCCCGGCTG 

GCCCCGCTTCCAGCTGCCGAGGCCTCGTCGCGCCTTCCCCGGGAACAAAAGGCGGGGTCG 

CAATGGAAGAAGAGATCGCCGCGCTGGTCATTGACAATGGCTCCGGCATGTGCAAAGCTG

GTTTTGCTGGGGACGACGCTCCCCGAGCCGTGTTTCCTTCCATCGTCGGGCGCCCCAGAC 

ACCAGGGCGTCATGGTGGGCATGGNCCAGAAGGACTCCTACGTGGGCGAC 

Sequence 1828 

CNCCACGCGTCCGGACCGGGGAAGACGCCTCTTCGCCGCCTCCGAAAACCGAGGCAGCGA 

GCGACCCCCAGCATCCCGCGGCCTCCGAAGGGGCCGACGCCGCCGCCGCCTCGCCGCCAC 

TGCTGCGCTGCCTAGTGCTCACCGGCTTTGGAGGCTACGACAAGGTGAAGCTGCAGAGCC 

GGCCGGCAGCGCCCCCGGCCCCTGGGCCCGGCCAGCTGACGCTGCGTCTGCGGGCCTGCG 

GGCTCAACTTCGCAGACCTCATGGCTAGGCAGGGGCTGTACGACCGTCTCCCGCCGCTGC 

CTGTCACTCCGGGCATTGGAGGGCGCCGGGGTGTTNTGATCCGCAGTGGGCGAGGGGAGT 

CAGCGACCGCAAGGCAGGAGACCGGNTGATGGTGTTGAAC 

Sequence 1829 

GGTGTCGCCCCGCGTCCGCTTGTC 1 1 1 1 1 1 GGGGGTGTGAATTTTTTGCATTGTTCTGAT 
CA TATT CTTTATCATGTAATTTATGTTC I 1 1 I 1 1 ACTAAGTATTATGTGTGGTTATTATA 
GATTTTCACAAAGATATATTGCTGGTAATATATTTTATTGTGTAGTCTTATAATTTACTT 
AACCTTCTTTCMTTGTTAGAAATTTAGGCTATTTCCAGATTTTCAGTATTGTAAATAAT 
GCTGTGATGACCMTTTTGTGAATAAMTGTTTTTATGTATTTCAGATTATTCCCTTAGG 
ATAGTCTCTCAGTGCCMGTTGTCAAAMCATCTCTATTTTGCTTATCTTCCTGCTCTCT

TGCTGCCTTAGGGGGTAGTAAACTGAAACATAAAGTAAACATGCATACAAATAAAAAACA 

TAAAACAAAAATAAGCAACCTGATGGTAATAGGTGAAAGTGGTAACCTGTTTTAACTTTG 

AATTCTTGCCCGGCGCGGTGGGCTCACGCCCTGTAATCCCAGCA 

Sequence 1830 

CGCGTCCGGTAAACCAGCCGGAGCGGCGCGGNAGCGGCAGGACCGCCGTGGCGCCTAGAG 

TAGCAGACCCGGGGGGAGCGCGGGGCGACGCTGGCTGCAGGGACCCGGTGACAGCGTGAG 

AGGTTTTGACAAGCTTGCATCATGCGTGAGTATAAGCTAGTCGTTCTTGGCTCAGGAGGC 

GTCTGGAMGNCTGCmGACTGTACCMTTTTGTTCMGGMTTTTTGTAGAAAAATAC 

NGATCCTACCGATAGGAAGATTCTTATAGAAAGCAAGTTTGAAAGTAAGATGCACAACCA 

GTGTATGCTTTGAAAAATCTTTGGATACCTGCCAGGNAACCGGGAGCCAATTTACANCCA 

ATGGAGGGGGATTTATMCATGNAAAAAAATGGGACAAAGGGATTTTGCATTTAGTTTAA 

TTNCMTCACCAGACACMGTTCCCACCMTTTTAAAACGAATTTTACCAAAGAACCCTG 

GAGGAGNAACCAAGAANTTCTTTNNGAAGGTTTAAAAAGGAACCACCTTGAATTGGAATG 

GTTTCCCAAATG 

Sequence 1831 

CCAATTATAGACTATATAGGGGGAAGAGCACTGGATTTGGAGTCAAGAAACCTGGACACT 

TGGCTCCACACTTCCTTAGCTGGGTAACTTTGGGCAAACCGCTTGGTCTCTCAAGCCTAA 

GGTTCTTCAGCTATAAAATGGGAATAATACTTCACTAACTACCTCACAGAGTTGTGGTAA 

GAATATAATCAGATAACTGGATAAAAACACTATATAAACTGGAAAGCGCCGTACAAATGT

GAGAGATCAGTTTTATTATCAMTCACTGTTTTCCACTGCCTCTTGAATCGGCTTTATTC 

TAACCMCCATTACATCTTTCTCATCTTTTGGAGTATGGGTAATTGAGGCTTGGGTGTGT 

CATCAGGGACTGGGAAGTTATTTCAGCTCCCATGTAAAAGGTGGGAGAGGTGGTTTGTGG 

GNGCAG 

Sequence 1832 

GNGTCGACCNCGCGTCCGCTATTTACTACCTCCTTATGAGGAAGTGGTGAACCGACCTCC 

AACTCCTCCCCCACCATACAGTGCCTTCCAGCTACAGCAGCAGCAGCTGCTGCCTCCACA 

GTGTGGCCCTGCAGGTGGCAGTCCCXJCGGGCATCGATCCCACCAGGGGATCCCAGGGGGC 

ACAGAGCAGCCCCTTGTCTGAGCCCAGCAGAAGCAGCACAAGACCCCCAAGCATCGCTGA 

CCCTGATCCCTCTGACCTACCAGTTGACCGAGCAGCCACCAAAGCCCCAGGGATGGAGCC 

CAGTGGCTCTGTGGCTGGCCTGGGGGAGCTGGACCCGGGGGCCTTCCTGGACAAAGATGC 

AGAAATGTAGGGAGGAGCTGCT 

Sequence 1833 

GCCNCGCGTCCGTGAAACGCAAAAGAAGGAGCTCGGAATATAAGAACGTCAGAACGAGTG 

ACACTAATAGTGGATMCACTAGATTTGTTGTAGACCCATCCATTTTTACTGCACAGCCA 

AATACAATGTTGGGCAGGATGTTTGGATCTGGCCGAGAACATAACTTTACACGACCCAAT 

GAGAAAGGAGAGTATGAGGTGGCAGAGGGAATTGGTTCCACTGTGTTTCGAGCGATTCTG 

GATTACTATAAAACAGGAATAATCCGTTGTCCTGATGGCATATCTATTCCTGAACTGAGA 

GAAGCATGTGACTATCTTTGTATCTCTTTTGAATATAGCACTATTAAATGTAGAGATCTC 

AGTGCX3CTMTGCATGAGTTATCAMTGATGGTGCTCGTAGACAATTTGAATTTTATCTG 

GAAGAAATGATCCTCCCTCTCATGGTAGCTAGTGCCCAGAGTGGGGAACGG 

Sequence 1834 

ccncgcgtccgctttaaccaccataagaaatcagaaaacgcaagataaagttcagcacac 

agtatgtatggattgcagtagctacagtacatactgttatcgctgtgatgattttgtggt 

taatgacaccaagctgggactggtacagaaagtcagagaacacttacagaacttggaaaa 

ctcagctttcacagctgacaggcataagaaaagaaaacttttggaaaactcaacactaaa 

cagcaagttattaaaagtaaatggaagcaccactgccatttgtgccacaggccttcggaa 

tttggggaacacatgtttcatgaatgccatccttcagtcactcagtaacattgagcagtt 

ttggctgttatttcaaagaactgccccgccgtggagttaaggaatgggaaaacagcagga 

aggcggacataccacaccaggagccaaggggataacaatgtgtctttggtagaaag 

Sequence 1835 
TTACTGGGCACCCANCTCCATGTGCANGACTTTTCCCAACACAGCCTTGGCCAGTCAGAT
GGTGTGNCAGGGCCNNAGGTTTNCGTANCCTCTTGGGTGATAGAAAGGGGCCCAGGCCCT 
GGGCTGGGGCTCATAANGGACTCAAANGAGGCACCTTGCCC 
Sequence 1836 

TCGCCCGGGGGCCATGGCAGCAGCGGCTACTGCAGCCGAGGGGGTCCCCAGTCGGGGGCC 

TCCCGGGGAAGTCATTCATCTGAATGTGGGAGGCAAGAGATTCAGTACCTCTCGCCAGAC 

TCTCACCTGGATCCCAGACTCCTTCTTCTCCAGTCTTCTGAGCGGACGCATCTCGACGCT 

GAAAGATGAGACCGGAGCAATCTTCATCGACAGGGACCCTACAGTCTTCGCCCCCATCCT 

CAACTTCCTGCGCACCAAAGAGTTGGATCCCAGGGGTGTCCACGGTTCCAGCCTCCTCCA 

TGAAGCCCAGTTCTATGGGCTCACTCCTCTGGTTCGTCGCCTGCAGCTTCGAGAGGAGTT 

GGATCGATCTTCTTGTGGAAACGTCCTCTTCAATGGTTACCTGCCGCCACCAGTGTTCCC 

AGTGAAGCGGCGGAACCGGCACAGCCTAGTGGGGCCTCAGCAGCTAGGAGGACGGNCAGC 

CCCTGTCCGACGGAGCAACACGATGCCCCC 

Sequence 1837 

CGCGTCCGTTCTAGATCGCGAGCGGCCGCCC I I I I I I I i 1 1 I I I IATCTTTCTG 1 1 1 I IC 

CACTAAAAGCTCCGTTTTTCCATCTTCCCATTACTCTCCCTTTCX5TGTGNACACTCCTGA 

AG ACAGGC ATCCTCATMGGTGTTCTGMTTMCTTTAGGGNGTTCTCCAGGTACTTTGC 

ATC Mill ATATTTCTTGT AAATTG NT AATTTCTAAG CTCC ATG ATTAAAG AG AATTC AC 

CACTAAAAAAAAAAAAAAAGG 

Sequence 1838

CGCGTCCGGGCTGGCGAGCGCCGNCGCCGGCGGAGACX5GACNCTNGGCCAGAGCCNGCCC 

GCGGCGCCCGGGCCTGGCCGGCTGCTTCCCGCCTCAGCNNGGCGCCCCNGCCTCCGTGCG 

CCGCAGACTTTGCCTAGGCGGGCNGAAGCTGAACAAGAGGTCCCTCGGCCCTGCACGGTC 

CGGCCGCGGTCCGGAGCCGANGCGCATGAGATTCCCCAGGAAGCCATCACACTCCTTCCC 

ACTGTGGTTTNGGGAGCATGAAGGCGTTG 

Sequence 1839 

GCCCCGCGTCCGTTTTATTTTGCACTTTTATGGGTGACAGTTTTTACGCATMCCT^ 

TAAMTACACTCMGTGACTTGGACTTAGATGCTTATCCTTACGTCCTTGGTACCTTTTT 

TGTATTMCA MCACTGCMTTTATAGATTACATTTGTAGGMGTTATGCTTTTTTCTGG 

Mi ll GTTT TACTTTCMCCTAGGTTATMGACTGTTATTCTATAGCTCCAACTTAAGGT 

GCCTTTTTAATTCCCTACAGTTTTATGGGTGTTATCAGTGCTGGAGAATCATGTAGTTAA 

TCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCGCGCCCAGCCTACATTCACTTCTAAA

GTCTATGTMTGGTGGTCATTTTTO 

GGAAAACTTATTCTGMTATTMCGGGTGGTGAAAAGGGGACAGTTTTTACCCTTAAAGT 
GCAAAAGTGGAACATACAAAATMGACTAAATTTTTNAGAGGTAACTCAAGTMTTTC 
Sequence 1840 

GTCCGGCCAGCTGATGCCGGGAGCTAACTACCGCGCCGGGGCCGGGGCCGGGGCCGGGGC 

CCGACGTCCCCGCGGGGCCCGGGACCGNGAGGAGGACGGCGGGGGCCTGGAGCCCGCGGC 

CGTGGCCCGCGACCTATTGAGGGGCACATCTAACATGTCATTTGAGGAGCTGTTGGAATT 

GCAGAGCCAAGTGGGGACTAANACNTACAAACAATTGGTAGCTGGAAATAGTCCTAAGAA 

ACAAGCTTCTAGACCACCTATCCAAAATGCATGTGTTGCANATAAGCACAGGCCTCTGGA 

MTGTCANNCMGATCCGAGTNCCATTTTTACGTCAGGTTGTTCCCATTAGTAAAAAGGT 

AGC 

Sequence 1841 

GCACCCGCTGNGAGAGGCGGTAGCGGCGGCGGCGGCGGTGGTATCGGCGGCAGCTGTGAG 

GGGGTTCCGGGMGATGGTGCTGATCMGGMTNCCGGGTGGTTTTGCCATGTTCTGTTC 

AGGAGTATCAGGTAGGGCNAGCTTTACTCTGTTTGCMGAAGCTTTGTAATGAATTGAGA 

CTGGGTGGGTGGAGAAGGAATTGAAGNTCTTAAAGAATGAACCTTATGAGAAGGATGGAG 

AAAAGGGACAGTTTACGCACAAAATCTATCACCTAAAGAGCAAAGTTGCCTGCATTCTGT 

GAGGATGATTGCTCCCX3AGGGCTCCCTTGGTGTTTCATGAGAAAAGCCCTGNAAATGCCG 

TACCCCTACTGTAGNMCAATTrrTNAACCGAAATGGAATATATTGAMNGMTGATTTTC 
CTTCAATTNAAAANTCCGAAAACAATNGGCANCCAAAACCCATNACTTTGGGGAACCATN
TAGGAAAAAANNGTACCNATNGGGTTTTTAGGATTCCCAAAACCACNATTGGGAAAAAAC 
CTTG 

Sequence 1842 

CCCGCGTCCGGCGGAGAAGACTCAAAAGAGTGTGAAGATTGCTCCTGGAGCAGTTGTATG 

TGTAGAAAGTGAAATCAGAGGAGATGTAACTATCGGACCTCGGACAGTGATCCACCCTAA 

AGCAAGAATTATTGCGGAAGCCGGGCCAATAGTGATTGGCGAAGGGAACCTAATAGAAGA 

ACAGGCCCTTATCATAAATTGCTTACCCAGATAATATCACTCCTGACACTTGAAGATCCA 

GTAACCMNMCCCTATGATTCATTGGCACCMTAMTGTTGTTTGGAAGTTGGGCTGGT 

TTATTCCCAAAGGCCCATTGAAAGATGGGGGAGGATAAAATAAATGGTCATTTGAAATCA 

AAAAGTCAATTATTGTTAGGCCAGAAAATGGTAMTATTTGGACCAAGTTGGCTTGCATT 

CATTTGGGNGGGCTTTGGTTGCMCCTTAMNTACATTTTGTAAGTTCATCCCCTGGAGA 

ATACCNGGTGGAATCCTTAAT 

Sequence 1843 

CGACCNCGCGTCCGGGGGATCTGTGCCTGGCATGGGGACGAGTTCTGGCCTCCTTAGGGT 

ACGGGGAGAGCTTGGACTTTGGTCCTGACGTGGTGGACGACACACCTTCGAAGAGTGGAC 

GTTACCTCAGTTGTCTGTTGTTAGAGTTTAATCGATCACTCCTCTGTTTTGTTGTGTTCT 

TTCCCAGAAATMCTTTACCAMGGAAAGCTATTTTGCGAACTATCTTCTCCAGCGGAGA 

TGGCCMTGTGCTTTGTMCAGAGCCAGACTGGTTTCCTATCTCCCAGGATTTTGCTCTT 

TAGTTAAAAGGGTTGTCAATCCCAAAGCCTTTTCGACTGCAGGATCATCAGGTTCGGATG 

AGTCTCATGTGGCTGCTGCACCTCCAGATATATGCTCTCGAACAGTGTGGCCTGATGAAA 

CTATGGGACCCTTTGGAC 

Sequence 1844 

GGGACAGAGCCCCCGATCCGCCAGCACCACCTGAGGATTNNGAAACCGCCCCAGCGATGG 

AAGAGGGNCAGGAGCTGGAGAGGAAAGCAATAGANGAACTGCTTAAGGAGGCAAAACGTG 

GGAAAACTAGAGCTGAAACAATGGGACCCATGGGTTGCTTATTACAGGGACAAGATACAA 

ACTAAAATCAGCCAAAATAAGACACAAAGATTAAAGCCA 

Sequence 1845 

CGTNCGGGACCCGNCAACATGGGCCGCGTTCGCACCAAAACCCGTGAAGAAGGCGGCCCG 

GGTCATCATAGAAAAGTACTACACGCGCCTGGGCAACGACTTNCACACGAACAAGCGCGT 

GTGCGAGGAGATCGCCATTATCCCCAGCAAAAAGCTCCNCAACAAGATAGCAGGTTTATG 

TCACGCATCTTGATGAAGCGAATTTCAGAGAAGGCCCANTTAAGGAGGTATCTTCCATCA 

AAGCTGCAGGNAGGAGTGANGAGAGNAAAGGAGTAGACAATTATGTTTCCTGAAGGTCTC 

AGCCTTGGGATCAAGNGAGAATTATTNGAAAGNTAGATCCCTGNACAACTAAAGGAAATG

CTTGMGNCTTTTTGGACCTTCGNCAGGTTCTTC^ 

GCCTACAAG TTGGGGATTGAMTTTTTCAAAAACGCCNTCGGGGGMCCCTGTTTTGAAA 
ATTTTTTTTCTTG NT AG NTG CCTT GT ATTAATTTTT 
Sequence 1846 

GTCGACCCCGCGTCCGCAGCCTGGCCTGTGAGACCCTCGTGGACAACAACCTGCGGGTCA 

CCAACTGGAACCGCAAGCTGGGCTGCAAGTGCCAGTACAAGCACATTGTGGACTGGTGTG 

GCTGCTCCCCCAACGACTTCAAGCCACAGGACTTCCTCCGGCTGCAGCAAGTCTCCAGAC 

CCACCTTCTTCGCCCGGAAGTTCGAGTCGACTGTGAACCAGGAGGTGCTGGAAATCCTGG 

ACTTCCACCTGTATGGCAGCTACCCCCCCGGCACGCCAGCCCTCAAGGCCTACTGGGAGA 

ACACCTACGACGCGGCTGATGGCCCCAGTGGGCTCAGTGATGTCATGCTCACTGCTTACA 

CAGCCTTCGCCCGCCTCAAGCCTGCACCATGCCGCCACTGCTGCACCCCCAATGGGCACC 

CCACTCTGCAGGTTTGAGCCCAGGGGCTTTGCCGTCCAAGCGTGCACCTGTATTTCTATG 

ACGACCATTTC 

Sequence 1847 

TCCGCAAGAGTTATGCTTAAGACCAGCCAGCCTTGATAGTGGCAGAACATCCACTAGCAA 

TAGCAATAATAATGCTTCACTACATGAAGTCAAAGCAGGTGCAGTTAATAACCAAAGCAG 

GCCACAAAGCCACAGCAGTGGAGAATTTAGCCTGCTTCATGACCATGAGGCTTGGTCCAG 
GCACAAAATATCTGAAACACTGGAGAGTCGACATCACAAGATCAAAACTGGTTCCCCTGG

MGTGAAAGTTGTTACTCTACAACAGTTTTTGGAAGAAAGCAATAAGCTTACCTCAAGTA 

CAGATAAAGTCCTCAAGTCAAGAGAATCTTTTAGATGAAGTAATGAAAAGTTTGGCTGNC 

TCTTCTGACTTTTTGGGAAAAG 

Sequence 1848 

GCGTCCGCGCCGGGCCGCTCTAGCCGGTGAGGCCGGCGGGCTCTCTGTGGCTGCGGCTGG 

GAAACCGCGCGGAGGAGGTGCCCGGCCGGGGACCAGCCCTGGTCCAGCGCCTCCCTCTCT 

CAGCATGGACGAGGAGAGCCTGGAGTCGGCCTTGCAGACCTACCGTGCGCAGCTGCAGCA 

GGTGGAGCTGGCCTTGGGCGCCGGCCTGGATTCGTCTGAGCAGGCTGACCTGCGCCAGCT 

GCAGGGGGACCTGAAGGAGCTCATCGAGCTCACCGAGGCCAGCCTGGTGTCTGTCAGGAA 

GAGCAGGTTGTTGGCCGCGCTGGACGAAGAGCGCCCGGGCCCGCCAGGAAGATGCTGA 

Sequence 1849 

AGTCACCACGCGTCCGGGTTGCACTCTTCCTATAGCCCAGAGGGCGAGAGGGCCTGTGGC 

CTGGGGGAAGGAGGACGAGGTTCTGCCTGGATCCCAGCAGTAGGACGCTGTGCCAT7TGG 

GAACAAAGGAATAGTCTGCCTGGAATCCCTGCAGATCTTGGGGCCGGAGGCCAGTCCAAC 

CCTTGGAGCAGGAAGAAACGCAAAGTTGTCAAGAACCAAGTCGAGCTGCCTCAGAGCCGG 

CCCGCAGTAGCTGCAGACTCCGCCCGCGACGTGTGCGCGCTTCTCTGGGCCAGAGCGAGC

CTGTTTTGTGCTCGGGTTAAGAGATTTGTCCCAGCTATACCATGGGCCCGCACTCGGNAA 

AGCTGGCTTGCGTGGCCGCTGGTGTGGTTATCGGGGCTGGTGCCTGCTACTGTGTATACA 

GACTGCTTGGGGAAG 

Sequence 1850 

TCGACCNCGCGTCCGCTCAGGMCCTTNGAGMGATNAGNNCCCCACTTAGATTNTTAAG 

GAGTAAAMGGGCTGAGTTATGCCTTTAAGNGCTGTCAAGAATTCACTTGGGTTTGGGAC 

ATTTGCTGGTGTAATGCTAGATGCCCACAGCANCATAATATTGNNCTTTGTCAAAGGTNG 

GTAAATNCTNTGNTTNTCANCANCCCTTTCCCCA 

Sequence 1851 

AGTCACCACGCGTCCGCGGCTGGTGGTGGGCTCGGGGCGCTCGCCTTGCCCGTCTTCGCT 

TCCGGAGGTCGCTACTGCCGCCTCAGCGGCCCCGGAGCGGGGGCGCCCGGGGGTCCTTCG 

CCCCCGGCCACGGTCCCCGCGCCGGGGCTTCGCCGCCCCCAGTGTCCGAGCTGGATCGTG 

CGGACGCCTGGCTCCTCCGAAAAGCGCACGAGACAGCCTTCCTCTCCTGGTTCCGCAATG 

GCCTCCTGGCATCGGGCATCGGGGTCATCTCCTTCATGCAGAGTGACATGGGTCGGGAAG 

CAGCATATGACCATCCCCCGACCTTGGCCCTGTTGCTACCCTGCCTCTNCCGCAGGCTTC 

TTNCTGCTGGGCGGCCTGTGCGTGGTGTGGGGCAAGCGCCTTGTACCCGTGGGCCTGGCG 

GCGCTTNGAGGACCCATGCAGCTGACCTGGGGGGCCGGCCTGGG 

Sequence 1852 

GCGTCCGCTCGCTGCAGCCCCGCCTGGGCCACGGCACCCTCGAGCGCCAGCCCCGCGCCC 

CACCCGGGAGCAGCGAGCCACCGGCGCGCTCCCCAGGAGCCCCTGCAGGCGCCGGCCCTG 

GTCCAGCGCCTCCCTCTCTCAGCATGGACGAGGAGAGCCTGGAGTCGGCCTTGCAGACCT 

ACCGTGCGCAGCTGCAGCAGGTGGAGCTGCCTTGGGCGCCGGCCTGGATTCGTCTGAGCA 

GGCTGACCTGCGCCAGCTGCAGGGGGACCTGAAGGAGCTCATCGAGCTCACCGAGGCCAG 

CCTGGTGTCTGTCAGGAAGAGCAGGTTGTTGGCCGCGCTGGACGAAGAGCGCCCGGGCCG 

CCAGGAAGATGCTGAGTACCAGGCTTTCCGGGAGGCCATCACTGAGGCGGTGGAGGCACC 

AGCAGCGGCCCGTG 

Sequence 1853 

GCGCCCCGCGTCCGGAAATTGACCCCTAGAGAAAATCCCATTAACTTGTTAAATTAGTGG 

AATTMCMCAAATAAAGCATGTTTGAGACCTGGCAAAAATTCCTCTGGTAGTATTTATA 

AATAGAGCTGCATGCCTCTAGTATGAAAACCGTATCAGTTGCAAGTGCCACTTCTACAAG 

TTACTCAGTTTACTCTTTGTATCAGTAACTTTAAAGGTTGGATGATCCTTGCTGGTTAAA 

GCTAAATCTCAACCTAGCAACTAAATGAAAATATTTAGAATCATCAGAATCTGAACAGAC 

TAAMTTATCAGCGATAAGCAGAATCAAGCAGGGTATAAGTTTTATCTCAATTATTTGAA 
ATTGACTGGAGTTTTCTTAMGTGTAMGCTGAMTTTGCTMCCATGTTTTTGATGAAC
CCACAGTGCAGCATTGGGTTGGGGTTTrAGATTTGAATGACTCTCTGCTATAATTATCAT 
GACTTTGAAA 
Sequence 1854

TGTCGACCCCGCGTCCGAACCTTTGGGAGACTCCAAGACAGCATGCTCCGAGGTCGGCGG 

GGGTCTGGGTGGCCATGGAGGAGCCCCCTGTGCGAGAAGAGGAAGAGGAGGAGGGAGAGG 

AGGACNAGGAGAGGGACGAGGTTGGGCCCGAGGGGGCGCTGGGCAAGAGCCCCTTCCAGC 

TGACCGCCGAGGACGTGTATGACATCTCCTACCTGTTGGGCCGNGAGCTTATGGCCCTGG 

GCAGCGACCCCCGGGTGACGCAGCTGCAGTTCAAAGTCGTCCGCGTCCTGGAGATGCTGG 

AGGCGCTGGTGAATGAGGGCAGCCTGGCGCTGGAGGAGCTGAAGATGGATAGGGACCACC 

TNANGAAGGAGGTGGAGGGGCTGCGGAGACAGAGCCCTCCGGCCAGCGGGGAGGTTGAAC 

CTGGGCCCAAACAAAATGGTGGTT 

Sequence 1855 

TGTCTGCCGAACAACTGAAAACTATGCAGGTTATATCATTCCACCAGCACCACCAAGACC

TGATTI I I IGATGCTTCAAGGGGAGAAAACTACATGAAGCTTGGNGAAGNGAGAAGGGTC 

AAGTGACCGAANNGCAAGNAATTTCACAAANAGAATGAAACNAGGCAACTGGAAAGCTCG

AAATANTTTTGGCAATTATTTCAGAGGANAGCACCAGATTTGCCGTATTGCATNGGMNT 

TGNTACCCNGTAGTTCCNTGTTGGANATGTCMCANTCCCCTANTmTTTGNACGAAACT 

AAGGMTTNTAGAAATTtTTGCAATNGTTCCTTTACTTGGGGGAAAATTATTNNAATTCC 

AAAGGAATCTTTNAGAGNTTGNNTNGCCGTAANGGCAAGAGAGAGNCNATNNTNGAAACT 

NAGGMGCGGAAACCTTTTGNAAANMCTTTTCTTTTTTA 

Sequence 1856 

GAGTCGACCNCGCGTCCGGCGGCGAGCGGGACTGGCCATTGGAGTGCTCCGCTGCGGAGG 

GAGGGGACCCCGACTCGAGTAAGTTTGCGAGAGCACTACGCAGTCAGTCGGGGGCAGCAG 

CAAGATGCGAAGCGAGCCGTACAGATCCCGGGCTCTCCGAACGCAACTTCGCCCTGCTTG 

AGCGAGGCTGCGGTTTCCGAGGCCCTCTCCAGCCAAGGAAAAGCTACACAAAAAGCCTGG 

ATCACTCATNGAACCACCCCTGAAGCCAGTGAAGGCTCTCTCGCCTCGCCCTCTAGCGTT 

CGTNTTGGAGTAGCGCCACCCCGGCTTCCTGGGGACACAGGTTTGGCACCATGGGGCCCA 

CCAGCGNCCCGCTGGGCAAGGCCCACCGCAGGCTCGGTCTNTGACTACGNCAACTATGAT 

ATTCATTCGTCCX3GCATTACAACTACACGGGGAAAGCTGAATATCAG 

Sequence 1857 

CGTACGCCGAGCGCCGCTCCGGCTGCACCGCGCTCGCTCCGAGTTTCAGGCTCGTGCTAA 

GCTAGCGCCGTCGTCGTCTCCCTTCAGTCGCCATCATGATTATCTACCGGGACCTCATCA 

GCCACGATGAGATGTTCTCCGACATCTACAAGATCCGGGAGATCGCGGACGGGTTGTGCC 

TGGAGGTGGAGGGGAAGATGGTCAGTAGGACAGAAAGGTAACATTGATTGACTCGCTTCA 

ATTGGGNGGGAAATGCCTCCGCTGAAAGGCCCCCGAGGGCGAAAGGGTACCCGAAAGNCA 

CCANGTMTCACTT GGGTGT TCCGMTATTGGTTNATTGAAANCCATTCAACCCTTGCNA 

GGGMAMCAMGG7TTTTTCNAACAAAAAAAGAAAAGCCCTTACAANGGAAAAGGTTAC 

CATTCANANMGAATTTMCATTTGGAAMTTTCAAATTCTAAATAGGGGGGAAAAACCT 

TTTGGAAAGGAAACCAGGANGTACCCCAGGANAAAGGAAGGTAAAAAA 

Sequence 1858 

GNGTCGACCCCGCGTCCGGTGGAGGGTCAGGAGCTGCCCCGGATCCTCTCCATGTAGTTG 

CGAAGCTCCTCAGGGTCCTTCAGCCCCATGTCCTCACACACCCAGCGGATGTCCTCCTCG 

CCTGCCACAAGGATGGACTGCACAGCAGGGGCCCCTACAGGCTCCTCAGGTGACTGGGCT 

GGAGGGGCTGGCGCAAATGTCACAAACTCTACTCGCTTCCGCCGCCCCCCAGCCTCCTTT 

CGGGCCAGGGTGCTTGAGGAGCTGGTGGTGCCCCCAGGAGGGCCAGGGGCCAGGGTAGGG 

GCCTCCCCTCCTCCCCCACTCTCACAGGGGCAGCTCCCCTCCCCCTTGGGCGGGCCAGGG 

GACTGCCGGTCCAGCTGGCGGCTCAGTTCCTCCTGGTCAGTGCCCAGCCAGACCCAGTTG 

TGGGGCTGGGGGAGGTTGGGG 

Sequence 1859 

GCGCCCCGCGTCCGGCGGCGGCGGCGGCGGCAGCGGCAGCCAGAGGACTCCCAGCGGCTG 
GAGCAGAAGTGTTAGCCGGCCAGAGCTCCCAGACCCCTACCCACAGCCAGGCGGGACGCG

CACAGTCCCTCCACGCGGAAAGAAGTACCTTCGCCGGTCACCGGCTCCTGCAGGGTTGCA 

NATATATACAGAGCTTCATAATCAGCCCAAGACCACATAGAGCAAACATGAATGATATTT 

CCCAAAAGGCTGAGATTAAAGAAATGCTTGCTTCTGATGATGAGGANGATGTATCTTCTA 

AAGTNGAAAAGGCTTATGTTCCAAAA 

Sequence 1860 

CGACCCCGCGTCCGACCCACTGAAGACGTCTGCGTGAGAATAGAGACCACCGAGGCCGAC 

TCGCGGGCCGTTGCACCCACCGCCAAGGACAAAAGGAGCCCAGCGCTACTAGCTGCACCC 

GATTCCTCCCANTGCTTANCATGAAGAAGGCCGAAATGGGACGATTCAGTATTTCCCCGG 

ATGAAGACAGCAGCAGCNTACAGTTTCCAACAGCGACTTCAACTACTTCCTACCCTCACC 

AAGNCAAGCTTGCTCTGAAAAGCCCANTTATGCCANAATGTAGGATCCTGAAAACCAAGA 

AAC I I 1 1 1 I ACTTG AAATT 

Sequence 1861 

GCGTACGGCCTGTTGGGCTGTCTGGGGGGTGGCCATTTAGGGATCGTGGGGACGGGGTCC 

ACCCCANNCAAGAAAGAACAGGCCCGTCCACAGGCCCGGCTCTGGGCCACAGTGCCCCGG 

AAGCAGGTGTGTCCAGAGTCANGCTTGAATGGCTCTCCCCACAACCACCCAGCNAGGCGC 

TGGTGCNTCCTTCTGCCTCATGGGACCAGTCCAGCTTNCAGCCGCTCTGGGCTCGAGGGT 

NGGTACTGACCACTTTCCTTCTTGAGNTGGGAGCATTCTCTTGGGGGAGNCTCTTCCAGT 

GGGCA CCTGCCTGGGACNCTTGCCCACCGGTTTTCTTGTAAAAATCAGGAATACXSGGTGG 

CTTTTAGTAAAAGGCAAGACCANAGNCGCCTTNCGTTGGGCAGGGGAAAAGCCAAGCGTG 

CCGGNGGGNAAGGTCACTGGAAAAAGGTGGCTTGCCCTAAGGGGGAAGTTTGGGAAAATA 

GTCCCCCTGTTCCMGAANTGCCTTTGMTTTTTAAAAACATTTTTGGCT 

Sequence 1862 

CMTNrTACMCGCCATGTNCACCCANATGTTCCAGACTMGCGCTGNTTTCGACTGGCCC 

CCACCTTNAGCAACCTGCTCCTGCAGCCNACCACCAACCCTCATACCTCGGNCAGCCACA 

GGCCTTGCGGTCAATGGGGATGTAGACAAGCCTTCAGAGCCAGCCTCTGAGGAGGGCTCT 

GAGTNGGAGGGGAGTGAGTCCAGTGGACGCTCCTGTCNGAATGAGCGCAGCATCCANGAG 

AAGCTTNAGGTCCTGATGGCCNAAGGNNTGCTNCCTTGCTGTGAAAGTCTTNCTGGACTG 

GCTTCNGACCAAC 

Sequence 1863 

NGGAGTCGACCCACGCGTCCGGCCGCCAGAACACAGGTGTCGTGAAAACTACCCCTAAAA 

GCCMMTGGGAMGGAAMGACTCATATCAACATTGTTCGTCATTGGACACGTAGATTC 

GGGCAAGTCCACCACTACTGGCCATCTGATCTATAAATGCGGTGGCATCCGACAAAAGAA 

CCATTGAAAAATTTGAGAAGGAGGCTGCTGANATGGGAAAGGGCTCCTTCAAGNTATGCC 

TGGGTCTTGGATAAACTTGAAAGCNTGAGCCGTGAAACCGTTGGGTATCACCATCTGGAT 

ATTCTTCCTTTGTGGGGAMTTTTGGAGNACCAGGCAAGTTACCTATTGGTGGACTTATT 

CATTTGGATGGCCNCCMGGGACCACCAGGAGGACCTTTTATCAAAAAAAACANTGATTA 

CAGGNGGACATCCTNNAGGCCTGGACTGGTGGCTGGTCCTGNATTGGTTGCTGGCTGGGT

GGTTTGGATGGMTTTTGAAGGCCTGGGNTATNCTTCCCAAAAGAAAT 

Sequence 1864 

GCGTCCGATGGCGTGNTGTCTCACAGAAAGTTCTCCGCTCCCAGACATGGGTCCCTCGGC 

TTCCTGCCTCGGAAGCGCAGCAGCAGGCATCGTGGGAAGGTGAAGAGCTTCCCTAAGGAT 

GACCCATCCAAGCCGGTNCACCTCACAAGCCTTCCTGGGATACAAGGCTGGCATGACTCA 

CATCGTGCGGGAAGTCGACAGGCCGGGATCCAAGGTGAACAAGAAGGAGGTGGTGGAGGC 

CTGTGACCATTTGTAAGAGACACCACCCATGGTTGGGTTGTGNGGCAATTGGTGGGCCTA 

CCGTTGGGAMACCCCTCGAGGCNCTCCGGTACCTTCMGACTTGTCTTTTGCTGGAGCA 

CAATCAGTTGAATGAMTGGCAAGMGGGCGGTTTNTTATTAAAGTMTTTGGCCATTAA 

AAATTCTAANGAAAGGAAAGGGCCTTTACCAAAGTTACCTGCAAGGAAAATGG 

Sequence 1865 

CCGGCCGGGCTGGGTCCCAGCACNTGACCCCAGCTGCACTGCTGTACAGGGCTGCNCGTC 
ACCCGAAGTCAGAAACGTGGCATCTCATCGGAAGAGGAGGAAGGAGAGGTAGACAGTGAA 
GTAGAGCTGACATCNAGCCAGAGGTTGGCCTCAGAGCCTGAACATGCGCCAGTCACTATC

TACCTTCAGCTCAGNAGAATCCATCAGATGGGGAGGAAGGCACAGCTAGTGAACCCTTCC 

CCCAGTGGCCACACCTGGAAGTTGGCTAGCACCAACACNTGATGAGTCCGGGCNCAGNAT 

GAAGCNGNGTTCTGATGAACATGTTGACTCCCAAGGGGCTCAAGAAAATCCCCACNTGGG 

ACNCCACTCTCCTTCAGGAGGGTCATACCCTGGGCCCTGGAACCCAGCTCCCTGGCCCA 

Sequence 1866 

TTGCGCCCCCGCCGGTGAGCGCGGGGAGCGCCGCAAGCCCAACGCCGGGGGCAGCCCCGN 

TCCGGTGCGCCGCCGGCCGGAGGCCTCGCCGGTGCAGAAAAAGGAGAAGAAGGACAAGGA 

GCGGGAAAACGAGAAGGAGAAGAGTGCCCTAGCCCGGGAGCGCAGCCTCAAGAAGCGCCA 

GTCGCTGCCCGCCTCCCCACGTGCCCGCCTCTCTGCCAGCACCGCCTCTGAGCTCAGCCC 

CAAATCCAAGGCCAGGCCATCCTCTCCCTCCACATCCTGGCACAGGCCTGCCTCCCCCTG 

CCCCAGCCCAGGGCCAGGCCACACTCTGCCTCCAAAGCCACCGTCCCCCCGAGGCACCAC 

TGCATCCCCCAAGGGGCGGGTTCGGAGGAAGGAGGAGGCAAAGGAGAGCCCCAGCGCCGC 

AGGGCCCGAGGACAAGAGCCAGAGCAAGCGCAGGGCCAGTAACGAGAAGGAGTCAGCAGG 

CCCAGCCTTACCGGCACCTTTGGCGGNGCCTTGGCCAACCCAAGCCCGGCCCAAAAGGAG 

CMGCCCCCGNGGAGACCCCTTACAGACCTGGTTTCTTTGACTTAACCCCAAGCCCTTGT 

TCCCCGGTGGACCCTAGCAAANCAATGGGCCGGNACCACAAGANCGAGAAGAAGCCCCTT 

GGNTNTTGGCTTGAAAGCGGCGCCAGGNCCCGGGAACAACCGGAAGCCCCANGAGCAAGA 

ACCGANGCTTTANGCAAAAAAGGACAAAGCCAATTGCAAAAGGACAACTTGCACCGGAAG 

GCCCAGGC 

Sequence 1867 

CCCCGCGTCCGTTTMGAGTTGCATATTTTACTTTATTTTTATTAMTTAAA^ 

TCTGGCAGCGATTCCAGAACAGGGTAAGGAGGTTCCTCACAGGGGTCAGAGAAGAGCGGA 

GAAAGACAGACTGACGGAGACTGAGACACAGGAGAGAAAGGACAAGGTTAAGGGAGAACT 

GTATCTGATGAACACACACAGCCGGCTCCATGGCGGGTGACGGGGAGCTCACATCAGCCC 

AATTTCTCCTCCCCGGCACCCGAAGTTCAGCGGTGGAGCAGTATGTGGGGGCGGTTAGGA 

ATCMGAGACCCTCCCTTCCCCACCCTAGGTCCTTTCTCGGCTTGGTCGTGGAGCACAGC 

ACATACCAGAAAAAGCCAAGGGCAATGGAGGGGCAGGGAAACCGGGAGTATATGTACACG 

GGGAGGGGAGAACAGAGCCTTGGAGGTCGGCCTCTGCCAGAAGGGAAGTGGCTCACACTT 

GCATTTGNMCACTTGGCCAGTGGGGGATGGGGGAMGGMTTGCCCCTTCCTTTTGG 
Sequence 1868 

CCCNCGCGTCCGCACACCCTTCCTGTACTCAGTCCTCAGTTTGCCTGGTGAGAGAGCAGC 

CTCCTCCCGTGTGCTCTGCCAGCTGGACCCAGACTGGCCATATTACCAGTGAGACCAAAA 

AGATGGAGGTGGGGAGGTAGCTCTGAGGTCTGGGAMCCATTCCAGCTCCTGCCAGTTTT 

MCTTGTGTTTMTTCCTGGCACAGTTGTCCTGGAMTGCCTTTTTCTCTTGCCTGGGAA 

CCACTAGAAGGGGATGTTGTCTGTGTTGGCCAGGGCCATGCAAATTCAACATCTTGTTTC 

TGCCCTTCCCCCGTGTAGCTGAGGCTAGGTGTTGGCATTACCCAGTGCTTGTTCTTCAGA 

GAGCAAAAGCACTGCTCGTCATGTCTGAAATTTAGTGAGTGAGCTCACCCACTAGGCTGG 

TGTTTCCTGCCCGTGGCTGCACATTGGAAGCACCGGGGCACTTTGAGAACTACAGATGCC 

TGGGTCCCAGAGCATCTAAGGTGCTCTAGGGTGTGTCCAGGACACAAGCCCTGGTTGAGG 

ACCACTGCTATATTGTATGGCCTCTmAAAAMGTTMTTTTACTTG 

MGCTACAGMMGTTGCAAGAATAAAAACTGTACAAATGAGGCTTAAATATTCTTTGGC 

CAAATCACCTATTAACATTTCGTTCCAAAAAAAAAAAAAA 

Sequence 1869 

GGGCAGCGCCTCCGACATGAAGGCTGAGCTGTCGCAACTTATTAGCGACCTGGGCGAGCT 

CAGCTTCGGCAACGACGTGCGCACCCTGCAGGCCGACTTGCGGGTGACGCGCCTGCTGTC 

AGGCGACAGCACGGGCAGCGAGAGCTCCATCGAGGGCGGGGGCCCTGACGCCACCTCCGC 

CACCGCCGGGGACTCGTCCCGCCAGGCCGACGGCGCCAGTGCAGACGAGCCCCACTCGGG 

CTGAGCTCCTCCGCGCGTCGCCGGCGCTCCACCGTGGCTACCCATCCGTGGTCCCGACAA 

CCTCCCTGTCCCTTGCCCGCCCCCAGGAAGGGGGAAATGGGGCATTTGGGGCCCAGACCT 

ACACTTGGAGCCCAGGTCCAAGCGTTCCCCGACCGCTTTCCCTACTNCCGGNCCCCGCTC 
CCGCCCCMGMCTTTTGGCTTTTTTGCGCGTGGGGATGCGGGGAGATTTGAGANGGGNA

AACCCCCGCCAGGAAGGAAGAGAAGAGGCACCCCTTTGGGAATGCCGGTGAAGGGAAGGT 

TGGCTGAAGTTCCTMGTTTAAGGCCTAAGGTGCCTTGGNCAGGTTTCCTGTTTGTGGGG 

GAAACTTGGGNCTTGAGGAGGANGGGGTTAAATTTCTTCTTCCAACCCCTGGGAAGCGGG 

CCTTGCCTTGGANNTGAAATTTAANTNAAAAAAAAAAAAA 

Sequence 1870 

TCTTCATCATGGGTGCCAGCATCCACCTGGTGGGTGACTCTGTCAACCACCGCCTGCTCT 

TCAGTGGCTACNAGCACCACCTGTCTGTCCGTGAGAACCCCATCATCAAGAATCTCAAGC 

CGGAGACGCTGATCGACTCCTTTGAGCTGCTCTACTATTATGATGAGTACCTGGGTCACT 

GCATGTGGTACATCCCCTTCTTCCTCATCCTCTTCATGTACTTCAGCGGCTGCTTTACTG 

CCTCTAAAGCTGAGAGCTTGATTCCAGGGCCTGCCCTGCTCCTGGTGGCACCCAGTGGCC 

TGTACTACTGGTACCTGGTCACCGAGGGCCAGATCTrCATCCTCTTCATCTTCACCTTCT 

TCGCCATGCTGGCCCTCGTCCTGCACCAGAAGCGCAAGCGCCTCTTCCTGGACAGCAACG 

GCCTCTTCCTCTTCTCCTCCTTCGCACTGACCCTCTTGCTTGTGGCGCTCTGGGTCGCCT 

GGCTGTGGAATGACCCTGTTCTCAAGAAGAAAGTACCCGGGTGTCATCTACGTCCTGAGC 
CCTG 

Sequence 1871 

CCGCGTCCGGTTTGTTCGTTATAGGTTTAATAAGTCTATTGAGGAAGACCTACTCCTGTG

TGMTCTTTGCAAAGTMTGCTACCGGTGMGAAATATTCAACTGNATCAACAGTTTTAT 

GCAGAAACATGAAATTGAATGGGAAAAATGTGTTGATGTTTGTAGTGATGCTTCTAGGGC 

AGTGGATGGGAAAATTGCCGAAGCTGTCACCTTAATAAAATATGTGGCTCCCGAAAGCAC 

CAGTAGTCACTGCCTATTATACAGACATGCACTGGCAGTTAAAATAATGCCTACATCTCT 

AAAAAATGTGCTAGACCAGGCAGTACAAATCATCAATTATATTAAAGCTCGACCACATCA

ATCCAGACTATTAAAAATTTTATGTGAGGAAATGGGTGCTCAGCACACAGCACTTCTTCT 

AAATACAGAGGTGAGGTGGCTTTCTCGAGGTAMGTTCTTGTMGACTTTTTGAACTTCG

TCGTGMCTTTTGGTTTTCATGGATTCTGGCnTTTCGACTATCTTGATTGGTTAACAAAA 

TTCATCTTGGCTGCTMGACTTGCATATCTTGCAGATATTTTTACTAAATTAAAATGAAA 

GTTAATTTGTCMTGCCAAGGNAAAAATGTGACCCGTTTTACNAGTATTTGATAAAATGT 

CGTCATTGGTAAGAAAATTGGAAT 

Sequence 1872 

TCGACCCACGCGTCCGCGGACGCGTGGGTGTGACTGGCACCCAATGCCATGCCTTTATGG 

TCAC TTGGTAGTATAMGGCATGGCATTTGTTGTGACTGGCACCCAATGTTTGATTTTTT 

TTTTAAAACTATCCAATTAAMTTAAGGTCTGGGAGTGTTCTGTTTCCCATTCTTTAATA 

CTCACCTCCTCCCAGACTTTCTACACCTGTTGCACCTCAGGCAGAGGATGTTCTGGACCT 

CCCCCTCTTGGTCCCTACTAGAGACCTCTCAACAGATCTGTGGGCCCAGTCATTGGGTTT 

TATCAGTGCTTAATGTGAACTAAGTTTTTTACTTCCACAGAATACAAGCCACTACCTTCT 

GACCTCCCCACCCCCCACCAACCCCCATCTTTTAATATGCTGTGGGGCATAGAACTCCGG

AATGACCAGCATGATATTTTCAGAGTCTTGT<XCCGGGGTATTAGCACCTCTTTTTGAAC 

AGGGAATTGATTCAAGATTGGACATGGTCTCCTCTGATTATCAGGTACTGGGGCTGAGGG 

CATTAAAAATAGTAAGCCTCCCTCTCGTCCCCTGCCTCAAGAAATTGCCTCTTATTTATC 
AACATCI 1 1 1 ICCTC 
Sequence 1873 

CCNCGCGTCCGGTGTTCCTCCCTGAGAATTAGTGGTCAGACATTGCAGAGGGCATCAGAA 

GCGGGTAGATGAAATAGCGAAAGGAAACAGGCTAGCAGACCAAAGAGCTAAGTCAGCAGT 

AAGAAGGCCCCAAGGTCCCAAAACACTTGAGGCCCCTCTGATTTGGGAGGGCTACATAAG 

GGAAATAAAGCCTCAGTATTCCCCTACAGAGATAGAATGGGCCACCTCTCGAGGTATACT 

TTTCAACCCTCAGGATGGTTACAATCAGAAGATTGCAAAGTACGCTTGCCAGCCTCCAGC 

CAATGGAAGATTCTTAAAATCCTCCACTGAGCCTTTCACTTAGGAAAGCATAAGACATCA 

GTGCATCCAAAGATTGTTCTCAGGAGAAAATCTACTAAAAATGGTCAAATAGGTTGTTAA 

TACTCGTGAAACCCTCTTAAAAATAATCCCTTTAACAGATGACTTCTTCXJCCACCACAAT 

CAAAGGA 
Sequence 1874

ACGCGTCCGATGACCAGGCTGCCCCGCCTCCTGGTTCTGCCCAAGTTCTCCCTGGAGACT 

GAAGTCGACCTCAGGAAGCCCCTAGAGAACCTGGGAATGACCGACATGTTCAGACAGTTT 

CAGGCTGACTTCACGAGTCTTTCAGACCAAGAGCCTCTCCACGTCGCGCAGGCGCTGCAG 

AAAGTGAAGATCGAGGTGAACGAGAGTGGCACGGTGGCCTCCTCATCCACAGCTGTCATA 

GTCTCAGCCCGCATGGCCCCCGAGGAGATCATCATGGACAGACCCTTCCTCTTTGTGGTC 

CGGCACAACCCCACAGGTGAGCCTGGAACCCATCACGTTCCACATCCTCCCACCCATTCT 

TTCTCTCAGGAACTAGTCCCGACAGATGCAGACATCCX^TCTATCCCTGAGAGGGCTCTGG 

GCAGGGAACCCATAACCCTACCCTGCTTCCTGTCCCAAGAGGAGGC 

Sequence 1875 

AGTCGACCCACGCGTCCGCCCACGCGTCCGCCTTCTTCTGGGCACTGACTGCCCTTCTGG 

TCGCTTCAGCTGCTGCCTTCCAGGGTCTTCTGCTGCTGTTGCCGCCACCACCATCTGTAC 

CCACAGGGGAGTTAGGATCAGGCCTCCAGGTGGGAGCCCCAGGAGCAGAGGAAGAGGTGG 

AAGAGTCCTCACCACTGCAAGAGCCACCAAGCCAGGCAGCAGGCACCACCCCTGGTCCAG 

ACCCTAAGGCCTATCAGCTTCTATCAGCCCGCAGTGCCTGCCTGCTGGGCCTGTTGGCCG 

CCACCAACGCGCTGACCAATGG(^TGCTGCCTGCCGTGCAGAGCTTTTCCTGCTTACCCT 

ACGGGCGTCTGGCCTACCACCTGGCTGTGGTGCTGGGCAGTGCTGCCAATCCCCTGGCCT 

GCTTCCTGGCCATGGGTGTGCTGTGCAGGTACACAAGGACCCCCAGCCCCTGTGCGGGTG 
GAACTCA 

Sequence 1876

TCGACCNCGCGTCCGGTCTTCGCAGGTGGCCCTCGGGCCCGCAGCCGCTGGGTAAGGGTG 

ATGCCTAGCCTGGCTTATTGCACCTTCCTTTTGGCGGTTGGCTTGGNGCGAATCTTCATC 

TTAGCACATTTCCCTCACCAGGTGCTGGCTGGCCTAATAACTGGCXaCTGTCCTGGGCTGG 

CTGATGACTCCCCGAGTGCCTATGGAGCGGGAGCTAAGCTTCTATGGGTTGACTGCACTG 

GCCCTCATGCTAGGCACCAGCCTCATCTATTGGACCCTCTTTACACTGGGCCTGGATCTT 

TCTTGGTCCATCAGCCTAGCCTTCAAGTGGTGTGAGCGGCCTGAGTGGATACACGTGGAT 

AGCCGGCCCTTTGCCTCCCTGAGCCGTGACTCAGGGGCTGCCCTGGGCCTGGGCATTGCC 

TTGCACTCTCCCTGCTATGCCCAGGTGCCGTCGGGCACAGCTGGGGAA 

Sequence 1877 

ACCCCGCGTCCGCCCTTAAGAGACAATGATTGAGAAAGAGCCATGTGGCTTGGCTCTAGA 

MCGTCATTATCATTAGGACCATCAGATTTTAGATTAAGCTGCTATTGAATTAATAAAAT 

CCCAATGAAGCAGAGTTATAGGGATAGATTTATAGCTGGCAGAGTGGTATCAAAGGAGAA 

MACAGT GAAA AAGCCAATTTCACTGGTTCGTTCAATCCAGCTTGTTGCTAATATTAGTT 

ACCCTTGTTTTAATGACAGAGAGTGGCTGGAATCTGTAGCTAGGGGAGGGGCAACACTGT 

TAGATGTGAGGAAAGGAAGTGCCAAAAATGCCTGGACAGATGGCTTGTCCCAAGGCCAGG 

ACACACACTTTAAMTCCMCATTCACCTAAGCAAGTAATTCTTAAAGATCTTACAGAAA 

CGCAGAGTCAATTCAGGTTTATAAAGGAAGGCTTNAGGGGAGAGAGGAAGGCCTGGGGGG 

CCTGGACGAAAGAGGCCTAGGACCTGAAGAGACTCCAGCGAGTCTTCGGGAAGC 
Sequence 1878 

AGTCGACCACGCGTCCGCAGCATCGTCCGAGACTTCCCAGACTCCCGGCCAGCCATCGAG 

GACCTCAAGTACTGCCTGGAGAGGACGGACCAGAGGCAGCAGCTGCTCGTGTCCCTCAAG 

GCTGCCCTGGAGACTCGGCTCCTGCATCCAGGCGTCAACACGTGTGACATCATCACCCTC 

TATATCTCTGCCATCAAGGCGCTGCGCGTGCTGGACCCTTCCATGGTCATCCTGGAGGTG 

GCCTGTGAGCCTATCCGCCGCTACCTGAGGACGCGGGAGGACACAGTGCGGCAGATTGTG 

GCTGGGCTGACGGGGGACTCGGACGGGACAGGGGACTGGGCTGTTGAAGCTGTCCAAGAC 

CGACCCGGCGAGCCTGGAGACAGGCCAGGACAGTGAGGGATGACTCAGGCGAGCCAGAGG 

ACCTGGGGTCCCGGACCCTGTGGATGCCCGATCAGGGGAAAGTCGAGCTCCAGCC 

Sequence 1879 

GTCCGCACMTTGAAAACTGGGAAMATMTCTCGTGGTTCGGTTTGTTAGTTTGAGACG 

CAGTCTCACTCTGTCGCCCAGGCTAGAGTACGGTGGCCCAATCTCAGCTCACTGCAACCT 

CCAACCTCCCAGGTTCAAGCAATTCTCTTGCCTCAGCCTCCCGGGGTAACTGGGGATTAC 
AGGCGCGCACCACCACGCCCAGCTAATTTTTGTG I I I I I AGTAG AG ATG AGATTTTGCCA

TGTTGGCCAGGTTGGTCTTGAACTCCTGACCTCAAGTGATCCACCCACCTCGGCCTCCCA 

MGTGCTGGGATTACAGGTGTGAGCCACCGTATCTGGCTCTCATATTTTTTATATACAGA 

TTGMTATCCCTMTTTGAAAACCTGAMTCTGAMTCTTCCAAAATCCAAMCTTTTTG 

AGCAGTTACATGATATTTGAATGAACTGCTCATAAGAA 

Sequence 1880 

GAGGAAACCAAAGTGCTCTGTATCCTCCAGTCTCCGCGCCTNCACCCAGCTCAGGAACCC 

GCGAACCCTCTCTTGACCACTATGAGCCTCCCGTCCAGCCGCGCGGCCCGTGNNCCGGGT 

CCTTCGGGCTCCTTGTTGCGCGCTGNTCGCGCTGCTGCTCCTGCTTGACGCCGCCGGGGG 

CCCCTCGCCAGCTGCTGGGTCCTGTCTCTGCTGTGCTGACAGAGCTGCGNTGCACNTTGT 

TMCGCGTTTACGCTGAGAGTAAMCCCCMAACCGATTGGTAAACTGCAGGTTGTTTCC 

CCGCAGGCCTCGCAGTGCTCCAAGGTGGGAAAGATGGTANGCTCTCCCTTGAAAGAACCG 

GGTMGCANAGTTTATGTCTGNNACCCCGGGAANGCC(XCTTTTTTCTTAAAGGAAAAGG 

TGCATCCCAANNAAAAATTTTGGGACCAGNTGGGGAAAACCAAGGAAAAAAA 

Sequence 1881 

GCGTCCGCCCTGGCTCCTCCAGCMGACCTCGTCTTTGCTTGTCCTGCTCAGATGCTGGT 

CATCCTGGGCATGTCCCCAGTGTGGACTCTGGACTGGGAAGGGGGCAGGCCCCTTTGGAC 

CTGCAGTTGGCCTCAGCAGAAGGCCTTGCCTTGTGTATGTGACTCCATATCCCGGGAGCA 

GTTGACCTTTGCCAAACACTTTACAGTTCTGGAGGAGGAGGTAACATAGATGCCTGGGCC 

TGATGGTGGGGCCATACCCATGTGTCGCCTCTCACTCTGGCAGCCTCAGAGGCCCCTTGC 

TGCTGGCTCCCATCTCCCTCCCATTTGCAGACCAGGAAGGAAGAGCAAGCTGTACAAAGG 

GAAGCAGAGCCTGGGGTGGGGTGTGAGCAGGGTGACCCCTCATCTGAAAGGCCCAAACCA 

GGGGGGAAGCACCAGCCTTAGTGCAGCCCCCTCTGACCCCACCTTAGAATGGAAAGCCTT 

CACCTGCAGCCCAGGCCTTCCTCCCCGG 

Sequence 1882 

AGTCGCCCCGCGTCCGGTGATTCCAGGGTGCAGMGGGATTCATATTCCCAGMCGCTTT 

MGTGTACACCTGCAGGATAAAGAGATACCGGTTACATTATTAAATGATTCTAGGGATTC 

ACTGGGGGAT ATTTTTGTTGCTTTTACTTTCATGGTTAGAGCTACAMGAACMGTGATT 

I 1 1 II II I I I ICTCCCTTCCCCATTCAGAMCATTATACATTGGGCCATTTTTCTTTCTC 

CCAAAGAAGATTCATGGATAGTCAGACTGAACTGTGTGCAACAGGAAAAGTCAAAAGGGA 

AAAGGCAGCTGATGAGGTTCATGGTTACATGTTCTACATCATGCAGAGTAGCTTGAAATC 

TAGTCTGGAGAAAACTGGATCAAGATTCTAGCCCACTGGAGTTGCAAGGAATGAGAGGCA 

AAAATTCTAAAGATTTGGGTTATATTTTCAACTTGGGGGACAGAGAGAAATGGAGAGCAG 

GAATTACAGTTCCAACAAACATCATGATAGTC 

Sequence 1883 

CCACGCGTCC GACTA GTTCTAGATCGCGAGCGGCGCCC I Mi ll I NTTTTTNNCACGCTT 

MTTCACTTTATTTTTCTTGNATAAAAACCCTATGTTGTAGNCACAGNTGGGGCCTGAGT 

CCGNTGCACGGAGACTCTGGTGTGGGTCTTGACGAGGTGGTCAAGAGNAACTCCTNGATA 

GGGAGACTTGGGTGAATACANTNTCCTTCCANAGGTCCGGGGNGTTCATGGTATGCTGTA 

NGGTCCTTAAAAAAATGGG 

Sequence 1884 

GTCCGAAAAAATGATAATGTGCATAAAATTCAACCCAGCTTTCAAAGTCCAGTCAAAATA 

TCAGAMTCATGAGGTCCMTGGATTTTGTTTAGCAAATACCGAAACAATAGTTATTGAC 

CACAGTATACCAAATGGAAGAGACCAGCACCTGGGCGTGGACCCAACAGAGCATTTATTT 

GAGAATGGCAGTGAGTTTCCCTCAGAGCTGGAGGACGGGGACGAGCCAGCAGCCTACGTC 

ACCAACCTGTCATATTACCACCTGGTCCCCTTCGAGACAGACATTTGGGACTGAACCTCT 

CTATCAGGCCTCCCCCGCCTCAAGCTGTTCACTGCC 

Sequence 1885 

CGCGTCCGCACAACAAGACACTCCAATTGTGATTTGAGTTGAGGATCTCTGCCTGCCTTC 

CTGCCGTCCTTCCTTCTTCCCCGATCCATGCTACTTTTAGGGGCTGCGGAGAGCAGCAGC 

AGAGCTGAGTAATGATACAGGGCACCACGGAGAGAAAGTAGAACCATTTCACTCCTGGGA 
AGATGGGGTATTTCCCACTTCCAGCAACGAMTMCAAATGAAMGTTGCATACTTATTG

ATGTATTGTATGAGCCAGTAGCATTTTATGTACAAAACAGAAGTCAATGCAACAGTATGT 

ATGTGTGCCTGTGTGTGTATAAAMTMCCATTGAAGCTAACTTGCTAATGTACTTAGGC 

MGCCACTTCCCATCTCTGGGCCTCGTCTTTCCTCCCTCTAAAATCAAAGAGCTGAATTA 

TGTGATCCTTGAGGTCTCTTCCACTTATAATACCAACTGTCTTGTCAGACTGGCAAATTA 

TATTGGCCTCTCCTTATGTGGTGGGTTTTTTTGGGAGGGNCATAGTTNCTTATACACAGG 

ACACCTGCATNATCN AAGGGC I I I 1 1 I 1 1 CTAAAAAAAAAAAAAATG 

Sequence 1886 

CGTCCGCTCCTGAGTAGCTGGGATGACAGGCGTGCACCTGGCAGCI I I 1 1 CAAAGTGTTG 

ATGGTAATCTGAGGCMTCTMGGGAGTCATTTTTTAAGTGACTTTATACAGAAAGATTG 

GTAAGAGCCAAGGGGTAGAAGTGGCATAAATGTCTAAAGCAGGGAAGTGACAGGGACTTT 

CATTGTTCTTGGCTGAGGAGAAGCGGGAGTGGCTGATGGAAGCACCTAAATGATGCCTTT 

GTCTGTGGGAAGGCAAATGATGCCCCAGAGCTCTAACCAAAGGTTTTGCAGCCGCCGAAA 

AACAGGAAAGTTGGGAAGCGGGGGTAGGACTACACTGAATCATTAACAGTGCTGTAAACT 

ACCCATGTGGCCATTAACMTGGACCTTTGGGGGAGTTTTCCTAAACGATCACTCTGGA 

Sequence 1887 

CGTCAAACACCCGCAGGCTGTCGATTTCATCATCAAAGAAATCAAGACGATAAGGGCAGC 

TCACTCCCCATCGGGAAGAGATCCAGCAACGCGCCGCGCGTGGCGTATTCGCCGTGCTCC 

ATCACCTGGTCAACATGGCGATAACCGGCGCTGTCCAGTTGGGTTCGTAATGCATCTCGT

GACAGGCGCTGACCTTTTTTCATCACCAGCGCATGACCGTGGAGAAAACTGTGTGGGCAA 

ACGCGCTGCATAAGCGTATTCACCGGAACAATCAGTACGCCACGCTGCATCGTCGGTAGC 

TGGTAAAGGGTGGAAAGGCGCGAGGAGATAATGTCCTGATGAGGCGAAAAACTGTCGTAG 

GGAAGAGTTTCCCAGTCCGCCAGATTCATCACCATTTGATCGGTAAACTGGCTGATTTCA 

TCATGCAAACGCAGAGCATTTTGCATATCTGGTGCAATGAGTACCACCGGACCGGCGTGA 

CGTTCGGCA 

Sequence 1888 

C GCGTC CGTTTATTTTTATGCCCCTTTTTGTGGATMGATTCTT^ 

MTTTTAAGTGACTTTCTCCAGGTCATGMGATTCAATGGGTAGAATTGAATCAGAATTG 

AMTGTTCCAGATTCATATTCTTGTGTGTGTTTGATAAAATTCATGGCTTCCAAAGTAAC 

TGM CACTTCCTTTGGGCCCTTGGAGGGAAMTCCATATTTTTACTAATTACACTTTT^ 

TTTTAGACATCTGGCAGTTCTTTGMCTTAAACATATTCTCATGGCCATAGTTCCAMTT 

MGCCCG ACGCA GTT GCTA AAAATCTTGCTGCACTGTTGMTACTMTMTGCMCATTT 

ATTGGATGTTTTTGCATTTTGATGACCTC 

TAAGTGACCCCCTCACTAGTGAAMTMTAAATGTTCTATATCATTTATTATTATTTGTG 
TATTCTCTACATG ATAT A I I I I I I I 
Sequence 1889 

CCGCGTCCGGGAGGATGGACGTACTGGTGTCTGAGTGCTCCGCGCGGCTGCTGCAGCAGG 

MGMGAGATTAMTCTCTGACTGCTGAMTTGAC(X)GGTTGAAAAACTGTGGCTGTTTA 

GGAGCTTCTCCAMTTTGGAGCAGTTACAAGAAGAAAATTTAAAATAAAGTATCGACTGA 

ATATTCTTCGAMGAGTCTTCAGGCAGAAAGGAACAMCCAACTAAAAATATGATTAACA 

TTATTAGCCGCCTACAAGAGGTCTTTGGTCATGCMTTAAGGCTGCATATCCAGATTTGG 

AAAATCCTCCTCTGCTAGTGACACCAAGTCAGCAGGCCAAGTTTGGGGACTATCAGTGTA 

ATAGTGCTATGGGTATTTCTCAGATGCTCAAAACCAAGGAACAGAAAGTTAATCCAAGAG 

AAATTGCTGAAAACATTACCAAACACCTCCCAGACAATGAATGTATTGAAAAAGTTGAAA 

TTGCTGGTCCTGGTTTTATTAATGTCCACTTMGAAAGGA 

Sequence 1890 

CGCCCCGC GTCC GCTMTTATAAGCTTTACMGTATTTATTTTATAAGGCTTAGACAGAA 

TTATTGGAGTTTTAMTTAAGTGTATTGGAAAAGAAAGGATGGTATGTGTATGAAATGTT 

MGATCCTACGCMCACTGCTATTTTTTTCCTTTMTATTTGTGCTGCATAACAAAAGCC 

ACTAGACTGTTACTGTCTTGTCTGTCCATGTGTTAACAGCATTTCTTAATGATGTATATA 

TGGAGTGGTCTTCMTCATAGTGAAGMTTTAMGAGAAAGTCAATTGTATTGGCATTTT 
TAATAAGAACAAAATTAGTTCGTCTAAGGGGACTGGCTGGCCACATATTTGTTCCTTGCC
CATATGC nTTC TACTTCTTGTTCnTATTATGAAATTATGAATTTGAAGCCTCTGAAATGG 
TGATCAGTTTTCAACATCTTTCAAAAACAAAATTACTA 
Sequence 1891 

GCGTCCGCGGCTGCTGTGTGTGAGCAGTGGACACGTGAGGGGGGGGTGGGTGAGAGAGAC 

AGGCAGCTCGGATTCAACTACCTTAGATAATATTTCTGAAAACCTACCAGCCAGAGGGTA 

GGGCACAAAGATGGATGTAATGCACTTTGGGAGGCCAAGGCGGGAGGATTGCTTTGAGCC 

C AGGAG TTCAAGACCAGCCTGGGCAACATACCAAGACCCCCGTCTCTTTAAAAATATATA 

TATTTTAMTATACTTAMTATATATTTCTAATATCTTTAAATATATATATATATTTTAA 

AGACCAATTTATGGGAGAATTGCACACAGATGTGAAATGAATGTAATCTAATAGAAGCCT 

MTCA GCCCCCATGTTCTCCACTGAAAMTCCTCTTTTTTGGGGGGTTTTCTTTCTTTCT 

1 1 1 1 iGAATTTGCACTGGACGGNGGACCGTCAGCCATGTNCAAGGATCCCCAGGGGGGGG 

GNNGTCAAMTGGCTATTGGAAAATTGGGGTGGAMTGrMTGCCTTTTTCACTTTTTGGA 

TAAATAAMCATGTAAAAAMTGNTTTCAAAAAAAATTAATTAAAATTAAATTAAATTCC 

NNAAAAAAAAAAAAAAA 

Sequence 1892 

AGAGGATTCCCAGGGTTTCCAGGGGGCCAAAGGAGACAAAGGTTCAAAGGGTGAGGTGGG 

TTTCCCAGGATTAGCCGGGAGCCCAGGAATTCCTGGATCCAAAGGAGAGCAAGGATTCAT 

GGGTCCTCCGGGGCCCCAGGGACAGCCCGGGGTTACCGGGATCCCCAGGCCATGCCACGG 

AGGGGCCCAAAGGAGACCGCGGACCTCANGGCCAGCCTGGCCTGCCAGGACTTCCGGGAC 

CCATGGGGCCTCCAGGGCTTCCTGGGATTGATGGAGTTAAAAGGTGACAAAGGAAATCCA 

Sequence 1893 

TCCGC(XCGCGTCCGCTTTTTCCNAACAAGGAGCATCCAAAGACACAGTGACTTGAGCTA 

TAGATAGTAAAAATCATACGAGAGTTGAACTGAGTCAGGTTTAGGAAGCAAGTTTGGTTG 

CATCAATTMGCAGGCTCTTTTCAATTGACTGATGCTGGGGCCTTCAGTTTTATTCTCAG 

TATAGATTGCCAGTATTGTTAAGAGTATCCAAAGGCCTTTCTAGATGGAGACAGAATAAC 

TGACTTGAACATACAGTGTGCCTGTAAGTGTCCAGGCTCAGAGCTGGTGAAAACCCTTCT 

GTTGGGCGTGTGCAGGGTTAAACTCCTGAAGTAACTTGTGAGGACTTCAGTGCTTGCTGG 

TGTCCTGGGCAGCACCATGAATGCCTTTACCAAGACATGCCAAGTTGGATCCCCCGAATG 

AAGCAAGAGTGGCTTGTGGGTGTGACCCTTGCTCCCTGCTACACAGAAGCATCGCAAGGG 

CTGCCTGTGTNGGTTTCCAGATGAAGGGTCTTGGGTCCCGGAAGCTTTGTGGTTGAGAGC 

TCAAGTGGGACC 

Sequence 1894 

GTCACCACGCGTCCGCGGACGCGTGGGCGCACGCCGGCGGCGCGGAGGCCGGCTCTGCGC 

TTCGGGCCGCCCCCTCCCCCCACCCCGCTCACACCCGGCACTTACTTCGGCTGTCTCCGC 

TGCCCTCCAGCGGAGACGCAGCTCCTCAGGCGCCCGGCGGTATTTGTTGGGTCGGCGGCG 

TCAGGGATTCGCAGTGGCCTGTGGTCGGCGTCGTCCGGCCACTGGTGCGCCCCCGCGGCA 

GGCAGAGCTCACGCTCCTGTCCCCCGGCTGGTCCGGGGTCTGGGCGCCGCGTCGACGGCG 

GCTCCGCAGGACGCGCAGACCGGGCCGCAGCCCATGCC 

Sequence 1895 

NCCCCGCGTCCGATTTAMTGCCGTAMTTTTAAATTCATACCTTTCCATGATTCAAAAT 

TCAAAAGATCCCATGGGAGATGGTTGGAAAATCTCCACTTCATCCTCCAAGCCATTCAAG 

TTTCCTTTCCAGAAGCAACTGCTACTGCCTTTCATTCATATGTTCTTCTAAAGATAGTCT 

ACATTTGGAMTGTATGTTAAMGCACGTATTTTTAMATTTTTTTCCTAAATAGTAACA 

CATTGTATGTCTGCTGTGTACTTTGCTATTTTTATTTATTTTAGTGTTTCTTATATAGCA 

GATGGMTGMTTTGAAGTTCCCAGGGCTGAGGATCCATGCCTTCTTTGTTTCTAAGTTA 

TCTTTCCCATAGCTTTTCATTATCTTTC 

Sequence 1896 

CGACCCACGCGTCCGCCCNGCGTCCGAGGGGCAACAGCAGAGCCTACAGCAGGGGGCACA 
CTCCACCGGNTCCAGCCGCCTGCACGACCTCTACTGGCAGGCCATGAAAACCCTGGGAGT 
CCAGCGCCCCAAGTTGGAGAAGAAGGATGCCAAGGAGATCCCCAGTGCCACCCAGAGCCC

CATCAGTAAGAAGCGGAAGAAAAAGGGATTCTTGCCAGAGACGAAGAAGCGCAAGAAACG 

CAAGTCAGAGGATGGCACGCCAGCGGAGGATGGCACACCTGCAGCCACCGGCGGGAGCCA 

GCCCCCCAGCATGGGCAGGAAGAAGAGGAACAGGACAAAGGCTAAGGTCCCAGCCCAGGC 

AAACGGGACGCCAACCACCAAGAGTCCAGCCCCTGGCGCCCCCAC 

Sequence 1897 

ATTATA TACTT CTGMTGGCACCTTACTTTTTGGAMCAMTCTTCTGTTATTTACAAAA 
TMTMTTTTTAAAAAACATAAAAAAAAAATCCAAAGCTGCTCTCGATAATAGTCAACAT 
TTGCATATATATGGAATTTCTTACTTTTTTTCTCCCAM 

TMTGTTTGTGTA7TTCATGTATAATTGTGATCTCM7TATAAAAGTTTAATTCAGCATG 

TCTTTGAGCCAATATAATTACTGCACACCCACTAAATTGGGATCAGCCATTATAAATAAT 

GTAGTmAGAATMTAAMCATGACACATATATATATATATAMTATATAGTATATATT 

GG CACATC GGTGAAAGTT TAATA TGTGCAGGMGGTTTTTTTCTTTCTTCMGTTAAAAA 

TTAu i 1 1 iGCCATATGTAATTTTGGTGTTGCAGGCTGGTCGAGAGGATAAAAAATGGAT 

TTTAAATCTGGGTACCGGATGGATCTTTCNGGNGGTTAAGAAACACAGGGGNTGNGGACC 

TTCCTTTTT 

Sequence 1898 

CCGCGTCCGMTATAGTATTTT7TMTTTTTGTGGGGATGGATTCTCAAATACTTGTGAT 

TTTAAMGATTCTAAAGCTAAAACACAACTTGATTTTAAAAAGAATGATTCTCCTTACAC 

MTTATAMTATTTGCAGTAAATATTTTCCTTATAATACTGTTTTGACCCCATTTAAAA^ 

GTATTAGATTATATTCCTTTGATCCMTGAAAACTGAACCTTATAAATGGTTAGCTGAAA 

GTAGACCTTATTCTTGTCCTTCTTTAGAAGAGTAAAGATTTGTCCTAGGGAAGATGGCTG 

ACTTCGGTTCCCAACATGCCGTATGCATTTAGACTGTAGCTCCTCAGCCCTGTGGACACA 

AMTTTGGACAGCTTATTAGGNTACCGTTAGCMTGCTGGACCGGTTTCTTCAACACTAA 

AGANTTTCACCGTTGNAACAGATTTCCTGGTTCGTCTNATGGGGNCTGGTAAAAATGGT 

Sequence 1899 

GCCCTGGAGGCTACTTGTAAATCCTTAGAAGAAAAGCTGGATCTGGTCACGAACAAGCAG 

CACAGCCCCATCCAGGTTCCCATGGTGGCCGGCTCCCCTCTCGGGGCAACCCAGACGTGC 

AACAAAGTGCGATGCGCTGTGCCTGGGCGTCGGCAGAACACCATTGTGGTGAAGGTGCCG 

GGCCAAGAAGACAGNCACCACGAGGACGGGGAGAGCGGCTCGGAGGCCAGCGACTCTGTG 

TCCAGCTGTGGGCAGGCGGGCAGTCAGAGCATNGGGAGCAACGTCACGCTCATCACCCTG 

AACTCGGAAGAGGACTACCCCAATGGCACCTGGCTGGGCGACGAGAACAACCCC 

Sequence 1900 

NCCACGCGTCCGCCCCGCGTCCGGGCCCGGCGCGCCTGCTCTGGGCTCTCCGCGTGCCGC 

ATCGCTTTCTTTTCTTCCTCTGGAGCAGCTATGGCGGCGGCGAAGACCCTGAACCCCAAG 

GCCAAGGTGGCCGGAGCGCAGGCGGCGCTGGCGTTCAACATTAGCGGGGCGCGGGGTCTG 

CAGGACGTGCTGAGGACCAACCTGGGGCCCAAAGGGACCATAAAGATGGCCTTCATCCCA 

GMTMTCACTGMGGATTTGAAGCTGTGAAGGAAMGCCCTTCATTTTTTGGAAGAAGT 

CAAAGTAAGCAGAGAGATGGACAAGGAAAC 

Sequence 1901 

CCACGCGTCCGCCCCGCGTCCGAAACATGAGGTTCTCTCTACTGGTCCTCTTAACTGTGG 

TGTTGAGGCTTATATTTGTGTMTTTTTGGTGGGTGAMGGMTTTTGCTAAGTAAATCT 

CTTCTGTGTTTGMCTGMGTCTGTATTGTAACTATGTTTAMGTMTTGTTCCAGAGAC 

AMTATTTCTAGACACTTTTTCTTTACAAACAAAAGCATTCGGAGGGAGGGGGATGGTGA 

CTGAGATGAGAGGGGAGAGCTGAACAGA 

Sequence 1902 

GTCGCCCCGCGTCCGCCCTCCCTGGCAAATMTATAATMCCCGTGMTTTTCAGGAATT 

TAAAMTTANGCTTTTTTCCACTTAAAGGAGAAAAATATTTGGGACTAGCAGCAGAGGCA 

GTAAGAGATGTGAACCTTGGTGAGCTCTGATACAGTGAGAAGAGATTATACTCATGAAAG 

AGAATGTTAGTGTTACAGAGAAGCAGCCGATAGCAAATCGACTGTAGAGACTTGGCGGCG 

GTGGCATTGCCCCAGGTCGTCAGCAGTGTGGTATTATCTATGAGAACTTGAGCGACAGAG 
TAT TTCnTG ATGMTTTATAGATCATrrGAGATGTTGAGTTACTTTAGTTTAGTTTTGTT

TTGTTTTTTCAAATAAGTAGAGACTATTTGTAAAAAACGAGGAAAGGGAAATGAAATGGG 

GCGTGTTTGATAG CMTAM TTTGGTTTCTTTTTAAAGAATTCTAAAAAGGGTCTGAGAC 

CCTG^AGCATTMTTTTTTGAGTGCCCTTCCTTTTTNCCCT^^ 

TTCTCT 

Sequence 1903 

GCGTCCGCCCCGCGTCCGGGAAACCCCCTTCGATGACCTCCAGAGCCTCCCAAACX3ACGT 

GATCTCTTCCCTGAAGAACAGGCTGAAAAAGGTCTCCACAACCACTGGGGATGGTGTGGC 

CAGAGCGTTCCTCAAGGCCCAGGCTGCTTTCTTCGGTAGCTACCGAAACGCTCTGAAAAT 

CGAGCCGGAGGAGCCGATCACTTTCTGTGAGGAAGCCTTCGTGTCCCACTACCGCTCCGG 

AGCCATGAGGCAGTTCCTGCAGAACGCCACACAGCTGCAGCTCTTCAAGCAGTTTATTGA 

TGGTCGATTAGATCTTCTCMTTCCGGCGAAGGTTTCAGTGATGTTTTTGAAGAGGAAAT 

CAACATGGGCGAGTACGCTGGCAGTGACAAACTGTACCATCAGTGGCTCTCCACTGTCCG 

GAAAGGGAAGTGGAGCAATTCTGAATACTGTAAAGACCAAAGCAA 

Sequence 1904 

CGTACGGGGTGCGGTTGGCGGCGGCGGCTGGGCCGGGGGCTGCCGGCTGCGCTCGGGCCG 

TGCGCGGCGGCCGTGCGGNCACGCCATGGACTTCAACATGAAGAAGCTGGCGTCGGACGC 

GGGCATCTTCTTCACCCNGGCGGNGCAGTTNACGGAGGAGAAATTTGGCCAGNCTGAGAA 

GACTGAGCTTGATGCCCACTTTGAAAACCTTCTGGCXDCGGGCAGACAGNACCAAGAACTG 

GACAGAGAAGATCTTGAGGCAGACAGAGGTTCTGCTGNAGCCCAACCCCAGTGCCACGAG 

TGGAGGAGTTCCTGTATGAGAAGCTGGACAGGAAGGNCCCCTCAAGGGTCACCAACGG 

Sequence 1905 

CNCGCGTCCGGTGCATCTTGCCCATTGATTTCTAMTGTATTAACTACTTAAATTAATCC 

TGAATCTTTTCCCAGGCTTAAGTGGGATMTGTTTTATTGTAGATGCATATTTCCTGGCT 

CTACCX3AGTCTTTCTTTGMGACTTTATCATCCTATTTTCTGAATCCAGTGGCTGACTTT 

AATCTTCTCTGGAGGAACTAGATAATTTCTAGACTAATGCTTACACTCATGATCCAGATT 

GTAATTTCTGAACTCCTTCTTCCAAATAGAATCAAAACAAGAAAGGGGAAAGCCTCTCAA 

AGCMCTGTGCGTTAATAATGAMCACTCTTTTTTTCTAATCCAAGGAGGGTTTCATACT 

TTTTCTTAGTTTCTTG CCCTCTTCCCTTCTG ATCAATAATTGTAATAG GG AAATTTGCAA 

TTGTGCCAATACTCAGATTCAATACTGAACTACTTTCTTGCATTGGAATTCAAATTCCAA 

GGTTMCMCTAGCTGTATGTTTCCAAAACAATCTTATTGGATATGGATTTTCTTAGGGG 

GAAGGTTCCAGAAATGATTT 

Sequence 1906 

GANCAGGCTCAAGAGCAACATGGAGGTCTGCACTTAATCGCTCCTCTCCGGGGGCGGCCA 

TACCGAGGAGGCGTCTCTTCCGTGCAGGCAGGCTCTCCTGGGGACCTCAGAGATTCTCTC 

CAGCGGCAGCGGAAAACGGACAATGGGTGGATTCGGGTCCAGATTCTGGTAGGAGGGAGT 

TTGGGATCGAGATCTGGAAAAAAGCACTAGACTGGAAGAGGACGCGATGGAGTCGGAGCC 

GCTGGCGGGGACAAAAACCAGAGGCCGGGGAAGGCGCCGGTGGGAGGCAAGGCACGGATG 

GACTTTACCTGCGCACGCGTCGCAGCCATCTCCGCGCACAGTGGTGGCCACCGCGACTGG 

TGCTGAAGTGTTGGCCGCGTGCCGGGCGCTCCGCTGGGACCCGGGTTGCTGGCCCTGAGT 

CTCAGCTTTCTCATCTGTACGGTTGGGACAAGTACAGTAACCCTCTCCCGTCAAGACGGG 

CC 

Sequence 1907 

GTCGACCNCGCGTCCGAACATCGTCAACTACGGCATCCCAGCCCACCGTGACATCGACGA 

GTGCATGTTGTTCGGGTCGGAGATTTGCAAGGAGGGCAAGTGCGTGAACACGCAGCCTGG 

CTACGAGTGCTACTGCAAGCAGGGCTTCTACTACGACGGGAACCTGCTGGAATGCGTGGA 

CGTGGACGAGTGCCTGGACGAGTCCAACTGCCGGAACGGAGTGTGTGAGAACACGCGCGG 

CGGCTACCGCTGTGCCTGCACGCCCCCTGCCGAGTACAGTCCCGCGCAGCGCCAGTGCCT 

GAGCCCGGAAGAGATGGGACGTGGACGAGTGCCAGGACCCGGCAGCCTGCCGCCCTGGCC 

GCTGCGTCAACCTGCCGGGCTCCTACCGTGNGAAGTGTCGCCCGCCTTGGGTGCCCGGGC 

CCTCCGGCCGNGATTGCCAGCTCCCGAGAGCCCGGCCGAGCGTGCCCCGGAGCGGGCCGA 
ACGTGTGCTGGAACCACGCGGAAGAGGACGGNATGTGCCCTTGCCCCCTGGCCCGGGCC
Sequence 1908 

ACCACGCGTCCGGGCGGCCCGGCCAGGCCCCCGGCACTTCCTCGTCCTCGGCCCGGGTGC 

CCTGCCCCCGTCCAGGAGCCCTAGGAGTGCTACGGGGGGCCGGAGCCTTGCCCGGGCCGC 

TGCCCCGTCCCTGGATTCGGGGCTGGACGCAGCAAGCAGGNGCGCTGTGTCCCCAAGCTC 

CCCGTCCTCGGGGGAGCTTTTGGAAGAGTCCCAGATGGAAGCGACCAGGCTCCGGCAGAA 

GGCAGAGGAGCTAGTGAAGGACAACGAGCTGCTCCCACCACCTTNTCCCTCCTTGGGCTC 

CTTCGACCCCCTGGCTGANCTCACAGGAAAGGACTCAAATGTCACAGCATCTCCCACAGN 

CCCTGCATGCC 

Sequence 1909 

ACGCGTCCGGMGGGTGTTACATGTGTCTCTTCAATACCTTTGGTTTTGGGAAGATCTCA 

GGAACGGCCTGCCTCACCGTCTATGTACAGCCCATAGTATCCCTTCACTACAAATTCTCT 

GAAGACCACCTAAATATCACTTGCTCTGCCACTGCCCGCCCAGCCCCCATGGTCTTCTGG 

AAGGTCCCTCGGTCAGGGATTGAAAATAGTACAGTGACTCTGTCTCACCCAAATGGGACC 

ACGTCTGTTACCAGCATCXJTCCATATCAAAGACCCTAAGAATCAGGTGGGGAAGGAGGTG 

ATCTG CCAGG TGCTGCACCTGGGGACTGTGACCGACTTTAAGCAAACCGTCAACAAAGGC 

TATTGGTTTTCAGTTCCGCTATTGCTAAGCATTGTTTCCCTGGTAATTC5TTCTCGTCCTA 

ATCTCAATCTTACTGTACTGGGAAACGTCACCGGAATCAGGACCGAGAGCCCTAAATAAG 

TCACACAGCACCCTTGAAAGGGGATTCCTGGNCTACTTGGATTTGGCACAAGAGAAAAAG 

CAGGAGGGAAAGG 

Sequence 1910 

GCGTCCGCTAGTTCTAGATCGCGAGCGGCTGCCC I UN IC TTTT CI 1 1 I 1 1 1 II 1 1 1 1 1 

TGAGACACAGTCTCACTCTGTCACCAGGCTAGAGTGCAGTGGCACGATCTCAGCTCACTG 

CAACCTCTGCCTCCCAGGTTCAAGTAATTCTCCTGCCTCAGCCTCCCAAGTAGCTGGGAC 

TACAGGCACGTGCCACCACGCCCAGCTAATTTTTGTATTTTTAGCAGAGATGGGGTTTCA 

CCACATTGGCCAGGATGGTCTCGATCTCAACCTCGTGATCX3ACCCACCTCGGTCTCCCAA 

AGCGCTGGGATTACAGGCGTGAGCCACCGCGCCAAGCCAAGGTCTGCATTTTTCTTTAGA 

ACTCAGAACACCCAATAGTCCTAGGCCCCCATCCTCGCATGGCAGCAAGCTAAATAAGCA 

TNTTCCCACTGCGAGTTGGGG 

Sequence 1911 

GTCCGCTGAGAATGGATAATCTCACTGCAGGTATTCCATAATAGGCTTGGATTAAAACCA 

GGGACAATGGGCAGCAATAGGCCATAGGTTAAGCAGCAGCAACTAGTCACCACTGGACTG 

TCTTCTTCTCCCCTTCCCCATATCCCACATTCTCCTAAACATGATGTACGTGTAGCAACA 

GTCTTTTAAAGTCAGATGGTCAGACTMTTATTTTTACAATTTAAGTGTAAGTGATGTAC 

ATGAATGGACTCTGTGAATCGGAAAACTTACGTAACAGCAGAGAATACGTATGTTATATG 

G AATAACCTGAGTTGAAGGTACAA I I 1 1 1 1 1 1 CCAGCTCTTTTATTCCTTTAACTGCTTA 

ACAAAAGAAAGAGTCTCCAAAGTTTAAAAAACCTTTGAAAAATATACAGCTTGATATTAT 

TTACATAAAATATGAATCCAGGTTCCAATATCAAACAAACATTGCTATGTCAGAAACACA 

GTGGAAGGCAGGAACGTAACTCACTGCCTTTTAGAT 

Sequence 1912 

CCCGCGTCCGCTCTTTTCTTCTCTNTAAAGTGAATTATTC C I I 1 1 1 1 1 I GTTTTATGTAA 

CGTGTATATATTCTTAGTTTTTCTTGAMTCATTGTAATGTTMCTTTGTTGTTTCAAAT 

ATCTTGGTGATTGCTTCATTATCTCTTCAACAMAAAMCCTTTAATTTTGCCATTGAAA 

CTGTAGAACTATGCCATGCTTTTATTAGAAGCAGTGCTCTGTGTTAACAACAAGAATGGT 

GTMTTAGAATTGGGATGTGGATATTTACTGTATGACAACACATTTACAGTTCTGTAATG 

CMGGATGCAGTTTAAAMTGTGAAGTAGTGATGGTTTTTGAAATAAGCTTTAAAATATA 

GGGGATCTTGAAGGCTCCCTGGGGTAACTATTTTATAACTTAGATAAAATGGCTAGTCAT 

ATCTGTGTGGTTTGNAAAGGTTA7TTTTTTAATATTTTAAGATTACAATTTTACAAATGT 

AG AAATG AG CCCAACTATTTAAATTTTAAAAC AGTAAAAACAAA 

Sequence 1913 

CGACCCACGCGTCCGCTTTGAAGCAGGAAAAGACACGTCTCTAGAGCAACATGGAAAGGA 
TCCAGTATTTGTTTTCAGGCAAAAGCAAGCCTCCAAAAGCCCCTTGGTTCTTCTCTGTGT

GTCTCGCCCCTTCTGCCTTGTACACCAGACTGCCGGCAAGCAAGCGGGGAGGTGGCGAAC 

AGCTGGGATCCTCCAGTGAAMGTTTGCTCAGCGCGAGAGATGTCAAGAGGCTGGGATTG 

TCGCCTATCGGGAGGTGGGGAGCTCTCTCACTCACACTTCCTGGGAAATGAAGGAGAACT 

MTAGGCAGCCCGCATTTGCAGCCAGCCMCCTGGGTGGTATTCCTTGAAGTGAAGCGCT 

CATAMTCCCTTCTGGGGCCAGAGCCGGTTTCACMGCTGCAGGATTTGGGTGGCCTGCC 

TGCTCTTTGGGAGCGGGCTTGGGAAGCTGAAGAAATTGAATCAGGACCTCAGGCCCTTTA 

CAGACTCCC 

Sequence 1914 

CGCGTCCGCCCCGCGTCCGCTCAGAMCCCCTGCCCTTTCCCTTCAGAAAACGATGGCAG 

GCATTCCTCTGAGTTTAGAAGCAGAGACTCACTCCAACCCAAACTAGCTGGGAGTTCAGA 

ACCATGGTGGMTAMGAMTGTGCATCTGGTCTCTTCTGTTGTTTTTATTTCATATCAG 

ATTAMTTTCTTTACCATGTTGGCTAAGTCTAAATATTAGAGATGAGGCTGTGCCTACTC 

(X)TGG(X;AGCTCTGCTGATAGCCTATGATGGGTTCCAATGGGAMTGACTCTTTACTATT 

AAAAGACAAGGAAAGCTCTGACTTCGTACTTCTCTGATGAATGGCAATGTAAATGAACAA 

GGCTCCATGTGACTGGAGCATGGAAGTGAATGCTACTTTCTTAATTTAATCTGCCCTGTC 

CTACCTGCTCCTCTGATTGTTAGCCATCACATAACTTATTGAATGCTTGCCATGTGCCAG 

GCACTGTGCTGAGTGCCATACATACATTTCATTTAATTATCCAATAATCCTACTTACTA 

Sequence 1915 

CCGCGTCCGATTCTINTTAACATCTCTGTGAGGMG 

AGGAAGCTGTTTGGAGATMTTTAAGTGACTTGCCTGGGGAAtCTAGCCAGTAGTAGAGT 

ACTGATTAATCAGGTGCTGACATCTGCTCTGCTTTGTGTATGTAATTCAGCAGTGCTTCA 

AAGATCCAAGAAGCTGTAGCAGATCTCAATACACTCTCCTATAAAATTAGTGAATAATCA 

CCATGACAAAATTGGTATGGCGGMCAGTCATTATACATTATTTAGACTCATTCCTTCTT 

CCAGTGCCCTTATGATTATTTCCTACCTTTACCATTGG 

Sequence 1916 

CCNCGCGTCCGCCCCGCGTCCGCCCCGCGTCCGATATTGCTTCAGAAAACTGAATGTGTA 

TGTCGGTCATATTGCCTTTATAACCATGCTAATATCTATGCTTTATACATACTCAAACTT 

GCCTTGCCTTAAAAAAATACATACTACATACTTAAATCAGGAATTCTAGCCATCTCACAG 

AATACCMCTAAAAC TAAGT GCATTGAGATCTGAGATTGGTAAACCCAGATTCATTTACC 

ACAGCTGTAATTAAGTTTTTAGAMCTATTCTCTTTTTGGGGAMTCCA 

TTCTGTTATCTTATTAGMGMAATGATGTTGATATGTGTTTCAGATTTTCCATTTGAAA 

TCTTATAATTAATTT GATT TATTTACTGTAAGTAGGAGGTATNAATGACACTCTTAAATT 

GG AAGGA GGGGTGTTTTAGTGTCTGTNTTAGGTCAAMTTAGTGATTCTATTTTTATCAA 

AAGTTTTATCCTGMGTTTCAGGACCACTCTTCCTTMTNAACTTGTTAATGGGAAGCGA 

GCCTTATGAACATTTAAAAAT 

Sequence 1917 

CGCGTCCGCAG CAGTAA TCCTTTAAT MCTGG CACGAGCACTTTATTCTTCTGGTGAGCT 

CCCTGMTATTTATTTTTCTGATTATAMTTTTCTATATTAGTAGCAT^ 

ACTTCTTCACTATAGAGCATTTACTTTTAGTCTCTAGATGTATATTTTGGAATGCTGTAC 

T TGGC AT MCATAG ATTAAMTCATMTGCATGACTAAMACTCCTTGGATTTATTTCCC 

ATTTTAAAATTTTTAGCGGTMGTTCAGATTTATMTCTTTCTCTAGACTTCCATGGTCT 

GAATGTTGCCTGCTGAAGTAGCAACCTAAAAAGTATCCCCTGCTTATGCTTCTCCAGTTG 

GCCCTCCATGTCCATAGGCTTCGCATCTGTGATTCAGCCCACTGTGGGTCAAAAATATTT 

GGGGAAAA 

Sequence 1918 

GCCCGCGTCCGCATCCCTTGTATACCATCGTAGACTTCATACTGGAGAGAAACCTTACAA 

ATGTGMGMTGTGATGAAGCmCAGTTTCAAATCGAACCTTGAAAGACATAGGAGAAT 

TCATACTGGAGAGAAACCTTACAAGTGTAATGATTGTGGCAAGACCTTCAGTCAGACATC 

ATCCCTTGTATACCATCGTAGACTTCATACTGGAGAGAAACCTTACAAATGTGAAGAATG 

TGATGMGCTTTCAGTTTCAAATCAAACCTTGAAAGACATAGGATAATTCATACTGGAGA 
GAAACTTTACAAGTGTAATGAATGTGGCAAGACCTTTAGTCGGAAGTC
Sequence 1919 

NGACCNCGCGTCCGCGCTCCGCTGCCNGGGGCGGGAGGGAGGAATGGTTGCTTCACGCCC 

CGGGGGAAGAGACNGGAAGCTCGGCTCTGGGTTGCGGGCCCCGGGGTCTCCGCGTGGGGC 

GCACCGTCCGACCCGCCCCTCCCGGTGTGCAGCGCCCCGCACCGCCCCGCCTTGCCTGGG 

AGAAGCCCGGCGGGACGCGCCGGGCTGGAGTGGGCGGTTATAGGCTTTGAGCTAGGCCGC 

TTCCGGGAGGCGGAGCTCACACCCCATTTCCTTT 

Sequence 1920 

GTCCGTTCCTTGATTCCTGGAAGTCCAGTGGGTTCTGCAGCTGAAAAAGCCCTGGGTCCC 

AGCAGCAGAGAGACAGGACAGAGGGGATGCTTGGGCGGGGAGGGACGGTAACCTGCAGAA 

CAGATTCCATTTTTATAGMCGAGTACACGTTTGCTAAAACAGTCCTGCTTTCCCAGACT 

GGATTCCCACCACAGGGACAGTCGGAACTCAGGACTAGCTCCAGCGACATCTTTCCTCCG 

MTTCMGCCTTCTATCACMTGTCAAMCAGCTATTTATAAAGCCATTTTCATTGTACT 

TGATMCAGCAaSAGTCCCAAMCTTTTAGAMTAAMTAGGACATTGGCTTGATTGAAA 

AGAGGGACTTTTTAAAMTTGTTCTTTCGTCAGAAAGCCTTTTGGATGACT 

Sequence 1921 

GCGTCCGAAAAAAATAGCATTATACCTCTTCCTTGTCTCAACCGCCATGAAAATTCTGAA 

CACTCCAAATTCAGTTGAATMTCCAAAACAAMTTTATAAGTATAAAATAATTTTACTT 

CTTATAGTAATAGTATACTTTAAAAAGCCTCAGGGTATATTATCTTCTAAACAGCTACAA 

7TCAGTGCAGCTACATTAACCAACTATGTTCTCTAGTTGAGAACAACTAGGCCTATTTCA 

CTGCTGTGTAGCCTCAGTGCCTAACATGGGTGCCAAATAAATATTCGTAGAATTACACTG 

AATTGTAAAAACCATTCGTTTTTGTTTACAATTGCCAAAAATCTCAAAAGGCCCTGTATT 

TATGTMTTCTTTGAMTTATTATTTTAT7TTGATTTCTCAGTTATTGACTGGTGGGGTG 

TGACTTAGTCATAAGTACTCAATATTATAAAAACCTCAAATAATTGACTTGGATTTTACA 

CAACATCCTTCCCTTTTCTACAAGTTAA A I 1 1 1 1 1 1 A CC 

Sequence 1922 

TTGGTATTCTTGGCTAATTTCTTAGCTACTTGMGGTTMTTTGCAAGACTTTTAAAACC 

TTAGAAAAGTTTTAAGGTTGCAAAGTTATCAACACTGGGGCAGAGGGTGGAGAGGCCAAT 

GCGGGTAGAAGGAGGCAGTTATGTTTATATTGMGGTGAAATTTTTCTTTCATTTAGAAT 

GGAAAACATCCCCAAATGTATCATTATAAACTAGTCAGCCTTGACTACACAAAATTGACC 

TTTAAGTTGCTTGAGAAAAACACAATGCAAATCGTTCAGAAGGGTCAACATCCTTTGGTG 

CTAAMTCTTGTGTATGTTTTCAGMTGGCTTTTTCTGTATGTTATAGAATAATCACTAA 

AGGAMGGTAGTTGAATTTAAAGTCATGMGCAAGACTCTTTMTTCAGTTATTTTAAAC 

AAG AATTAAAACCCC AAACATCCTTGGCAGG CTTTGAAGC ACAC AG AATTTTCTAG N ATT 

TCTTATTA 

Sequence 1923 

CNCCGCCGGAACMCAACAGAAAGCTGTGTTTGTCTTTTTTCTCTCAAATATATCTCCCG 

TATGAGATTTCAGGTCCCCATGTTTTCACCAAGCAATCTGCTATGTCAGCCAACCCANCA 

TCACTTTCTACAGGAGGTTATGATTTTTGCCATTTACTAGAGGAAGATGTTTTATGAAAT 

CAAGTTGGGGTTTGMTTCAGGTGCAGTCATCAGTTCTTTAGGGGCTGCAATGTTTTAAA 

AAMAATMGTCATCAGATTTTMGMAAMGTGATGATTTCTTATTGATATTTTTGTAA 

CAGMTATAGGCTCTTAACTGAAMTCCAGMCCAGMACATAMTCTTGAGTTTCTTTT 

CATGTACATAAAAAGCAATAGCCGTTTTAGTATAGGATAGCCCTGAGCCCAAAAAGTMT 

AGAMTTTTCTCTAGATATTTTAATACAGAGAGTGTATAGACTGACTCTAAGTTAATAAA 

TGTGCAAAAATATCTTAAAACCATNCCCTNCCCTTTATTTCAAC 

Sequence 1924 

CCNCAAAAAGGAACCAGAGGCCACTTGTATATATATGGTCTCTTCAGCATTTATTGGTGG 

CAGAAGAGGAAGATTTCTGAAGAGTGCAGCTGCCTGAACCGAGCCCTGCCGAACAGCTGA 

GAATTGCACTGCAACCATGAGTGAGAACAATAAGAATTCCTTGGAGAGCAGCCTACGGCA 

ACTAAAATGCCATTTCACCTGGMCTTGATGGAGGGAGAAAACTCCTTGGATGATTTTGA 

AAGACAAAGTATTTTACCGGACTGAGTTTCAGAATCGTGAATTCAAAGCCACAATGTGCA 
ACCTACTGGCCTATCTAAAGCACCTCAAAGGGCAAAACGAGGCAACCCTGGAATGCTTCG
TAAAGCTGAAGAGTTAATCCAGCAAGAGCATGCTGNCX3AGGCAGAAATCAAGAANTCTGG 
TCACCTGGGGAAACTAT 
Sequence 1925 

CCACGCGTCCGTCCGCCTCCAGCAGCATCCGCTTCAGCAAGGCCTGCCTGAAGAACGTCT 

TCTCGGTCCTACTCATCTTCATCTACCTGCTGCTCATGGCTGTGGCCGTCTTCCTGGTCT 

ACCGGACCATCACAGACTTTCGTGAGAAACTCAAGCACCCTGTCATGTCTGTGTCTTACA 

AGGAAGTGGATCGCTATGATGCCCCAGGTATTGCCTTGTACCCCGNTCAGGCCCAAGTTG 

CTCAAGCTGTAAGCACCATTTACGAGGTCATTCCTCCTCTGACAAGCCCTGGCCAGCCGG 

GTGACATGAATTGCACCACCCAGAGGATCAACTACACGGACCCCTTCTCCAATCAGACTG 

TGAAATCTGCCCTGATTGTCCAGGGGCCCCGGGAAGTGAAAAAGCGGGAGCTGGTCTTNC 

TCCAGTTCCGCCTGAACAAGAATAGTGAGGACTTTNAGCCGCCATTGAT 

Sequence 1926 

GCGTCCGGTAAGTATTTTGGAATTCAACCCTCGAATTA 1 1 1 1 1 1 CTCATTTCAGCATAGT 

GATAGGGGATGCAATGAGGCTTCATTA 1 1 1 1 1 I ATGACCTGCCCCTCATTTGCTCTGATG 

TTCCCTAAATTCTGTMTCATATCATMCTTTTGTTATGAATAGAGAGGAATGGGCTCAC 

TGAAA(XTGACACTAGAAATTGGTGGGTGATGCTCATAACTGCAAACACTTAGCTTATTG 

MGTGCCTCTATTTACATGTTCTTrAGTTATMTATGTATTTTTCTAACAGAAATACACG 

TCTGTAATTGGTATATATTATACTTTGTATGTGTGACAACAAAAGCTAAACAGAGGCTAA 

AGTCTTTAGCAGAGAAGAATGAATTN 

Sequence 1927 

AACTGTTTGGGAAAATACGTTGAGGGAGAGAAGACCTTGGGCCAAGATGCTAAATGGGAA 

TGCAAAAGCTTGAGCTGCTCTGCAAGAGAAAATAAGCANGACAGAGGGATTTGCTCTGGA 

CAGANATGGAAGAGCCNGGGAACAGAGAAGTGTGGGGAAGAGATAGGAACCAGCANGATG 

GCAGGGGCAAAGGGCTCAAGGGTGAGGAANGCCNGTGGGACCCCACAGANTATGGGGAGA 

TAMGGACATTGCmGCTTTTGGTGGCACCGTAAGCTCCTTGACTGTCTNCAGCACCCA 

GAATCTCATTAAAAGC7TATTTATTGTACCTCCAACCGGCTTGTGTGCAATGGGGGTCTT 

TTTGTGGAAAATCAANGAGCANACAGGTTTTCATGTGTACTGTCACCACGTGGGATGNGA 

ACCAGATNGCATGGAANCAAGACGCTAAATGNAAGAGGGCCATAANGGNTGGGATTTCCC 

AGGCNCCTTAAGAACAGCTTGTC I 1 1 1 1 1 1 1 1 1 1 1 CCTTTCCAAAAAA 

Sequence 1928 

CCGCCGGTMGTTAAAGACTTTAAGGACATTCAAAGTTTAAAATAGTGTTCAAATTGCAA 

MTTTGGCAATCTTCATATAAATTGGTTTCTTTTCTMCTTTTCAAAAACTAACATTAAA 

TGTCMTTATAGGAAMCATAGTTGGAMTGTMTCATCCAMGATCATTTTTAAAATGA 

MTTTAATTAGCACATATTGMCATTTGACTTMTTGTTAMCCCCAGTTTTGTTTTGTT 

TTTTTAATCAGATTTTTGCACACTGATTAG 1 1 1 1 I GTGTTGTGGCTTTTGTTGCTTTATT 

ATTCAAGGI 1 1 1 1 1 1 1 1 1 1 1 1 ICTTCCCCATGGGGGAGATTGTCTTCCAATGTTTAACTA 

CGTTTAMTAMTAAAMTTGAATTTTATTGNTCATTTATATAAAATCTGATCCTTGATG 

TMTTTCCMTACAGTTCCMTTTTATGGCTTTATMTTACMTGATATTTTCTTCTATA 

ATAMMCCAMGTAMCATTTAAATGGGGAMCTGATATTTTTCATTTATATGAAGTAT 

NAAGCCCTCTACTGGGGTCNTTATTGGNGAATCATNCTGCCTTCAAANTGGTTTCAAAAN 

TGGGTAG AAANAAAACC I 1 1 I I 1 1 1 IG 

Sequence 1929 

ccgcacaccctaaagaaaataagtatcccagtcgacatcagtgacagtgatatgatgctg 
aacatcatcaacagctctattactaccaaagccatcaagtcggnggtcatctttggcttg 
caacattgccctggatgctngtcaagatggtacaagtttgaggagAatggtcggaaagag 

ATTGACATAAAAAAAAATATGCAAAGAGTGGAAAAAGATACCTGGGAGGCAATCATTGGA 

ANGACTTCCTGTGTCTTGCGTGGAGTCATGATTAACAAGGATGTGACCCATCCACGTATG 

CGGCGCTATATCAAGAACCCTCGCATTGTGCTGCTAGGATTCTTCTCTGGAATACCAANG 

AAAGGAGAAAGCCAGGACTGACATTTGAGATTACACGAGAGGAGGGACCTTCACCCCGAA 

TTCTCNCAGAATGGAGNGAAGGAAGTACATCCCAGCAGCTNTGTGAGGGACCATTATCCC 
Sequence 1930

l I l 1 1 1 1 M T ACAACTCAAAAAAAGAMTCTTTTAATAAAAATTACTCATAAAAATCCTA 

ATAMTTTTAAAGA GCAG NGATATTCCTTATTACATTTATAAMGAACATTTGGNCCTTT 

TACAAAAAGATCCCTTTTAATTNAMTACNTTTCTTATTTACAGATTAAACATAAAATAT 

C ATNT ACAGTTGCAMGCATATTGCACATTACAGAGCAMGCATTNGNGTATTTCCGNAA 

GTTTTCCCAGAGTTCCCAACTCTATACI I 1 1 1 1 1 I GTAAAAAGATTTACCCTTGCTTATG 

CAAAMTAAMTMGAMTGCNANCTGNCGGTTTTGCTATTTAAAAACTANAANCCAAAA 

TAAACCTNTTAAMATATTATTCCTCTGCCTTGCANAAAAGGAAAGTGAAGAGGGGTNTT 

ANAAATCAGNGGGGGTTNCCACCAGNGTCNCTTGATAATTTT 

Sequence 1931 

CGTCCGGGGAAACTTTATATGGTTGGGGATAAGAATTGAATGCAAATTAATGAACTAGAT 

TTGCATTTATG GAGT TACCTCATCATGGAGTTACCTTGGTCTGTCCCACGTCAATTAATC 

TTGNTTCCrTATTTTCATAGACCATCCTCTAGMCAGTGTTTTCACACTGTGTATCATGA 

TCCTATMTGCAGGGGTGTCCAAGCTTTTGTCTTCCCTTGGCTACACTGGTNGAAGAATT 

GTCTTGGGCCACACATAAAATATACTAACACTAATGATAGCTGATAAGCTAAACAAACAA 

AAAMTCACAAAAMTCTCATAATGTTTTTTTGAGATTGTGCTGTTTTTTTAGTTGTG^ 

TTCAATGGAGAAATAC 

Sequence 1932 

CGTCCGGCGCGTTCGTGCGTCCTAGTTCCAGTACATGCGTGGAGGGTTTACGGCAGCGTG 

T TCTG ATTCnTTGCGGGACGGCGAGCGCATTTGTGCTTTGCCCGCCGCGGCCTANGAGGG 

CTTTTGAGGCCGCGTAGTCGGTGTTTTTGAACTGACTCTACAGCTTCTGGCAGGCCGTGC 

GGCGCCCTGACCCGGCCTCACCATGTTGGTGCTGTTTGAAACGTCTGTGGGTTACGCCAT 

CTTTAAGGTTCTAMTGAGMGAMCTTCMGAGGTTGATAGTTTATGGAMGAATTTGA 

AACTCCAGAGAAAGCAAACAAAATAGTAAAGCTAAAACATTTTGAGAAATTTCAGGATAC 

AGCAGAAGCATTAGCAGCATTCACAGCTCTGATGGAGGGCAAAATCAATAAGCAGCTGAA 

AAAAGTTCTGAAGAAAATAGTAAAAGAAGCCCATGAACCGCTGGCAGTAGCTGATGCTAA 

ACTAGGAGGGGTCATAAAGGAAA 

Sequence 1933 

AGGGAGCCGCCCCGCGTCCGCCCCGCGTCCGCGGACGCGTGGGCTAAAACCCATCAGGCA 

AGATCACCACGCATTGANATATTTTCATATCAAGATAAAGTCGCACATTTTCCACAATAC 

ATTGCTAAAATAMGAGGAGAAAGGCTTAGGAAGTTTTTTTGCAGAGAGTGCTGGTAAAG 

MTTGAGCMGTTTGCTATTGTATTGNAATGTTTCTCTCAGGTTTGNTCTTCCT 

GNNGGTATTCCATGAATAATTGAGATCAGCCCTATGTAAGGTAAGATCATAATATGGGGG 

ACAAATGG 

Sequence 1934 

GC GGAC GCGTGGGCTCCATCTGAGCTCTTGGGTGACCAGGGTGCATTGTCAATGAGGGTA 

ATATTTTGAAAGACATCTTTATTATGAGCAGTAGGTCTCAACAGTGGGCTTAAAATGTGC 

AGTAAATCATGCTGTAAACAGATGTGTTGTCATCCAGGTTTTGTGCCATGTCTAGAGCAC 

AGGCTGAGTAGATTTAGCATAATTCTGMGGACCCCAGGATTTTCAGAATGATAAATGTG 

CATTCGCTTCCACTTACAGTCACCAGCTGCATTAACCCCTAACAAGAATCAGCCTGTCCT 

TTGTAGCTTTGGAGGCAGGCATGMCTTCTCCTAGATGGCATCTTCCAAGAGGGCTATTT 

TTGTCTACATTGAAATTCTGCTTAGTGTAGCCACCTGCTTCAATGATCCTA 

Sequence 1935 

TNGACCACGCGTCCGGGCGCCNTCTCCCAGCAATCAAGTCTTGCTCCCTGGCCTGCCCTC 
CGGCACCCTGGAGCCCCGCTTTCCACACAGCCTTGCTCCTCCCGGCCTGGCTTGCTGCCT 
GCAGGCCTCTCAGGGTCCTTCACCGTCTCTCCAGTCTCGGATCCGGACCCGTGGATCTCA 
GCCTCAGACCCTCCTCTGGCCCCGGCCTTGCCCTCGGGCACGGCCCCTTTCCTCTTCAGC 
CCTGGGGTCCTGCTCCCCGAGCCAGAATATTGTCCTCCTTGGAGGTCCCCAAAGAAGGAG 
TCTCCCAAGATCTCCCAACGTTGGAGGGGAAGTCCAAGCCCAGGGGGAACTTGACATACC 
ACCAGGTACATGCCCCCAGAGCCGAGAAGGATNCGGGGGGCAAGACCCCCANGCCGTAGG 
GNGTCCTCCCTTGGGTCCTCCTGGGCCATCTCTGTGGGGAAGGGACAANACTCGCAACAA

NCCACATTCTAANGATGANGGCCCTTTTNCCCT 

Sequence 1936 

CCNCGCGTCCGGAAAATATCCNAGGTTGTACGCAGCAGTGGAAGTTGCTCTCAAGGGAGT

GGTATTTTACACTATGCTCATGGCGACAGTCAGCAAACTCACCTGTTGAAGCAAGGAAGA 

AGCTCCATGGGCACTGGTCTCAGTGGTGGGAAACGTCCTAGTCAGGAAGAGGACACACAG 

AGTATTGGTCCTAAAGNCCAGAGACAGAGCACTAATTAGGTAAATATTTTAGAGCTGTAT 

TTCTTG CTTTA GAAGAGTATATAATTMCATAAATTAAGATAATTTCAAAAATGGAGCAA 

ATCTCTATTTTCAMCCAGAAAATCT^ 

CAGTTMTTTrTGGTCMGAGACATGCATCTGTACTGGAGAMTTGTTGCACCAAGTTTT 

ATATTCATCTGAACCAATGC 

Sequence 1937 

CCCCGCGTCCGCCCCTTTNCTCCCTGAGGACTCCACAGAAGATGGTATATTATGGGAAAC 

CTTCTTGTAAAAACAACTATGAGAATTATATTGACATAGTGAAATATGTGTTCAGCGCTT 

ACAAGAGAGAGTCCCCTCTCATCGTCAACACTATGGGATGGGTTTCAGACCAGGGGCTCC 

TGCTTCTCATTGATCTGATCCGATTGCTGTCTCCCAGCCACGTGGTTCAGTTCCGCTCTG 

ACCACAGTAAATATATGCCAGACCTTACCCCGCAGTATGTAGATGACATGGATGGCTTGT 

ACACAAAAAGCAAGACCAAGATGAGAAATCGACGTTTCAGACTCGCAGCATTTGCAGATG 

CTTTGGAATTTGCTGATGAAGAAAAAGAGAGTCCAGTTGAGTTCACTGGACATAAACTGA 

TAGGTGTTTATACMG ACTTTG CATTCAGAATAACTCCAAGAAATAGGTAACTAACCCTC 

ATTTGGCTGAAGAATTATTTTTCCTTCCGTCGAAAMGACCTGCCATTCTCA 

Sequence 1938 

GTCGACCCACGCGTCCGCGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGCTCTCGC 
ACTCTGTTCTTCCGCCGCTCCGCCGTCGCGTTTCTCTGCCGGTCGCAATGGNAGAAGAGA 
TCGCCGCGCTGGTCATTGACAATGGCTCC 
Sequence 1939 

ACCACGCGTCCGGGCGCCAGGCTAGGGCGGCCTGGCCACTGAGCCGGGGTGCAGTGGCAG 

CGGGAGAGTACCTGGCGATGGCGATATGAGCGGTGCGGGGGTGGCGGCTGGGACGCGGCC 

CCCCAGCTCGCCGACCCCGGGCTCTCGGCGCCGGCGCCAGCGCCCCTCTGTGGGCGTCCA 

GTCCTTGAGGCCGCAGAGCCCGCAGCTCAGGCAGAGCGACCCGCAGAAACGGAACCTGGA 

CCTGGAGAAAAGCCTGCAGTTCCTGCAGCAGCAGCACTCGGAGATGCTGGCCAAGCTCCA 

TGAGGAGATCGAGCATCTGAAGCGGGAAAACAAGGGTGAGCCGGCGCGGGGCCCTAGGCC 

GGCCCTGCCTCCCCAGGCACACTCAACACTGCCGCTCCCGCAGCACAGAAACACAGCCAT 

TCAACTCCAGCACACGCCTGGGCTCAGGGGGAACACAGGACGATCTCCATTACAAGCTCA 

TAATGAATCAGACATCACAGAAGAA 

Sequence 1940 

CGCGTCCGTGAAGGCCAGACCGAGAGGTGCCAGAAGAGAACAAACCTCCATCCAGACACG 

CGGGCGGAAAGGCTCCAGGGGTCCAGGGCCAGATGGCGCCGCCTCTGCCCGACTCAGAAA 

GAGAGAAACAAGAGCCGGAGCAGGGAGAGGTTGGGAAGAGGCCTNGACAGGCCCCANGCC 

TTTGGAGGAGGCGGGTGATCTTCCTGAAGATCCCCAGAAAGTTCCAGAAGCAGATGGTCA 

GCCAGCTGTCCAGCCTGCAAAGGAGGACCTGGGGCCAGGAGACAGGGGCCTGCATCCTCG 

GCCCCAGGCAGTGCTGTCTGAGCAGCANAACGGCTTGGCGGTGGGTGGAGGGGAAAAAGG 

CCAAGGGGGGACCGCCGCCAGGCAACTCCGCCGGNGACACAGGGCAGCCCGCANAGGACA 

GCNACCACGGTGGGGAAGCCTTCCCTCCAGCGGAGAAGCCGGCTTCAGG 

Sequence 1941 

CCGCGTCCGCAGAAACATATGTGTAGTGTGCTGCAGCATAAGATGGAAGAACTTAAAGAA 

GGCCTGCGGCAAAGAGATGAGCTTATTGAGAAACATGGCTTAGTTATAATCCCCGATGGC 

ACTCCCAATGGTGATGTCAGTCATGAACCAGTGGCTGGAGCCATCACTGNTGTGTCTCAA 

GGAAGCTGCTCAGGTCTTGGAGTCAGCAGGAGAAGGGCCATTAGATGTAAGGCTACGAAA 

ACTTGCTGGAGAGAAGGAAGAACTACTGTCACAGATTAGAAAACTGAAGCTTCAGTTAGA 

GGAGGAACGACAGAAATGCTCCAGGAATGATGGCACAGTGGGTGACCTGGCAGGACTGCA 
GAATGGCTCAGACTTGCAGTTCATCGAAATGCAGAGAGATGCCAATAGACAAATTAGCCG

AATACCAATTT 

Sequence 1942 

CCGCGTCCGCTCGCCTGCCCCGGTGCACCCAGTCCGCTCACCCAGCCCAGTCCGTCCGGT 

CCTCACCGCCTGCCGGCCGGCCCACCCCCCACCCGCAGCCATGGACGCCATCAAGAAGAA 

GATGCAGATGCTGAAGCTGGACAAGGAGAACGCCATCGACCGCGCCNAGNNAGGGCCGAA 

TCCGACANGAAGCAANGCNTGAGGACCGCTTGCAAGCANGCTGGAGGAGGAGCAGCAGGC 

CCTCCAGAAGAAGCTGAAGGGGACAGAGGATGAGGTGGAAAAGTATTCTGAATCCGTGAA 

GGAGGCCCAGGAGAAACTGGAGCAGGCCGAGAAGAAGGCCACTGATGCTGAGGCAGATGT 

GGCCTCCCTTNACCGCCGCATTAGCTGGTTTGAGGAGGAGCTGGACCGGGCCCAAGGANC 

GCCTGGCTACAAGNCCTGCANAAACTTGTANGANGCCNNNNAANGCCGGCCTNATNATTN 

GCCNNGAAGAGGGAATTNAAANGTTNTTNNAAAAACC 

Sequence 1943 

GTCCGCTTAGTTTCTGCATTATNAGTNAGCATAAATAATAAATCCAGAAAACGTGCTGTA 

TTTGTTGTTGTTTCCTCCATGGGCTTTCCCGCCATCTAATTTGATATAGACTTCATCTCC 

CGGCTCCAAATGAAGAACCACACTGTTACTGGCATAGTCGTAATTCTGATCAGCATCTTG 

GGCAATTGCACTAGCACGCACCTACGTGAAACAGACAAGAATTAGGATGCGTAAATGAGA

ATTCTCAAGTTTTCTTTTTTGCCATTCATGTAGCATCACMGCTGACTTGCTGCCATAGT 

ACAGAATTTAGCATAGCAAAG 

Sequence 1944 

GCGTCCGGCTGCGGGCGGTCGGGGCTCCGGGCCGGGGCGGCGGCGCCATCTTGTGCCCGG 

GGCCGGTGGGGAGGCCGGGGAGGGGGCCCCGGGGGGCGCAGGGGACTACGGGAACGGCCT 

GGAGTCTGAGGAACTGGAGCCTGAGGAGCCCCCGGCAGCCAAGAGGAGGAGGAGGAGCCG 

GGACTGGTCGAGGGTGACCCGGGGGACGGCGCCATTGAGGACCCGGAGCTGGAAGCTATC 

AAAGCTCGAGTCAGGGAGATGGAGGAAGAAGCTGAGAAGCTAAAGGAGCTACAGAACGAG 

GTAGAGAAGCAGATGAATATGAGTCCACCTCCAGGCAATGCTGGCCCGGTGATCATGTCC 

ATTGAGGAGAAGATGGAGGCTGATGCCCGTTCCATCTATGTTGGCAATGTGGACTATGGT 

GCAACAGCAGAAGAGCTGGA 

Sequence 1945 

CCCACGCGTCCGGCAAACCGGGAAAGGAGAGGATCCCGGAGCCGGTGAGAATTCTCTGTT

TTTTCTCTACCATCCTTTCCAGGCCTTTTCCTCACCTAATGAGTCGTAGAGACGAGGGCC 

CAAAAAGTCTGTAAAGGTGGCTGGTGAAAGATTAAGTGNTCCAAGGGCCCTACATTCCNG 

GANGNGGTTCGGGATAAAAGAGAACTAGTCNTGGGAACAATGTAAGTGGGAACNTNAAGG 

NANNGGNAAAGCGGCCNATAAAGGNGNNCGGAGGNCCCAATGGNANTAAAGCGGACCCTG 

TGTAGGTATAGAGTTGAGTCAAGTGGAGTCACTGCCTCTTGTCCCTCTTGGTCAGCGTGA 

TGGCCAGAGGCCTGGGGGCCCCCCACTGGGTGGGCCGTGGGGACTGCTGANCCTGGGCCN 

ACTTGG 

Sequence 1946 

ACGCGTCCGGCCGGGAGTGGTGGTGGGCACCTGTAATCCCAGTTACTCGGGAGGCTGAGG 

CAAGAGAATCTCTTGAGCTCAGGAGGCAGAGGTTGCAGTGAGCTGAGATTGCGCCACTGC 

ACTCCAGCCTGGGTGACAGAGGGAGACTCCGTCCCAAAAAAAAGAAAAAGAGAAACAGCT 

GTCACCTCCCGCAGGACCCAAATCCTCTCTCTGAGCACCGTCATCCACCACATGGCTNGG 

CCTNGNTTCCMGANCNAGTCNANCCTTTNNNGNCTTANTTNNAGGTNGANNCCGCNNNT 

TTCNNCCCAAAGGAGACAGCCCCTGCTCCTAGATGCCCTTGGCCTCCGCAGTGCAGCCCC 

CAGGTGTCCTGACTGAAGCANAGGCCNTAGCCCCAT 

Sequence 1947 

NCGCGTCCGAAGTGGATGAAAATTGGTACCATGGGGAAGTCAATGGAATCCATGGCTT7T 

TCCCCACCAACTTTGTGCAGATTATTAAACCGTTACCTCAGCCCCCACCTCAGTGCAAAG 

CACTTTATGACTTTGAAGTGAAAGACAAGGAAGCAGACAAAGATTGCCTTCCATTTGCAA 

AGGATGATGTTCTGACTGTGATCCGAAGAGTGGATGAAAACTGGGCTGAAGGAATGCTGG 

CAGACAAAATAGGMTATTTCCMTTTCATATGTTGAGTTTAACTCGGCTGCTAAGCAGC 
TGATAGAATGGGATAAGCCTCCTGTGCCAGGAGTTGATGCTGGAGAATGTTCCTCGGCAG
CAGCCCAGAGCAGCACTGCCCCAAAGCACTCCGACACCAAGAAGAACACCA 
Sequence 1948 

CGCGTCCGAGTATTTGAAGTGATGCTGGCTCAGACCGCTCCCACTATGCAAAATGTAACC 
CATACATGGATTCTCCACAATCAATAGGT7TTGCTGAACTGCTGAGCCTTGGCAGTGGGC 
GCCTTTGAGGCTTAGMGTGCCNNMGNNTCTNNCA^ 

TGNCGCACTNTTANCAGTCANCAAANAAAATCCNGGGGNGNTNNTNANCCCCNAAGGCNC 

NNGNTNCCACNGTTCAGTGAAACTTGCTTCAGAGGCAGAAAGAAGCAGGTTTCCAAGCAA 

CAATCAGTGCTCCACACATGCATGCATATGCGCTAGAACTTCTATTTGATCAGTTGCATG 

AAGGAGCTAAAGCTCTTGATGTAGGATCTGGGAAGTGGAATC 

Sequence 1949 

CCACGCGTCCGGAGAGAATGGGCCGCGCGCGCCCGCGGCCGGGGAAAGCCTGTCGGGAAC 

CCGGGAGAGCCTGGCCCAGGGCCCCGACGCCGCAACCACCGACGAACTCAGCTCTCTCGG 

GTCTGACTCGGAGGCCMCGGCTTCGCCGAGCGCCGCATCGACAAGTTCGGCTTCATCGT 

GGGCTCGCAGGGCGCCGAGGGCGCGCTGGAGGAAGTACCCCTGGAGGTGCTGAGGCAGAG 

GGAGTCCAAGTGGCTGGACATGCTCAACAACTGGGACAAATGGATGGCCAAGAAGCACAA 

AAAGATTCGTCTGCGGTGCCAAAAGGGCATCCCGCCTTCTCTGCGGGGCCGTGCTTGGCA 

GTACCTGTCAGGAGGCAAGGTGAAGTTACAGCAGAACCCTGGAAAGTTTGACGAGCTGGA 

CATGTCCCCTGGGGACCCCAAGTGNTTGGACGTGATGGAACCGTGACCTGCNCCGGCAGT 

TNCCATTCATGAAGAGTTTTTGTGTCCGGGGGGG 

Sequence 1950 

NGCGTCCGGCTTTACAACGGGCAAATACTGGAAACCATCGGAGGCAAACAGCTCANAGTC 

TTCGTATATCGTACAGCTGTCTGCATTGAAAATTCATGCATGGNGAAAGGGAGTAAGCAA 

GGGAGAAACGGTGCGATTCACATATACCGCGAGATCATCAAGCCAGCNNANAAATCX)CTC 

CATNGAAAAGTTAAAACAAGGATAAGCGCTTTAGCACCTTTCTCAGCCTNCTTGAAGCTT 

GCANAACTTGAAAAGAAGCTTCCTGACACAACNCTGGAGACTGGGACATTATTTGTGCCA

ACCAAATNGATGCTTTTAMGGGGAATGACCTAGTTGMNNAAAAAAGGAAATTTCTTGA 

TACNGGGACCAMAAMTGGCTCNTTTCAAAAMCAATTCCATTTCCTTTATTTNACCCC 

TGGACCACCCCMGGGMGTTTTTTTCMTTTGGGGAAAAMGNGGAATTTTTGG 

Sequence 1951 

CAATNCTTGAACCAGGAAGGCGGAGGTTGTCGGGGGCTGAGATTGCACCACTGTGCTCCA 

GCCTGGGCGACAGAGGGAGTCCCTTTCTCAAAAAAATAATAATAAAAATAAAGATGGCAG 

TAGGMGGTTTCAGCTTGAGATGCTGTCTTTTCTTCTGTTTTTATGCATAAATACAACGA 

AGACGGGAGAGGAGATGGAAAGCAAAGATGATTAAGTGAAATAATTGTGGGAAACAATAG 

AGGGATAGACTTTGCTTATAGGGGATGTGGACAGAGCAGAAAAATGGGAGGAATGGGGAG 

GATTCAGTTAGAGAAGGAAGAAACCGGTCCAAGGGGCTGGGGCTTTAGGCCCTGGGGCCT 

CCAGTGCCCGTATAAGGCTGTGGCAGMGCCCTGCCCATTTCCGTTCCTTTCACTCCCTA 

TNTTCACCCTTACACCTCCCAAAAAC 

Sequence 1952 

CCGCGTCCGGGTTTTTTGGCCTTATTTTCCT^ 

GATTTCATTCATTGAAGMTCAATACATATTTTGTTTCAAAATGTTTGAAACAAAAGACA 

TAGATGGTAGACTTTTATTAAAACATATATGGATGTGGAAAGCACATATATTAATGCAGT

CATCCCTTTTCAGGTGGGAAGAGAGCAMCCAGTTGATTTTTTAATTCATCCTTAGTACA 

CAGAGMTATCTTTTNCTCAAGGMTATACCCTGGTTGGAGCTTTAAAAMGMTGGTTT 

TGGG MCCATT TCATTTTTCCCMNMGGTTGCTATTCTTGGGGTAANTGGGGNATACCN 

GANTGNTTTTTCCNGGGGGGTTTTGGTGGTGGGGGGTAAAATTGGGGN I iTTTTTTTl 

Sequence 1953 

GCGCCGAGGGGTGTAATGCATTNGCAGCAAGAGCTATGAGAAATAACTTTAGACATTATT 
TCATTGMCCTTCCCMCTGAMTTATTTTATGATGTTATAACATGGATAGTAACTCAAG 
TAGCAATMGTTACACAGTTGTGCCATTTGTGCTTCTTTCTATAAAACCATCACTCACGT 
TTTACAGCTCCTGGTATTATTGCCTGCACATTCTTGGTATCTTAGTATTATTGTTGTTGC 
CAGT6AAAAAAACTCAAAGAAGAAAGAATACACATGAAAACATTCAGCTCTCACAATCCA

MMGTTTGATGMGGAGAAMTTCTTTGGGACAGAACAGTTTTTTTACAACAAACAATG 

TTTGCAATCAGAATCAAGAAATAGCCTCGAGACATTCATCACTAAAGCAGTGATCGGGAA 

GGCTCTGAGGGCTGTTTTTTTTTTTGATGTTMCAGAAACCAATCTTAGCACCTTTTCAA 

GGGGTTTGAGTTT 

Sequence 1954 

TCGCCCCGCGTCCGGTTAAAACTGCCTCTTTAGATGTGGATGCCTTAATGCTGTAACACA 

TTTGAAAACATTGGCAATACTTAAGTTGCTGCCATGATTACAGATGGAATTATTGGCTAC 

CAAAGAGACGCAATTGATGATGAGAAGCATGATTCTTGCTTCCATATAACCAAAGTTAAT 

CTTMTTGCMTTTGACTCCGTTTCCTTGGTAGGGATAGACTTTC7TCAGATTCCAAGTG 

CTCTCTTAAATGGCAAATTAAGTTAAAGAATACTACTGCTCCATTCCCCTCACTTATTCT 

C CAGT TMTTGCTTGTCAGTTCCATTTCAAGAAAGCAGTGGATGTTCCAGGTTTGATTCA 

GTTTTCCTGTGCACACTATTGCCAAAI 1 1 1 1 1 1 1 1 AGCAAAGATTCTGCACTGGMCGTA 

GACAGTTGGAMCAGTACTACCTACTAGAGGGTATGGGGTTTCTCTTTCTCCCCGCTTTC 

ACCTCTTTCTTTTCCAATTC 

Sequence 1955 

CCNCGCGTCCGGCTCTGCCCAGTCACCCGGTCTCCTCCGGCTTCCCTCCGGCCAACAGCG 

CGCTCAGGCTCGCCTCAGGCCCCTCCAACGGAACAGGAGTCGAGGGGCAGTGAGGCCGGG 

ATGCGTGCGAGCGCGGGGCGCGGCTGGCGCTGGGCCGTGGGGGCGGGGCGGCGTGCGTGC 

CAGCGGCCGTGCGATTCTGTGAGGCCTGCTCTGCGCCGGCGGGGAAGCGCGGGCGACGCT 

ACGCTGGACGTCCTTACCTGCCGGAGGAGAGAAAGTGCCGTCAGCTGTAGGGG 

Sequence 1956 

GCGTCCGCGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGCGGAC 

GCGTGGGCGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGAGAATGTGCTATACCCC 

AGAAATTCCATCGATCTGTCATGGGCCCCAAAGGTTCCAGAATCCAGCAGATTACTCGGG 

ATTTCAGTGTTCAAATTAAATTCCCAGACAGAGAGGAGAACGCAGTTCACAGTACAGAGC 

CAGTTGTCCAGGAGAATGGGGACGAAGCTGGGGAGGGGAGAGAGGCTAAAGATTGTGACC 

CCGGCTCTNCAAGGAGGTGTGACATCATCATCATCTCTGGCCGGAAAGAAAAGTGTGAGG 

Sequence 1957 

CCACGCGTCCGCTGGAGGCACTGGACGAGATGCTGACGCACGACATCGCCAAGCTCATGC 

CCCTGCTGCGGCAGGAGGAGCTGGAGAGCACCGAGGTGGGCGTGCAGGGGGGCGCTTTTG 

AGGGCACCCACATGGGCCCGTTTGTGGAGCGGGGACCTGACGAGGCCATGGAGGACGGCG 

AGGAGGGCTCGGACGACGAGGCCGAGTGGGTGGTGACCAAGGACAAGTCCAAATACGACG 

AGATCTTCTACAACCTGGCGCCTGCCGACGGCAAGCTGAGCGGCTCCAAGGCCAAGACCT 

GGATGGTGGGGACCAAGCTCCCCAACTCAGTGCTGGGGCGCATCTGGAAGCTCAGCGATG 

TGGACCGCGACGGCATGCTGGATGATGAAGAGTTCGCGCTGGCCAGCCACCTCATCGAGG 

CCAAGCTGGAAGGCCACGGGCTGCCCGCCAACCTGCCCCGTNGCCTGGTGCCACCCTCCA 

AGCGACGCCACAAGGGCTCCGNCGAGTGAGCCGGGCCCCCCTT 

Sequence 1958 

ACGCGTCCGCCTGGCTAACATGGCCGAAAGGTCGTATTCTCCGGGGGAGGACGGAGGCCC 

GGAGAGGAGGGGGTGGAGTGCCTGGTTTTCCAGTCAGGCGGCCGGAGGGCAGCCCTCAAG 

AACGGCCCTGACCGCCCGCGGGGTGAGGGGCCCTTTCTGGGCAGGACCCGCCCCTTGGTC 

CCGCAGAGCCTTGGTACTTGGACCTGAACCTTGCTCCGAGAGGGAGTCCTCGCGGACGTC 

AGCCAAGATTCCAGAATGACTACTATCTTGACTTACCCCTTTAAAAATCTTCCCACTGCA 

TCAAAATGGGCCCTCAGATTTTCCATAAGACCTCTGAGCTGTTCCTCCCAGCTACGAGCT 

GCCCCAGCTGTCCAGACCAAAACGAAGAAGACGTTAGCCAAACCCAATATAAGGAATGTT 

GTGGTGGTGGATGGTGTTCGCACTCCAI I I 1 1 GCTGTCTGGCACTTCATATAAAGACCTG 

ATGCCACATGATT 

Sequence 1959 

CCACGCGTCCGGGGGACATGGTGGTGAGCAGGACAGAATTCCTGTCCTCATGAGTCTTAC 
ATTCTAGAAGGAAGGAGCAGATAAAATCTAAATAAGGTTATGAGATTGTGACGAAGCGTC
CGATTACACCTCAACAAGAAGGCAACTGACAAACAGCCTTATAGCAAGCTCCCAGGTGTC 
TCTCTTCTGAAACCACTGAAAGGGGTAGATCCTAACTTAATCAACAACCTGGAAACATTC 
TTTGMTTGGATTATCCCAMCCTTTATTTTTCTTCGGTAGCACCAATGAAAAGA 
Sequence 1960

TCCGGTGCAGCCGGGCTAGCTTCTCCTGCATCTCCCGAACGTGCTCCAGCTGCTCAAAGG 

AGCNTTCCTTCCCTGGAACGAANCCCCTGNCTGAACCCTNNNAAAGAGNCCTGGAACCCT 

GGAACCCTGGAACCATGGNCCCAGCCNTNCCCCANGNACTGGGCCCTACAGCCCTATCCA 

CACACCCGCTGAAACCAGCCCCCAGGACTCACCGAAGGCCNGGGGGCGGCCAGAGTGGAA 

ATCATTGAGCAGGTTCAGGAGTCCCCCCTCCATCTNATAGACATCAGTCACCTCGGTCAG 

GAAGGAGTGCTGCAGGGGCGTGCCTGCAGAGCX^GCTTCCACCTCCAGCCAGGACT 

Sequence 1961 

NGNCGCGGTTCTCGTCTCCTCCCGGCCGGCGGAGCGAGTGGAGGCTGCAGCCCAGCTCGT 

CTCGGCGCCCGCGTCGCCGTCNCGAAGCCCCCCGCCCCGCTTCCGCCGCGTCGGAATGAG 

CTCCCGGAAAGTGCTGGCCATTCAGGCCCNAAAGCGGAGGCCGAAAAGAGAGAAACATCC 

GAAAAAAAATCAAGCAGAAGATTGAGCTGCTGATGTCAGTTAACTCTGAGAAGTCGTCCT 

CTTCAGAAAGGCCGGAGCCTCAACAGAAAGCTCCTTTAGTTCCTCCTCCTCCACCGCCAC 

CACCACCACCACCGCCACCTTTGCCAGACCCCACACCCCCGGAGCC 

Sequence 1962 

CGCCCACGCGTCCNNCCCGCGTCCGGGAGAAGATGGAAGCAGTGCCCGACGTAGAGCGCA 

AGGAGGACAAGCCCGAGGGGCAGTCACCTGTGAAGGCTGAGTGGCCCAGCGAAACCCCGG 

TGCTGTGCCAGCAGTGTGGCGGCAAGCCTGGCGTCACCTTCACCAGTGCCAAGGGCGAGG 

TCTTCTCCGTACTGGAGTTTGCACCCTCAMTCATTCTTTTMGAAMTTGAGTTCCAGC 

CTCCAGMGCCAAGAAGTTCTTCAGCACAGTGCGGAAGGAGATGGCGCTGCTGGCTACCT 

CACTGCCTGAGGGCATCATGGTCAAGACTTTTGAAGATAGAATGGACCTCTTCTCAGCTC 

T 

Sequence 1963 

TCCGCGGGNGCGTGGGCGGCCACTCATTTGCATTATCTTAAATCACAAATAATTACTTAA 

TTNGCTGGAGTGTGTGCTTTGCMCTTTTATACCAGAGTAAAATTTGTATTTAAA^ 

AAATAAGAATGCCCATCACTAGGAGAAACACTCCTCACAGAAAACACACACACACACACA 

CACACACAATTTAAAMCTGAGTAMTTTAMTGTATGAAAGGCNCCACAMTTGATTTC 

MCAMTAMTTTCATTTCTCTAGCTACrTATGTCT 

TTATTGCCrrrCATTTGCCTATTTAGACTGATGTAGTTTGATATGATGAAGT 

Sequence 1964 

CTNTAGGGAGNCGACCCACGCGTCCGCCCCGCGTCCGGTTAGTTCTACCTGGTGCCCATG 

TTCTGATTGTGTGTGGGATTGCATGGTGTCCTGATTGCATCTAGGTGGAGCGGATGGAAT 

GTGCTGGGCCACTGTTGGGTGGAGAGCAGCACATTCTTACAGAGGAGATGGAGCGTTATG 

AGCATAGTATGTGGATAGGTATCTTCACCTGCCCGCCCCTGAGTCAGCCTCCTTGACTTG 

ATAGCTTGAAGMTCCTTTTCCACTGAAATAGAGGATAATTAATTGACACATCTGAAATC 

CCCAATCAATCAATCAAGAGAAAGGTAGAACTAAAAACTCCTTAACTTACTGTTGCTTAC 

ACCCCTGAAAGTCTGTTTTT 

Sequence 1965 

CGTCCGGCGCCCTCCAGCCCCGTCCGGGAGTCCCCGGCCCGCTGCGGTGCCGGGAGTACC 

TCCAACCCCCTGCNCCCCNNANGGAGGCCNAGGGGCTTAGCCACCAGGGCTCGGAAGTGG 

GGGCCGAATCCGGTGCNNGNCNNNNNNNAGGNNAGCANAGCCGGAGTTGGGGAGACTGGT 

TGCTGAAAAGCCAGGAGTCAAAATGACTGAGCGCTTTGACTGCCACCATTGCAACGAATC 

TCTCTTTGGCAAGAAGTACATCCTGCGGGAGGAGAGCCCCTACTGCGTGGTGTGCTTTGA 

GACCCTGTTCGCCAACACCTGCGAGGAGTGTGGGAAGCC 

Sequence 1966 

CCCACGCGTCCGGTGCACTCAACAGAACGGCCCTTCAAATGTGAGAAATGTGAGGCAGCT 
TTCGCCACGAAGGATCGGCTGCGGGCGCACACAGNACGACACGAGGAGAAAGTGCCATGT 
CACGTGTGTGQCAAGATGCTGAGCTCGGCTTATATNNNNGACCACATGAAGGTGCACAGC
CAGGGTCCTCACCATGTCTGTGAGCTCTGCAACAAAGGCTTCACCACGGCAGCATACCTG 
CGCATCCACGCGGTGAAGGACCACGGGCTCCAGGCCCCGCGGGCTGACCGCATCCTGTGC 
AAGCTGTGCAGCGTGCACTGCAAGACCCCTGCCCAGCTGGCCGGCCACATGCAGA 
Sequence 1967 

CGNCCCACGCGTCCGGCGGTGCCGCGGGGATGGCGGGAGCCGGAGCTGGAGCCGGAGCTC 

GCGGCGGAGCGGGGGCGGGGGTCNAGGCTCGAGCTCGCGATCCACCGCCCGCGCACCGCG 

CACATCCTCGCCACCCTCGGCCTGCGGCTCAGCCCTNNNNCCNNNNNANNNATGGCNNGN 

TCAGGGGGCCTGGGGTCTGGGGACAACGCCCCGACCACTGAGGCTCTTTTCGTGGCACTG 

GGCGCGGGCGTGACGGCGCTCAGCCATCCCCTGCTCTACGTGAAGCTGCTCATCCAGGTG 

GGTCATGAGCCGATGCCCCCCACCCTTGGGACCAATGTGCTGGGGAGGAAGGTCCTCTAT 

CTG 

Sequence 1968 

GCGTCCGGGCGTGTAAACCAGCCGGAGCGGCGCGGCAGCGGCAAGGACCGCCGTGGCGCC 

TAGAGTAGCCGACCCGGGGGGAGCGCGGGGCGACGCTGGCTGCAGGGACCXIGGTGACAGC 

GTGAGAGGTTCGCAGAGTACTAGGTTTTGACAAGCTTGCATCATGCGTGAGTATAAGCTA 

GTCGTT CTTGGCTCAGGAGGCGTTGGAMGTCTGCTTTGACTGTACMTTTGTTCAAGGA 

ATTTTTGTAGAAAAATACGATCCTACCGATAGMGATTGTTATAGAAAGCAAGTTGAAGT 

AGATGCACAACAGTGTATGCTTGAAATCTTGGATACTGCAGGAACGGAGCAATTTACAGC 

AATGAGGGGATTTATACATGAAAAATGGACAGGGATTTGCAmG^ATTTCATTNCAN 

GCCCANNCCCAATTTTACGTTTAAAAMCCTGNNNNAMAAMTTTTTTTGGNTAAAAAN 

CCNTNTTGTNTCCCCAAMNTTTTTTTGGGGNANAAAAANNGGCCCTCCNAAAAAAAAAA 

Sequence 1969 

CCCACGCGTCCGCACGCCAGTGCCTCCCTTTACCTACTAATGAGGCAAAACTTTGAGATT 

GGGAATAACTTTGCCAGGGTTAAAATGCAGGTAACAATGTCACTATCCTCCTTGGTGGGC

ACATCTNANMTTTTMTGMGAATTCTTMGACNGTCTTTCTAAANNACTATTTNGTAC 

ATTATGCTTGAAGAANATNTGNGAATTGAGGGAAACA 

Sequence 1970 

GCGTCCGGTTTTCCAACTGCAGCT1 1 1 1 AATGGTTAACCTTCATCTAA I t I 1 1 1 1 ICTCC 

CACTGGTTTATAGATCCTCTGACTTGTGTGTGTTTATAGCTTTTGTTTCGCGGGGTTGTG 

GTGAGGAAGGGGTGATGGCATGCGGAGTTCTTTATCTTCAGTGAGAANNGGCCCTGCCCG 

CCTGAGAGCCAGCTTNCGCGTTGGAGGCACCGNGTTCAGAGAGCTGCTGAGCGCCACCCT 

CTACCCNGCTGACAGACAACACAGACCTGTGCCGMGGCTMNTTGNGGCTTTTTACGAC 

CCTACCCCACCCCCTGTTTTCAGGGGTT 

Sequence 1971 

CGTCCGGTGAGATTCTCCGTAATGGGCGGGGACAGAGTGCCCTGCAGGAGATTCTGGGCA 

AGGTTATCCAGGATGTGCTAGAAGACAAAGTGCTCAGCGTCCACACAGACCCTGTCCACC 

TCTATAAGAACTGGATCAACCAGACTGAGGCCCAGACAGGGCAAGCGCAGCCATCTCCCA 

TATGATGTCACCCCGGAGCAGGCCTTGAGCCACCCCGAGGTCCAGAGACGACTGGACATC 

GCCCTACGCAACCTCCTCGCCATGACTGATAAGTTCCTTTTAGCCATCACCTCATCTGTG 

GACCAAATTCCGTATGGGATGCGATATGTGGCCAAAGTCCTGAAGGCAACTCTGGCAGAG 

AAATTCCCTGACGCCACAGACAGCGAGGTCTATAAGGGTGGTCGGGAACCTTCCTGTACT 

ACCGCTTCCTGAACCACTTGTGGTGGCTCCTGACGCCTTCACATTGTGGCCATGGCAANC 

TGGTGGAGCCCTGGCTGCCCCCCAGCGCCATGCCCTGGG 

Sequence 1972 

GAGGGGAGGGAAATTTATTTCTCTGCTTTTCTATTATACAAGTTGTTTACAGAAACTGC^ 
MTTAAAAAATTACACTGGCATTTGCAGTCCTTAAMTAAATTAAAAGTTCTCMCTTTT 
I I I I I I I I GCTAAACATTTTTTTMGTATGAG TCCT TTGTTTAAAAAGAAANAGATTANA 
ACAGAAMTATTTTCTATAAATAATACATGTATTTTGGTTTTAGTGCTCCCGCCC 
Sequence 1973 

GCCGACCCACGCGTCCGACATCCCCAGGCACATCCGGAAGGAGGAAGGTTCCTTTCAGTC 

CTGCTCCTTCTGCANGNCCAAGAAATTCACTACCATGATGGTCACACTCAACTGCCCTGA 

ACTACAGCCACCTACCAAGAAGAAGAGAGTCACACGTGTGAAGCAGTGTCGTTGCATATC 

CATCGATTTGGATTAAGCCAAATCCAGGTGCACCCAGCATGTCCTAGGAATGCAGCCNCA 

GGMGT(X;CAGACCTAAAACAACCAGATTCTTACTTGGCTTAAACCTAGAGGCCAGAAGA 

ACCCCCAGCTGCCTCCTGGCAGGAGCCTGCTTGTGCGTAGTTCGTGTGCATGAGTGTGGA 

TGGGTGCCTGTGGG 

Sequence 1974 

CGCGTCCGCCGAGAGAAGAGGTACGGTCGAAGCCCGGAGCCAGGCCGAGCGGGAGCTGAC 

CAGGCTTGACTCGGGTACAGAACGAGGCACCAGTCCCCTTGCGAACCGAAGGGCCTCGCA 

GTGGATGGAGGAGGCCCAGCCCTGAGGTCAACGCCAACCAGGCTAGCCTGGCACGGGGCC 

TACAGGGTGGGTAGGCGGGCGTGCCGCAGCCGTCCAGGGCCTTCCCTCAGGTCCCGGGCC 

GAGGGGCCTACGCTGCGGCCCGGCAACAAGGCCCGACTCGGCCCCTCGGGACCAGAGCCC 

CACCCGATCGGAAGCGGATC 

Sequence 1975 

GGCCTCAGGGGGNAGNNATCCTGCAAAGACNNACATGAGCCCANAGGGGAAANAGAGNCA 

CCTGNGAGTACNNGCCTTTGGGNNTGACCTTGGCTCTCAGCACAAGATATTTACAGCCTN 

TGAGCTTGATATTCTAGAATTGNTACAGAGATAGCTCTGGAAAAGAAATAGACTAGAAGG 

ATAAAGGGAAGGAATCATAGCTTATGAAGGTTTTTACTCTGCATCAGACCGNTTTCTAGN 

NCTATGACTTAACGTCCTATAGGCTGTAAGGTTCTCTGCGTGAACACTTCTTTNCTGGCC 

T(XTTTCTGCCC^ATTCCThnTTAMCTCAGTTGCTGAGTTTATTATNCCCTGTGCATCC 

CTGGGCATTGIvn'CATTCACATATGGAMCMTTCANGGAGGACCCTTTGCCTANTTTNCT 

TTATCCTCTTGMTTNTGGATGGGAAAAATNTTAANTNCTTTTGGCCGCCTGCANTNGGG 

GNAANTATTGGGCCTT 

Sequence 1976 

CGCCCC^CGTCCGCGTTGAACAGCTTTAAGAACAATGTGAATGAAATCTTAGCAACTTGG 

TTAGTMTCT GAAA AGTCTATTAATGTATACTTGAAATTCTGTTTGTATAAAAATGCATT 

TTCCCCTTTATTTTAACACTGTGTAAAAGAACATTATGCATGTGAGTGGTTTGAGAATTA 

AATGGTTTAATACTCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAANGGGGNGNCCNTTANNNTNNTTNAAAA 

AAAAACCCCCCNNNCCNCCCCCTGNACCCGNAANNAAAAAANNANNCCNTNGGGGGGGGA 

ANCCTNNTTNTTGNCCCTTMNAGGGGNCCCAAAAMNCNATTNCCNCCCNANTTTCNNN 

AAAAAANNCI 1 1 1 1 1 ICCCCNNNTTNNAAGNGGGGGGNNGGCCCANCCCCANAAGGGGTT 

TTTNAAAGGGGGGNAACCCCCNGGTCCCCGNCCCCAAAAAAAA 

Sequence 1977 

CGCCNCGCGTCCGGGCTGCCGTAACAATCGCCACAAACCTGGTGACTTAAATAACAAATA 

TTTATTTTCTCGTAGCTCTGGAAGCCGGAAGTCCAAAACCAATAACTTGGCAGGGCTGTG 

CTCCCTCTGGGAGATCTCGGGGAGCCTCCTTGCTACATGTTCCGCTTCTGGGGACTTCTG 

GTGGTGTTTCCCTGGAGGGCCAAGGCCTCCGGGCGGCCCGTGCATTCGGTGAACACCTCC 

AMGAGCTCTGAGTCACAMGCCTTGGGAMCTTTATTTTATTCTTTCTAGGACATTATC 

AGTAGTCCCGAGGAGACATCAATTACAAAACAAAAAAAAAAAAAAAAAAAA 

Sequence 1978 

cgtcccgacagtcagccgcatcttcttttgcgtcgccagccgagccacatcgctgaggac 

accatgggggaagggtgaagggtcgggagtcaacgggatttgggcggtatttgggcgccc 

tgggtcaccaaggggctggcttttaaacttctgggtaaaaggtgggatattggtttgcca 

atcaaatgacccccttcatttggaccctcaaactaccatgggtttacaatggttccnaat 

atgaattccacccaatgggcaaaattcccatgggcacccgtcaaagggcttgagaaacgg 

ggaaagcttggtcaatcaaatgggaaaattcccnatcaccatctttccaggagccgagna 

tccctccaaaatcaaagtgggggcgatgctgggcgccttgaggtaccgtcgttgggaagt 

ccacttggccgttctttcacccaaccattgggaagaaagggctgggggcctcattttgca 

GGGGGGGGGAGCCAA 
Sequence 1979

6AGTGTAGTTACTTCGGTCTTGCCTCACTGGGAGGCCACGTTGGTGACGATGCACACGAA 

GCCCCGGTACTTGTCCAGGTTAACCATGTGCCCGTCCGATGTCCTTGGCGGAAAACTCGT 

GCATGGAGCGCGCACAGCGCCAGTCGTCCNGGGACGCGCACGGGTCCTGGGACTTGGGAC 

ATTCGNGGATCTCCGGTCGCAAGCTGCGGGGACAAGGCTTCTTCCGCCGGCGCCGCGCCC 

TCCTGCGCCTGCACGCTGGGGCCTTCCGCCGCCGAGGGGCTCGCGGCCGCTAGACTA 

Sequence 1980 

CGCGTCCGATCGGAAGTGGCGCTCGTGCATTCAACTTGTTCCCGCTCATGGAACCCCTCT 

TTAAAAAGACGCAGGGCACCfGTGAGCGCAGGAGCGAGCCTAAGGCCACCCAGCGGCAGC 

GCCCGTGTCCTGGGCACTCAGCGTGCTGGGCAGAGCAGGTGCGATGGCCCCAGTCCTAGC 

AGCCCTCGCCCATGTCCTGTCaCCCTTACATGGCTCCCGGACTGTGCAGGGAGCCGATACG 

TTTGCTGATAGCAATACTGGAACCACCCGGGTGCGATGGCAGTGAGGAGACTGCCCAGTG 

CCTTTGGGGCTGTGCTTGCAATAAAGAAAGAATTTCCTGGAAAGGCAGTCTGCAAAAGAG 

GGMCCCGG TGACT CAGAAAGACAGGATGTTTTGGTAATTTACCCCNAAATGTGCCATCC 

ACCATAGTGCTTTTTCCTCTTGCCCTTCGGCTTGTTTGMTCTCACMTTATGGTATTTA 

ATTCTCAAAGAAATATGTATCTGTTAGCCCGNTTGGTGACACTTATACAGATGATTAA 

Sequence 1981 

CNCGCGTCCGGTCGAAGCGAGGACGTGGTGGGTCCTCTGGTGCGAAATTCCGGATTTCCT 

TGGGTCTTCCGGTAGGAGCTGTAATCAATTGTGCTGACAACACAGGAGCCAAAAACCTGT 

ATATCATCTCCGTGAAGGGGATCAAGGGGACGGCTGAACAGACTTCCCGCTGCTGGTGTG 

GGGTGACATGGTGATGGCCACAGTCAAGAAAGGCAAACCAGAGCTCAGAAAAAAGGTACA 

TCCA GCAGTGGTCATTCGACAACGMAGTCATACCGTAGAAMGATGGCGTGTTTCTTTA 

TTTTGAAGATAATGCANGAAGTCATAGGTGAACAATAAAGGCCGAGATGAAAAGGTTCTG 

GCCATTACAGGACCCAGTAGCAAAGGGAGTGTGCAGACTTGTGGGCCCCCGGATTGCATC 

CAATGCTGGCAAGCATTGCATGG ATTCTTCCAGTATATTTGTAAAAAAAAAAAAAAAAAN 

NAAA 

Sequence 1982 

GCGTCCGTGGTAACGATTGGCCCTAAGAAGCCCCTGCCTGACCCGTGAGCATTGTGGAAC 

CCAAAGATGAGATACTGCCCACCACCCCCATCTTCAGAACAGAAGGGTGGGGAAGCCAGA 

GCCCGNCTGCCATGCCCCAGCCAGTCCCCACAGCATAACAGGGTCTCCTTGGCAAGCTGT 

ATTCTGGAGTCTGGATGTTGCTCTCTMAGACCTTTMTAAMTTTTGTACACTGGACTT 

TAAAGTATTGNTCACAAGGGTTATGCAATTCCNNGNCANG 

Sequence 1983 

CCCCGCGTCCGTGACAATCGAGTAGTACTCCCGATTGAAGCCCCCATTCGTATAATAATT 

ACATCACMGGACGTCTTGGCACTTCATGAAGCCTGGTCCCCACATTAGGGNTTTAAAAA 

MCCAGGNATGGCMTTTCCCCGGGACCGGTCTAAMCCAAAACCCACTTTTCAACCGGC 

TTACCACGGACCCGGGGGGGTATACTTACGGGTCAAATGGCTTCTGGAAATCTGGTGGGA

GGCAMCCACAAGTTTTCATGCCCATCGTCCTMGAATTMTTCCCCTAAAMTCTTTGA 

AATAGGGCCCGTATTTACCCTATAGCACCCCCTCTACCCCCTCTAGGAGCCAAAANAAAA 

AAAANAANAAANNAAAGTGCGGCCCGCTAGACTTAGGTCTAGGAGGAAAAAAACCTTTCC 

ACACCTTTCCCCTGGMCCTGGAMCATAAAMTGMTGCAMTTGTTTGGTTGTTAACT 

TTGGIs rTTAT TGGCAGGCTTATAAATGGGTTTACCAAAATAAAAGGCCAATAGGCATCACC 

AAAAATTTTCAACAAAATAAAAGGCA I I 1 1 I I I ! I I I AACTGGCATTTCTTAGTTTGGNG 

GGGTTTTGGTCCCAA 

Sequence 1984 

CCGAMGMTATCTGTGTGCTTAGGGAGGAMCTTTTTGATCTGCAGAAAAGCCAGAAGA 

CATCTAGGACATCCATAAAMTTCATCAGAGAGCATTTTACTACTGAGCTGCAAAGGGAA 

AAACTTAAAATGGGATATGAAAAGTGAAGAAAGTGATCATAGGGAGAAAAACCATTTCAG 

ATGACAAGAGCACCTCAAAGGCAGCAGCCTCAAGGAGCAGCCATGGCCCCAGACTTGTCG 

CACGGATGCAGAAMCTTAATGGAGGAGGCTGAGGTCAGMTGGGAAGAGTTTTTAAAAA 

ATAAAAAGGGGAGCTAATATGTGAGGGACCAAAAAAANNNNCANAAAAAANAAAGTGCCG 
GCCCGCTAGACTAGTCTAGAG
Sequence 1985 

MGCTTCCTGGTTCAAATGTGCCATTTCCCGGGTTGATGCTGCCACACTTTGTAGAGAGT 

TTAGCAACACAAGTGTGCTTAAGTCAGGCGGTAGGGAAATCCCTCACTAAAAGCAGGAAG 

AAAGGTTCCAATTCAAAAGGTGCCCAATGGATAGGAAGTCAAACAAGGAAAGGGTTAAAT 

GGTTNGGAAAACCACAMT CAGGG TGGTGGGATTTGGGTGCTTACTTTTGNAACAAAAAA 

GGGTCCCCCCTGGTGGGTCTTTTTGGTTCAMCMTTGGTACAAATTGGTAAGAAACTTC 

TTGTCCAMCCACTTAMmATTTTTGGTCTTTGGAGGTTm 

NGACTATTGGGAATCCCCGGCATGCCCTGGAATTCACTAAMGCCAAAGGGGGTCTTGTA 
AGGCCMCGCTGCTTCTTCTGGAGGACTTTCCATTTCCTTTTCTGGATTGGGCAACACCG 
TGCAGGCTCATGGACAAATCTGGTAGGGATAACAAATTCAGTGGTGGGANTTTCCACTTC
Mill I CMGTCCTTTCATGTTAAAAGGAATTTTAAGAACACCCACATACAA 
Sequence 1986 

GNGTCGACCACGCGTCCGGAGCAGCAGCCATGGCCCTACGCTACCCTATGGCCGTGGGCC 

TCAACAAGGGCCACAAAGTGACCAAGAACGTGAGCAAGCCCAGGCACAGCCGACGCCXaCG 

GGCGTCTGACCAAACACACCAAGTTCGTGCGGGACATGATTCGGGAGGTGTGTGGCTTTG 

CCCCGTACGAGCGGCGCGCCATGGAGTTACTGAAGGTCTCCAAGGACAAACGGGCCCTCA 

AATTTATCAAGAAAAGGGTGGGGACGCACATCCGCGCCAAGAGGAAGCGGGAGGAGCTGA 

GCAACGTACTGGCCGCCATGAGGAAAGCTGCTGCCAAGAAAGACTGAGCCCCTCCCCTGC 

CCTCTCCCTGAAATAAAGAACAGCTTGNCCGGATAAAAAATNAAAAAAAAAAANAAN 

Sequence 1987 

GCCACGCGTCCGCAGGGAACGTGATTAGTGAAAGGAAGATAAACGTGGATGTTACTCCAA 

MCTTCGTTTAATGAATGCTTAAAGAATTCAAATTTTATCTGCCTCTCTTGTMTTTGGA 

TCTCTTCTTAATGTACATAGTGCTMCATGAAGACCTTTTTCTGCACTATATGCAAACAG 

GGTMCTAACTAAAACAAAGCCACTTTCMTCTTCAATCCTTGAAGGTATATCTAGGTTT 

ATGACAGTAATTGTGTTTACATTTTATGGTGCCTAGTATTGACAAAATGTTA7TTCCCTA 

CATTAMCATGACTCCATAGACCTTTTCATTTGTGGGGTTTTTATTTCCTATGATGTATA 

CTGCCACTAACCTTNCAAAAATTACTTAGTATTGCAAAGTCAGGGAATCATCAGGGAACG 

TTTAGCTTGGCCAAAATACTTGGTCTGGTTTTAAAAACCCTGTCNAGGTCTACCAAACCT 

GTTCMGGTCTACCCAATTTTMGGGGCAAATTTGGGGGNAAAAAGGAAAAAAT 

Sequence 1988 

GGTGTCGACCNCGCGTCCGCGAGTCCCGCGTTCTCTCCTTGAATCCACTCGCCAGCCCGC 

CGCCCTCTGCCGCCGCACCCTGCACACCCGCCCCTCTCCTGTGCCAGGCAAGGTGACCCC 

ATGGCAAGGCGCAAGCCAGAAGGGTCCAGCTTCAACATGACCCACCTGTCCATGGCTATG 

GCCTTTTCCTTTCCCCCAGTTGCCAGTGGGCAACTCCACCCTCAGCTGGGCAACACCCAG 

CACCAGACAGAGTTAGGAAAGGAACTTGCTACTACCAGCACCATGCCCTACCAATATCCA 

GCACTGACCCCGGAGCAGAAGAAGGGAAGCTGTCTGACATCGCTCACCGCATCGTGGCAC 

CTGGCAAGGGGCATCCTGGCTGCAGATGAGTCCACTGGGAGCATTGCCAAGCGGCTGCAG 

TCCATTGGCACCGAGAACACCCGAGGAGAACCGGCGCTTCTACCGCCAGCTGCTGCTGAC 

AGCTGACNACCCGGGTGAACCC 

Sequence 1989 

CGTCCGACAACATTTGGCATNAGGGTTGTATCTGTTGGTGGAGGACACAACGCCAAAAGG 

AAATGGGATTTCCTGGTTAGGCCTGCGGCTTGGCAGATGATTGTTATGGGAAAGACACTG 

AGTCTGTTTAGGCMTTTCTTCCTCTTTACTMTAMGTGTTCX)TATTTTGAAAGCAATG 

CTGAGTTGTGGACATGTGTATAAACCGTAATGCTGTAAGTTAGGCCTCTCTTGTCTAGAA 

TCTCAGCCTCTTCATACTCTCTCCCCCTTTTTTGCCATATTATTTTCCTTATCACTAACT

ATATATTTTACCTGTCTCACTTACTATCTGTCTCTCCTCTCAGAGTGTTACTCCAGGAGG 

GCAGAGATMCTGTTTAGTCCAGGCTGTGCCTTTCGGTGCTCAAAATAATGCCTGGGTGC 

AAAATAAATATATCTTGAATGAAATAATGAAGTAATCTTTAAAATGGTGCTCCGAGCATA

ATTTTCTATAGTMCACATAGTTTGCAATTAGTTGTCTCTTTCAGGGATAATGGAAATTG 

GTCAACATAAGAAAA 
Sequence 1990

NCCCCGCGTCCGMTTTGTTTTATGGATTGMTNATGTCTCCTCTAAAMGATTTATTGA 

TGTTCTMCCCTCCATGTCTCGTATTGTMCCTTATTTGGAATAGGGTTGTTGCAGAAGT 

MTTAGTTACATTAAGATGAAGTCATGCTGGAGTAGGGTGAGCCCCTAATCTAAGAGGAC 

TGGTGTCCTTATAGGAAGAGGAAAATGCCACATGAAGAGAGAGAGAGATGCCCAAGGAGG 

TGTCATGCTGCAATGAGGGCAGAGATTGGAACTGTGCAGCTGCATGCCAAGGAACACCAA 

AGATTTCAGCAMCTACTGGAAGCTAGGAAAAAGCAAAGAATTCCCCTAAATATTTCAGT 

GGGAGCATAGCCCTGCCMCACCTTATTTCAGATTTCTATCCTCCACAACTATGAGGCAA 

TMGCTCTGCTGTTTTMGCTCCTCAGTGTGMCTATTTTTGTCATGGCAGCCTTAGGAA 

ACTMCACMTTTACAAACAGATTAAGTTCACTAATCAAAAGACAGCCCTTGCAATTGGG 

Gl I I 1 1 1 I I I 

Sequence 1991 

CCGCGTCCGGCGGGTGAGCCGCTGGCGCGCCGGGCCGGGCGGGGGATTGGCTGAGGGCGA 

CGCGAGAGAGGGGAGACCCGGACTGAGGAGAGGACGGGGTGGAGGGTCCCGGCCGGAGGC 

TAGCCTGAGGAGACCGGGGGCGGAGGGGAGACCCGGGCCGCGGAGGAAAGGGATGGAGGA 

GAGGAAGCCGCCGGGCGCCAGCGGGACCCCCGGGCTGAGGGGAGAGGCGCCCCAGGCCGG 

GTGAAAAGTGGCCGAGGAGACCTGGGCTGGGCTGGCAAGTCCCGGACCGGGGAGGAGGGG 

AGCAGCCCTCCGATGTGAGGGATCGCAGAGGAATGAGCTTCGTTCTGGATTAAAAAAAAA 

AAAAAAAAAAA 

Sequence 1992 

GTCACCACGCGTCCGCAGCGCACGCCCCNTGCCCTGAGAACAGGAAAGGGCCCGGAAGGG 

CTGACTCACCGGGCCGACNCTCACACGAAAATGGGATGCACTTTATTTGCTGGTGCAAAG 

GCAGGTGAGGGTGCCTNCTGNGTGACCGNTGGCCCTNCTGCCTGGNGGCGCTGAAGGGAA 

GGAGCCAGTGAGCCTGACCCCGGGAGGGGCCGTCCCCGTGTGCCGCGTTNGGCGGGGCCC 

CACGCGGCTCCCCCANGCCCGGGTCTGGGGCCCCAGGCTTTCCCCTGNCTNGNGGNCGNN 

CNTNCCNNGCT1TGGGNTNCTTGNNTNGGNNTTTTTAATGCCAGNNNTCANNACATAANT 

GCNTTNTGAAAGAGGTTCCAGCTATCACTTGTAACCATATATATATACATATATATTCTA 

TCTACAAAGTGTTTATTTNCAAAGATNTTTCAACGGTGAATTCAGTCCCCGGCCGCCCTT 

NTGACCATCTGTNCCCNGTCCTTGTCCCCGCCCCGGGG 

Sequence 1993 

TCNCCNCGCGTCCTTGATATTTTGAGAAAAATCATGTG AGTC A I MM I CTGTTTCTCTT 

TTCTCTTAACGATTATCACTGTAATTCTGAATCTGAAAGGTAAAACAATTAGTCAAAATA 

TTATTGCCATCATTCTACCTGTGTTATGAMCTACTTATTCATAGTTAATTCTCATTAAC 

ACTTACATTTCC ATAMG AAMCTCMGTATTMTAAMGAGACTTTACTGGCTTAAGAG 

GGCTGTGAMGATTTTTGATAGTGAATCATGACCCTAAGGGAGAGATTTGTGTGATAAAA 

GTATTGTATATMTAGATCAGCGATTTTTGTMGGCAMCAGMTTTGTAAGTTGGCAGA 

TCTTCCTAAGTTGCAAAATGTAATGATGAGCTTGGTGGGAGAAGAATGAGTCGTTCTTGG 

AATACCTATGTGCAGCCACTACCCATCTCAATGTCACCTTGTTTGCATTCTTGGATAGCT 

TGTATATGTAGTAGTTTGATGMTMTTTAAAGAAAAACACCTAAAATTTGAAAAATGAT 

TGTAGGGATCAAAAAAGGCAGATGAAATTAC 

Sequence 1994 

CACGCGTCCGCTGACCTGACCCNTTCCTGATCCNAGGCCNGGTGGTTGTCTTATTGCACC 

ATACTCCTTGCTTCCTGATGCTGGGCAATGAGGCAGATAGCACTGGGTGTGAGAATGATC 

AAGGATCTGGACCCCAAAGAATAGACTGGATGGAAAGACAAACTGCACAGGCAGATGTTT 

GCCTCATAATAGTCGTAAGTGGAGTCCTGGAATTTGGACAAGTGCTGTTGGGATATAGTC 

MCTTATTCTTTGAGTMTGTGACTAMGGAMAAACTTTGATTAAAAAAAAAAAAAAAA 

AAAAAAAGTGCGGCGGCCGGGCGCGCCGGGGTGGGCGGCGAGCGGAGCCGGCCGGAGCGG 

GCCGGGCA 

Sequence 1995 

TCCGACNAAGGAACAAAAGCGAAACACACAAACCAGCCTCAACTTACACTTGGTTACTCA 
AAAGMCMGAGTCAATGGTACTTGTCCTAGCGTTTTGGAAGAGGAAAACAGGAACCCAC 
CAMCCMCCAATCAACCAAACAAAGAAAAAATTCCACAATGAAAGAATGTATTTTGTCT

TTTTGCATTTTGGTGTATMGCCATCMTATTCAGCAAMTGATTCCTTTCTTTAAAAAA 

AAA AAATGTG GAGGAMGTAGAMTTTACCMGGTTGTTGGCCCAGGGCGTTAAATTCAC 

AGATTTTTTTMCGAGAAAMCACACAGAAGMGCTACCTCAGGTGTTTTTACCTCAGCA 

CCTTCCTCTTGTGTTTCCCTT AGAGATTTTGTAMGCTGATAGTTGGAGCATTTTTTTAT 

TTTTTTAATAAAAATGAGTTGGAAAAAAAATAAGATATCAACTGCCAGCCTGGAGGAAGG 

TGACAGTCCAAGTGTGCAACAGCTGTTCTGAATTGTCTTCCGCTAGCCAAGAACCTATAT 

GGCCTTCTTTTGGACAMCCTTGAAAATGTTTATTTAAA 

Sequence 1996 

ACGCGTCCGMGAGGTATATCCCATACCTTTTCCCCAMTTCCCTTGTCATTAATTTTCC 

AATTATGCTTCTTCCAAGTGCCTGACCATCCAGCCAAACCATTGGCTACAGCCCATGAAT 

CAGTATACAATGGCGCTACTGGTCATTTCTC7TTCCATACAAAGTGCACAACCTGGTATA 

CTC1TCAAAGTTCTGCCCACTGGGMGATTTGCCTTCACCACTGT(XTTCAGGGATAGGG 

GCTATAGAGCTGCAGCTGTCCACTTTCAGGTGGTGCCTGCATATCGTGCATAACCATCTG 

TGAACCAAGCCCTAGTCTTGTCTTCCTCTGTCAACTGATCATAGGGAACTCCCCATGAGG 

CCATCAGTGCAGGCTAAGGGAAAGAAGGCAAGGTGGCAGGAGTGGAGACCATGGACATTT 

GAGCTACTTTCTTAGGTAACTTACTTGTGTCTTCAGGACCTGCTCAAGCCAGATCACATA 

TATACCACTTTAATT 

Sequence 1997 

CGCCNCGCGTCCGCTACACTTAAGGATGAAGAGAGGAAAAATCTACAGTGTAGGCACAGA 

AATGCCTAGAAATGAGAGAAAAAAAAAACAGGAGCGCATACTGCCTCAGAAACACCAAGA 

GTAGTTTGGGAAGAAAGGAGTCAACAATAACAACTATCAAAGGAATGAATGGCCAACAGT 

GTAGCCTCAAGTAGGGTAGTGTGAGTAGTTGATAGACCAAAAATCCTCAGCAAAATACTA 

GCGAACAGATTCAACAGCATACTGAATGGATTATACACXJATGGCCAAGTGGGATTTATCC 

CAGGAATACAAAGGTAGTTCAGCATTAGAAAATCAATCAATGCCAACACACCTCATTAAC 

AGAAAAAGTAAAGAACTGCATACATGATCCCACAATTGATGCTGAAAAGGCATGACAAAA 

TCCAACATGCTTTCATGATTTAAAAAAAAAAAAAAAAAAAAAN 

Sequence 1998 

CCACGCGTCCGCACACTTGACCCCAGAGATCACGCCACTGTCAGCTGCCCTGGCTCAAAC 

AATTGCCCAGGGAATGGCACCTCCACCTGTCTCCATGGCTCCTGTGGCTGTATCTGTGGC 

TCCTGTGGCCCCTGTGGCTGTATCGATGGCCCAACCCTTGGCAGGAATCACAATGAGCCA 

CACCACCACTCCCATGGTGACTTACCCTATCGCTTCCCAGAGCATGCGCATCACGGCCAT 

GCCACACTGATGGGGCTAATGGACACTCCCCTGGTATAGCCTCGCAGGGCTGGGGTCAGG 

GGGCCCTTGCCCACTCACCTAGCCTTCCCCATCCCTGTCTGAAGGGCTCCCTTGAGAACT 

AGGACAAGAGACTACAAGGAGTATGTCCTGAGGAGGGGTTGGGATGGTGTGGTTTTCTCT 

CACCTCCCTTTTATGAGGGTCCTCTTGTCCATCTTCAAGCCTCACAGTGGGGGGCTT 

Sequence 1999 

NNGGCAGGAGAGGTTCAAATGCATTGCATCAACCTACTATAGAGGAGCTCAAGGTAATGG 

GGCTGGGTGAAGTGGGGTAGGTGGGTCTCAGAGTGCACATGGCTTCTCATATGGAGCTGG 

MGGATTGGGGAMTGAGCAGTAGTGTCTTCCCTGTCMCCTGGGGCTGTTTNTGCCACT 

CTTCCAGCCATCATCATTGTCTTCAACCTGAATGATGTGGCATCTCTGGAACATACCAAG 

TAAGTGAGCATCCTGCAATATAATGGGAGGCTCCG 

Sequence 2000 

ACGCCTCCAGCCTGGGCAATAGAATGAATGAGACTCCATCTCAAAAATAAATAAATAAAT 

AAM TACT GAGMAMGMTCTTTATTGTTTCCTGTAAMTAMTTTTCCTTTTAGCAAA 

GCTCTTTTCCCTTTGACTCTCGCCGCCTAGATTTCGGTACCAGGACCACACATTTTTAAA 

GATGCTCCTCACCGCCGTACAGCTCCTGTACAGCCCAGAAAGCTCCGTGCGCACGAAGCT 

CATCCAGCTCCCGGTGGTCTACGTGATGCTCATGCAGCACTCGCTGTTTCTGCCGACTCT 

ACTGACGTCTGACGGAGAGGAGAGCCCGGACAGCCAAGTAAAAGGTGACCCCTCACCCCA 

GCCCTTCCATTCCGTATCCAGATTTTATCTCCTGATTTCCTTA 

Sequence 2001 
CCCGCGTCCGCCGAGATGGCGCCCCTGCACTCCAGCCTGGGTGACAGAGTGAGACCTTGC

GTTTAAAAAAAAAAAAAAAGGAAAAGGAAAAAAAAGTTCTACTGTGGGTAAAATGCTATC 

ATGCTATCAAACAGCATCACATGCTACAGAGAAGTCTTGTGAAAGGATGTCAGTGTGGCA 

MCTTCATTGTCTTTTMGAMTTGCCACAGCTATCGTMTCTTCAACCACCACCACCCT 

GATGAGTCAGTAGCCATCGTTAACGTTGGCGTAAGACCCTCCACCAGCAAAAAGATTGTT

AGCATTTTTTTAGCAGTGATGTATTTATCMTTATACATMTGCTATTGTACACTTCTTA 

ATAGACTATGGTATAGTGTAACCATAATTTATGCACTGGGAAACCAACAAATTCATGAGA 

TTCGCTTTATTGTANTTGCTCTGGMTTGAGCCTGCCCGTGTTTCTTGAGGTATACCTGT 

GTATTAGGTATGTTCCTTGTATATAMACCCCATTTTTTAGGGTAGGAAACTAANGCTTA 

GAGTTTGAGCCAA I I I I ICCCATA 

Sequence 2002 

CGTCCGCACTGTTCAMCCCTGTCATGCTTTAAAACTGATGCGAGATGATTTTG I 1 1 1 1 I 

GCATAATCAATACTTAAGGGTGCAATCAACTGTTAGTAATTGTGCAGTANAGTAAAGCCC 

TGTGGTGTATCMCTACTAGTTAAGAGTCTCAGTTGATTTCTGTAATGTTTGACCTAATA 

ATAGCCCGTTTCGTCTCTGACCCAACAGAGGAAGCACAGATCAAATCACCTTGGAGTGGT 

CACCAGGGGGACAGGGAGCCCCCCACCAATGTATCAATGGGTGATTTATGATGCCTTCTG 

CCCTTTG GCGAG TGMTGGGTTTCCCATAGGGGAAGTTNGGCCTCCCTCCGTGAGCTTTG 

GAAMTGTTTTTCTMTAGACACAGGGGAGGCCAGTTTTTGTTT^ 

TTC CCAM TTCNTCTGTTTTGGGNGGTNGGAACCTGNGGGGCCCCCGGGGTTTCTGGTTT 
TCX^TTTTTN CNGCN GGAAAATTCTCCTGGCTAAAANANTCCC I M 1 1 1 I I I I NGGTTTNC 
CANMG CCCTTTTTATAMTGCANNATANTGMTNGCTTGGGGAACNNAAMTAAANTTT 
11 M 1 1 CCCANTCAAAAAAAN AAAAAAAAAAANAAAAAAAAAANGGCGGGGCCCCTTANA 
TTTTTTTAAAAAAAAAAAACCCCC 
Sequence 2003 

TTNCCATCCCTCAGGTGCTGAGAACCAAAGTGCCTAGAGAAGGGCAGGAGGAGGATGACG 

ACGATGAGGMGACGATGCTGACGAGGAGGCTCCCMGCCCGACCATTTTGTTCAGGACC 

CTGCAGTGCTGAGAGAGAAGGCAGAAGCCAGGCGCATGGCCTTTCTCGCCAAGAAAGGGT 

ACCGGCATGACAGCTCAACAGCAGTGGCCGGCAGCCCCCGAGGCCATGGGCAGAGCCGCG 

AGACAACCCAGGAACGCAGGAAGAAGGAAGCCAACAAGGCGACAAGAGCCAACCACAACC 

GGAGAACCATGGCCGACCGCAAGAGGAGCAAAGGCATGATCCCATCCTGAGACCTGGTGC 

AGGGCCAGTGGGGAGGCAAGC 

Sequence 2004 

TNCACATTTTCCTCCTAAGCACATGGGTCATTCTCAAGGAGAGACCATATGTTAGGTCAC 

AAACATTCAAAAAATTGAAATATTATCAAGCAACTACTCTGACX^ACAATGATGTAAAACT 

AGAMTCAAMCCMGAGGAATTTTAGAAACTATAGAAGCACATGGAAATTAAACAATCT 

GTTTCTTMTGACCAGTGAGCCAATGAAAAAATCMGMGAMTTTGCAMTTGTTTGAA 

AGAAGTTATAATGGAAACACACTATACTAAAACCTATAAAACAGCAAAAGCAGGACTAAG 

AGGAAMTGTATAGCTATTAMGTGCCTACATTTTTAAAAAATGAAAAAACTTTAGATAA 

ATCACTTATCTATGAATTTGAAAAAAAAAAAAAA 

Sequence 2005 

CGGGMGCTTTNTTTTAATTMGCTGAAACCGMGCTTTAATTTTMTTTA 

ATGTGCCCATATTGGACAGTATAGCTCCAGGGTTTATTAAACCACCATTCCTGGTGATAG 

GATAGAGCCCAGCACAAACTAATCTGGGCAACAAATCAAAGGGCACAAGTCGCATGTAGG 

TTCX^MACTCAGTACATTTTAGAGAAMGGAGTTTGATTATGTAGTAGAAGGAAGAACTG 

CCTGGTGGGACTCATGATCTTCCTTTMGAGCAAGGCTCAMGACCTGGGGAGTTTTGAT 

TTGATGCTATGATGTCTCCTGGGGCTCAGAATATTATGAAATGAGGAGTGAATCTCTGAG 

TGAAAAGAACTCAAGCTGCTTGTTGCATTGCGGAATGTCTCTAAGAGGTAGAGAGGCGTC 

TGTTAAGTGGCTTTGTATGAAGGTTTCAGAAGGTAAGATGAGCC 

Sequence 2006 

NCATTTAACTGTAATAGATATAGAACGTATCCTTCAATAACCAACAACAAATGCTTGTTT 
CCAACAGGCTCTGATGGTTGTATATANGGAAACCTTGATACCAATACGTCCAGCTGACCC 
AGAAAAAAATCAGAGCTTAGTTAAATCACTGCTGCTCAAGGCTGTTGTATCTGGNNACGC

TCGAAATGGAGTTGCACTCACTGCCCTGGATCAGGATCACGTCGCAGTCCTAGGAAGTCC 

ACTAGCAGCTTCT MGG GTAACTGACATCCMTTCATTAAGMGGCCTTTGTATGTTTCC 

TTCACGGTGTACCTTTTAGTCTTCCTTTTNTTTTTCATTt 

Sequence 2007 

GTCCGGCCCTGCACTAGGGACTGGGGAGGACATGGAGAATAATATATGTACCAGCACAGA 

CAGACAGGCAGCTTCTGGAGCCAGACTGTCTGTGTGATCTTGGGTAAGTAATCTAGCCTA 

ACCATAGAGATGATAACAATTGTACCTGCCACAGGCTCATGGGGCTATTGTGATGGTGAA 

ATGTAAAGATGTAAAGTATAGAGAAAGGATGTAAAACTGCCCGACATATAGTAAGGGCCA 

TTTAAMGATGCCTGTTATTGTCCCTCTTCTTGAGAGCATATTAAGGGCTCAAATCTAGC 

CAGAGTCTGGCGGGTACAGAGGTAAACAAATAATGGCAGGTTGGTTAAGTACTGGACACC 

AGTGATGTCCAAGGTCTCTAGGATGCCCTGGAGAAATAATGAACTCTTCTAAGATTGTTT 

GGMGGGGATTTATAMGGACATCTAGGCTGGGTTTTGAGTGAGGTATGGGGAGTTCGCT 

GGTCAAG AATGTAN G AATTACAAAAG C AN AAAGTGCCTTATTTN AAAAAAAAAAAAAA 

Sequence 2008 

NCCCCGCGTCCGGAAAATATTTTAGAMGCACCTTGMGATTAGTATTTTTATGTAACTT 

CTGTTGGAGAGATGTCTTCAGGAGACTGAAGTAGAAGAGCGACTGTCAAAATGGAAAGTC 

CCAGAGACATCCAATTTATGTAAATCAACATCACCTGAATTCAGAATCTCATCCAGATTT 

CAACAAAGACTTCTGMTGCCAACAAAGAAGAGGACTGAATTTACAGACTCTCACTCTAA 

CAATATATGCTGTTCMTTTGAAAACAGAATAAAATTATTTTGGCAGAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

Sequence 2009 

AGAGTGTNCAGCTGGCTTTCCTCACTTGGGAAAAGGGGTACTGCCAGTCTAAGCAGCCTC 

CTCTGTACTCAGCCAGGACACCCAGCGCGTGGGACCTGTTTGTGTCTGTTTTGCTTCCTT 

GGGMCGGCACAGTCACTCACCCTGCCATTTGCGGAAATGACCTGGTGCACTTTGACTGT 

TAAGCAATGCGT TATTG CTGTAGTCAAGGTTAGTGCMGCAAGGAAACATTCCCAGTAAG 

GTATTTGTTTCCATTTTCTGTCTGTGCTTCTGTCAGAMCTTGCTAGGACTTTAGTGGCC 

AATAAAAAAGAAATTCTAGCTTGATCGACCCACGCGTCCGGTAATGATTGATGAGGACAT 

GACTGCCCGAATTAGCATGGCTGATGTCAAGTTCTCTTTCCAATGTCCTGGTCGCATGTA 

TGCACCTGCCTGGGTAGCCCCCGAAGCTCTGCAGAAGAAGCC 

Sequence 2010 

GCGTCCGACAAACATAATGATCCGTGACAGTGACCTTAATATATAAATGTGTGTGTTGTG 

ATTGTTCTACTGACCAGCCATTTCCTGTCTCCCTCTTCTTATGCCTCCCTGTTAAATTAG 

ACACAACAACATTGAAATCAGACCAATTAATAACCCTACAGTGGTCTCTGAGAGAGGCTA 

ACAGCTAATCCTCTTGTGCTAAACAGCCAAATTGTGAATGCAAAGGAAAAGTTCTCAAAG 

GAAATTAAAAGTGCTACTCCAGTGAACACATGAATGATAAGAAAGTGAAACAGTCCGGGC 

GTGGTGGGTCACGCCTGTAGTCCCAGCACTTTGGGAGGCTGAGGTGGGTGGATCACCTGA 

GGTCAGGAGCTCAAGACCAGCCTGGCCAACATAGTGAAACCCTGTCTCTACTAAAAATAC 

AAAAATTAGCTGAGTGTGGTGGTGCACACTTGTAATCCCAGACACTCGGGAAGCTAAGGC 

CGAGAGAATCACTTGAACCCCAGAGGTGGACATTGCAAGTGAGCTGAGATTGTGCGACTT 

GCACTTCAACCTGGGTMCAGAGAGAGACTCTGTCTCAAAAAAAAAAAAAAAA 

Sequence 2011

GTCTATTAACATACGGAGCTGAATCTGCCGCAGCTNGAAATGCTCAAGAACCAGCTGGAC 

CAGGAAGTGGAGTTCTTGTCCACGTCCATTGCTCAGCTCAAAGTGGTACAGACCAAGTAT 

GTGGAAGCCAAGGACTGTCTGAACGTGCTGAACAAGAGCAACGAGGGTATGGGGTAGGCG 

GGTGAGGGTMCCTAMGTGGCGMCCTGCTTCTCTCGTCCCACCTCCTMCCCAGTTTT 

TCTTACCTGAMCGAGAAAATCCATTACATATCGTATACCGCTTCATGAACCCTTTGCAT 

GTTG CCTG CCTAGMTTGAAAAGTACAGGACATTCCTCTGCTCCTATTGCCCCTGTTTCC 

GTTCTTTTCACACTGTCTGTGGGTGCTGTGCCCTGTTGGMCTCTCTTTAACGTCTTACC 

GTTGGAGCCGCTTACCTTCCCAGGTGTTGTCTTCATTGGCTTTCACAAGGGAAAA 

Sequence 2012 



NGGCAGCTGCTTTGTCTGGGAGGGGGCTTTGTGTCGAGTCTCCCTGAATGAGCAGGGCTG

GCGACAGTTGTCAAAACACATGGTGCTTGGTCAGAGCCCCCGTAGAAGCCCCTTGTCCTC 

CGCATGGCCTCCGCCTGCACCCGGGGCGTGGAATGTGCTCTTGTGTGTCCCTGGCTGTCT 

GCTTGCTTCTACACTGGCCCCTGCAGATGGAGGGGGTGGGGTACAGGGGTTCCTATAAGA 

AGCAGACACTTGGGGTTTTTCCCAGGCCCTGTTGCAGGAGGGTGCGTGGGCTGGTTTCCC 

TGAAGGCGCCTGGGCCGTGTTGGTGTTAACTGATCTGAGATCTTCTGTGGCCCTGATGTC 

TATGAGCATGCCCCAACTTGCANGGGGCTGAGTAGCCCGGGCACCACCAGGAGGCTTGCG 

TGCCCTGTGCTTGGGTGTACCCATGCCCTGTCAGCATCGTTGGTCTGTTAGGGGTCAGGG 

ACTTCGGCTTCTTGTTTAATACCCTNC 

Sequence 2013 

MGTCGGACGCGTCTCAGTTCCTTAAAGCTGTTGGTCCAAGGCTACCCTCAGTTCCTTGC 

CACATGGGCTTCTCCMCAAGGCCATTTACATTAACAAAGCCAATGAGAAAGAGAGTTAC 

CGGGCATGGTCGCTCACACCTCTTAATCCCAGCACTTTGGGAGCCCGAGGCGGGTGGATC 

ACTTGAGGTTAGGAGTTCGAGACCAGCCTGGCCMTATGCTGAMCCCCGTATCTAC3TAA 

AAATACAAAAAAATTAGCCTGGCATGGTGGTGGGCACCTGTAATCCCAGCTACTCAGGAA 

GCTGAAGCAGGAGAATCACTTGAACTTGGGAGTCGTAGGTTGCAGTGAGCCAAGATCGAG 

CCATTCCACTCCAGCCTGTGCAACAAGAGCGAGATTCCATCTCANACAAAAAAAAGATNA 

ANNAAAAAAAAAAAANT 

Sequence 2014 

TCCTATTCCTCTCTTGAGTTTATGCAC 

CATAAMTTCTTTTA GAAM TGCAAATAGTGAACTrTGTGAATGGATTTTTCCATACTCA 
TCTACMTTCCTCCATTTTAAATGACTACTTTTATTTTT^ 

GTATCATGAGTAGGTCTTACATCAGTGATGGGTTCTTTTTGTAGTGAGACATACAAATCT 

GATG T TMTG TTTGCTCTTAGAAGTCATACTCCATGGTCTTCAAAGACCAAAAAATGAGG 

TTTrGCTTTTGTMTCAGGAAAAAAAAAATTMTGMCCTTAAAAAAAAMNI 

ACANGTGCGGCnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnGCA 1 1 1 1 1 1 1 INACTGCANTC 

TAAGTTGGGGGGTTGGNCCAAACTCATCAANNNATCNTAATCATGGCTGGGATCCCCNGG 

GTCCCGNGCCTCGGAMTTAMTTNCCTTTTTTCCGGTTTTCCTTGGGGTTT^ 

TGNGGNGAAANI I I I I I I I I GGGGGGGGGGGGGG 

Sequence 2015 

CGCGTCCTGAGAGGTCAGGCCGGTCCTGGGGGCAGCAAGCCCGGCCACACTCCCCACCGC 

GACCGGGGCTCTGGGCTCGCTTCCTGCTTCAGTTTCCCCAAGCTCCNGATGAGACTCCGC 

TACTACCACCACGTCGATAACGCAAACCTAGAGGGACTCAGGGTAAACTGAGGCACTCAA 

ACTGCCGAGGAGCTCCGCCTCCCGAGAGACATTTAATCCGGGGGGATTTGCAGGAAACTT 

CTAAATTAAGGGTAGCGGCTGCTGCAGCTGAGGGGGGGCACACCGGTCCCTGCGCCCGGG 

CAGCTGCCGTGAGCTCACGCCCCGAAATAGCCCCAGGGGCCCCAGCCGCAGCTGCCACTG 

GGTCCGGCTG TCACT CAGAGGAAGCACGGAGCCCCCAGCCCAAGGGTCCCCTCCCCTTCG 

CATCGCGGGGTTTTTCCAGCCGACCGTCGGCCACTTTTTCCTCCGACNGCTGGCAGGGAA 

GAGGGGGATTGGGGGCCGGGACCCCAAGGGAGGCGGTCCCCAATGGGTGGGCCAAGGG 

Sequence 2016 

ACGCGTCCGCACACCCCCAGGTGCCCGCGCTGGCCCCCCAGGCGTGGTGGCCTGCACGGA 

GGGGACCACTTACGTCTGCTCCGTCTGCCCAGCAAAGTTTGACCAAATCGAGCAGTTCAA 

CGACCACATGAGGATGCATGTGTCTGACGGATMGTAGTATCTTTCTCTCTTTCTTATGA 

ACAA AACAA AACAACAACAAAAAACAAACAAACAAAAAAGCTATGGCACTAGAATTTAAG 

AMTGTTTTGGT1TCATTTTTACTTTCTG 

ACA TGM GAACTGTTTTTTGCCTGCTGGTACATTACATTTCCGGAGGCTTGGGTGAATAA 
TAGTTTTCCCAGTCTCCCTCGGATGGTGGCCTTAAGGCCTGGTAGTGCTTCAAGAGGTCC 
ACTGGTTGGATCTCTAGCTACTGGCCTCTAMTACMCCCTTCTTTTACAAAAAAATCTT 
TTAAAAAAAAAAAAAAAAAA 
Sequence 2017

CGCTCCGCAGCATCMGAGTTATTTTTCTGACTGTCMGTTTCAACATTCAGGTCTGTCC 

CCAACAGGCACCACACCGGGGTGGACTCCCTGTGTAACTTCTCGCCACTGGCTCGGAGAG 

TAGACAGAGTTGCCATCTATGAGGAATTTCTGCGGATGACCCGGAATGGTACCCAGCTGC 

AGAACTTCACCCTGGACAGGAGCAGTGTCCTTGTGGATGGGTATTCTCCCAACAGAAATG 

AGCCCTTMCTGGGMTTCTGACCTTCCCTTCTGGGCTGTCATCCTCATCGGCTTGGCAG 

GACTCCTGGGACTCATCACATGCCTGATCTGCGGTGTCCTGGTGACCACCCCGCCCGGCG 

GAAGAAGGAAGGAGAATACAACCGTCCAGCAACAGTGCCCAGGCTACTACCAAGTCACAC 

CTAGACCTGGAGGATCTGCAATGACTGGAACTTGCCCGTGCCTGGGGTGCCNTTTCCCCA 

ACCAGGGTNCNAAANMCTTTGGCTGGGGCAAGAAATNAAANCATATTTGGTCCGGAAAA 

AAAAAAAAAAAAAAAAGGGCCGGCCCTTNAACTAAGTCTAAAANAAAAAAACTTTCAAAA 

CCTTCCCCTGGAACCTGAAACATAAAATGNATNGCAATGGTGGGNGGTNAACCTGGGTTA 

ATTTGGCANCCTTTTTANTTAAAANTTGGGGGGGGGGTT 

Sequence 2018 

CGGATCAAGACCATCCTGGCTACAGTGAAACCCCGTTTCTACTAGAACTACAAAAAATTA 

GCCGGGCGTGGTGGCAGGCACCTATNGTCCCAGCTACTCGGGAAGCTGAGGCAGGAGAAT 

GGCGTGAACCTGGGAGGTGGAGGTGCAGTGAGCCAAGATGGCACCAGTGCACTCCAGCCT 

GGGCGACAGAGGTAGACTCTGTCTCAGAAAAAAAAAAAAAAAAAAAATCAGTCACTGGAT 

TTGGGCCCACCCTACCTNCATATGACCTCATGTTAACTTGATGACATCTGCAAAGACCCC 

ATTCCCAAAAAGGTCACCTTCACCAGTMGTNGGGGGTTAGGACTTGMTATAGCTTTTT 

GGTT GATGTAATTCMCCCACAGCACTGCCTTTTNCATTCCATGTTATGTTTTTTGAG 

TTTTGAGATTTGCCAMTATATGAAGCTATAMTTATCAGNGAAMTAAATMTTTCAAA 

ATNTMGCTGTTGAAMCTCTAMTTATTTTTMGCCTTTAAMGA^ 

GACMGGNCCCGNNNGGGCTTCAATGCCTNTAATCCCCANCACTTTTGANANGGCTGATT 

GGGNGNGNGGATTCCACCTTGAGGGTTTAGNGGNTTTCAAAGACCAAGCCTGGNCNCTAN 

CGGNGGTTGAAAACCCCTGGTNNTTCTNACCCTTNAAAAAAAA 

Sequence 2019 

GTTTTT TTATT TCTACTGTCAAATGATGTGCAAAACCTTTTACTGGTTGCATGGAAATCA 

GCCMGTTTTATAATCCTTAMTCTTAATGTTCCTCAMGCTTGGATTAAATACATATGG 

ATGTTACTCTCTTGCACCAAATTATCTTGATACATTCAMTTTGTCTGGTTAAAAAATAG 

GT GGTAGA TATTGAGGCCAAGAATATTGCAAAATACATGAAGCTTCATGCACTTAAAGAA 

GTAl 1 1 I I AGMTMGMTTTGCATACTTACCTAGTGAMCTTTTCTAGAATTATTTTTC 

ACTCTMGTCATGTATGTTTCTCTTTGATTATTTGCATGTTATGTTTAATAAGCTACTAG 

CAAAATA AAACG AGTTGAC CCAC GCCGTCCGGACACAAGAAAGGAATATAATTCATACAC 

TATTGCATTTlTMTAMTCTm^ 

GGTTAAATGATMTTG MTGT AMTATTTAAGATGCAGCACCATATTTTATAACCCAGCT 
TTAGGCATTTCTTCATATTTTMGGGAMCCCCCCACCTC 

TGCTCTCTGA MTGC CCTGCTAMTGCCTTCTCTTMTTATTTGGAATAANGGTAGGTTT 

TTGGGGGAAAATTTTAAAAAAAAAANGGGGGGGNAAAAAAAA 

Sequence 2020 

AATTTl iNATMTCTGMTTATCACAAMAGATMGGATTTTTAMGTTATTTGGAGGGA 

GTGTACACATTGTTTATTTAATAGTGAGGGCTATTATACAAGCNGGTCNATGTAAAATAA 

TCCCTTTATATGTATGAGCATAGTTAATTTGGTAAACAACAGACAATTACATACTGTGAT 

CATMGGACTTTAGTATCAGTTACCATATAGCAGGTACTCTTTAGTCAGGATATACCTAT 

ATAGGTGCTAMTTMAMTCMCCTTATATCTCAMTTTACTTCCTAGTAGAGTGTAAC 

GCTGCCATAATTGCAAGCCTAATTATGGGGTTGTCCATACTGCAGTCCCATCAGTACTCA 

TACACCAMGGGTCGGGCCTGTGMGACTGGAAAAAGAATAAMTATCTTTTGATTGAGA 

TACTACAAGCAAAAATGACTTTCTGGCTACCATTACTGCAAAGAACAAAACTCAACTGAA 

AACAATATGTAGCAATTAAAAAGTCTCATTATCTTCAAAACACAC 

Sequence 2021 

CCCGCGTCCCGGAACTCTTACCCATAACCCTAATGATGCAAGTCATATGGGGGAACACTT 
TGTAAATGGTCAGGATAAAAACCAAATCTGGGTGCCAGATCCCAGCACTACTTTTTATTA

CTGGAGAAATGGGGGGGATAGAAAATTCTACTTTGAATTATTTAG I I I 1 1 1 1 1A AAGAGT 

GGGTTGTGTTTGTGCTTCTCCCACCTTTCAAGCATTTATAGAACATGCTGCCCCACATAC 

AMGTCMGACCACTTACTTTTATGTGACACTAGTAGTTTGGGGTTAATGGTTTGNGTAA 

AGAACAAGCTGCATATGAGTAAAGGTTACCCCAACCCNCAGTGAGGANGAAAGATGTTCA 

CATACTGGGAACTGTCCTGNCAAATAAATNTGGCCCCTATTGGGCTCTGTTTTAATTNGG 

MGNGGGCAAAAGTAACCTCTTGCTTTGGGGCAACTATTTGGNNTCAAAATTANAAACCT 

TTTAGACCCAAANTTNANNNNNNNNAAAANNGNNNNNNNANGGGGCCGGNCCCNTTNGAC 

CTTAGTTTTTANANAAAAAAAAACCTNCCACACCTTCCCCTTGGAACCCTGAAACAATAA 

AAAGGAAANGCCANTTGGGGGGGNGGGTAAACCTTGGTTTNATTGGCACNCnTATAAAN 

GGNTTNCCAAAANAAMGGCAATTTGCCTTTCCCCAMNTTTTCCCAAANAAAAAAAGGG 

Sequence 2022 

ACCACGCGTCCGGTCTGCAGAGGCCCGGGCCTGGGCACAAAGGGAGAGAGGCCTCCATTG 

TCCCGCAG GGGCC AAAATGCAGACCGTGCATCCCCGGTGACCTCGGGGACCGTNCTCTGA 

TCAGCAGGATTTTCTTGGACTCTGGGGTCCTTGTCCTGCTCAGGCATCCCTGCCCTGCTC 

TCCTTGAGGGCCCTCAACACTATCTTCCCTGGACACAAGTCTGGGGACAGCCGGGTGTTG 

AGGACCCCAAAGGGGTGACTACCTGCTCCTGGGCCCCACAGAGTCCTTGTGCTCAGTGTA 

GTGGCTGAGCTGGGGGATGCCCTGGAATTCGGAGCACACAGCACTGGCTTACTGTGGTAC 

CTGTGCA GTGA AATTGGAGACAGAATCACCAGGATGGAACACAGGTCTTGCAGGATCACG 

GAAMCCTTTTAGAGTTGTCTTGACACCACTTGATGTTGAGTGTCCGGGTGTTTGTAGGA 

TGGCCTGCACTCAGTCCAGGGGCAGG 

Sequence 2023 

CGCGTCCGCTTGACCCTGTATTTTGGGAGTCGAACGGAGAATGGAAACTGAAAGTGGAAA 

TCAGGAAAAGGTAATGGAAGAAGAAAGCACTGAAAAGAAAAAAGAAGTTGAAAAAAAGAA 

ACGGTCACGAGTTAAACAGGTGCTTGCAGATATTGCTAAGCAAGTGGACTTCTGGTTTGG 

GGATGCAAATCTTCACAAGGATAGATTTCTTCGAGAACAGATAGAAAAATCTAGAGATGG 

ATATGTTGATATATCACTACTTGTGTCTTTTAACAAAATGAAAAAATTGACTACTGATGG 

GMGTTAATTGCCAGAGCATTGAGAAGTTCAGCTGTTGTAGAGCTTGATTTGGAAGGCAC 

CAGAATCCGGAGGAAAAAAC 

Sequence 2024 

CGCGTCCGCAGACTTTCCCTCTGCAATAAATCCTGTAACAAAATTGCACTCGCACCCCTT 

ACGTTTATACAAATTTTTAAAATAAATAATAACTAGAAANGCAAGAGGAAACCAAACCCA 

AAATTATTAGAAGAAAAGAAATGATAAAGATTGGGGCAGAAATTAATGAATTGATACTAA 

AAATAGTACAAAAG ATGGATGAAAG AAAAAGTTG 1 1 1 I 1 1 1 1 1 1 1 1 I A AAAGATAAGCAA 

AATCAACAAACCTTTAGCCGGACTGTAAAAGAGAGAATATCCATATGAATAAAATCAGAG 

ATGAMMGGAGTCATTACATCTGATMCACAGAMTTCAAAATATTTTAGAGACTATTA 

TGAGCAACTATATGCAGTAAATTGGAAAACTTAGAAGAAATGGATGAATTCCGAGACACA 

TAAAAACATACCAAGATTGAACCATGAAGAAATCCAAAACTTGAA 

Sequence 2025 

AATTATCCTGGTGTGGTGGCGTGTGCCTGTAATCCCAGCTACGCTGGAGGCTGAGGCATG 

AGACTCGCTTGAATCCAGGAGGCAGAGGTTGCNTTAAGCTGAGACCACACCACTGCACTG 

CAGCCTGGGTGACAGAGCAAGACTCCGTCTCAAAAAAAAAATAAGCTATTGATGGGCTAT 

ATATTGTTAAGCTATAGGGTGTGGTTACAGTGTCCAGTGTAGCATTGNTCGATTAATTTA 

TAGCCTCTTGTGGCAACAGCAAGCAGNTtCCAGAGATGAATACACAGCTTGAGTAGGCAG 

GGGGGAGTAGGACATGACGGCTGCCTCATCTTAACACCTTGGGCCTGATAATTTAAAAGG 

ACTCACAGTCCTCAGATAAACATGACTTTCTTTTCTCATGAGGAAACANANAAGGTGGCT 

AAAGGGTATNTCTTTCTCTCATGATCCCAAACCTATCAGGTTT 

Sequence 2026 

ACCNCGCGTCCGGGTGCTGTACCAGACCAGAGGCCAGCTCCATGTCCTCCGCGTCGGCAA 
TGATACCCACTGCCAACCAACAAAAATTGGCTGCAACCATCCCCTACCAGGACCCGGCCC 
CTACAGGGTGAAGTTCCTGGTGATGAATGACGAAGGACCCGTGGTGAAACCAAGTGGTCC

AGCGACACTCGCCTGCAGCAAGCCCAGGCACTTCGGGCTGTCCCCGGCCCCCAGAGCX)CG 

GGCACCGTGGTCATCATCGCCATCCTGTCTATCCTCCTGGCCGTCCTCCTCACGGTCCTC 

CTGGCTGTGCTCATATACACCTGCTTCAACAGCTGCAGGAGCACTTCCCTATCAGGCCCA 

GAGGAGGCAGGGAGTGTGAGAAGATACACCACGCACCTCGCGTTCAGCACTCCTGCCGAG 

GGGGCTTCTGAGGGGTTCCAGAGGGGCCCACGTGTCCCTCCACCTCCTCCCTGGCCCAGG 

CTGCAGAGCCTGAGCTGGGACACGCCCTGAAGCTTCTGGACCCTGAGAGAGATTGGTTCT 

Sequence 2027 

GCTGC TAGAAA AGGTT AGACTMCTGGGAGTTCATMTTTGTCTTGGAGATTTAAGAAAG 

TTTTTGAAGAGGTTTTCTGTGTTGAGATGMGAGGMGATTGAGACAAAGCACAATGGAG 

AGTAGGGGAGAGAGCACTTCAGAATGTTTGAAGGTGCTAAGAGGAAGTGCTTGACACACG 

GGAGGAACTGAGAAAGAAGGCTAGGGCCATCGGCTGGAGGGACAGGACAGGTGGAGATGT 

MGCATGGGCCTCCAGACCATGGTMGGCCACTGGATCTTTACTTTAATGCATTTGTGCA 

TGTGATTAAAAAAAAAAAAAAAAAAAAA 

Sequence 2028 

CCCCGCGTCCGAAAAAGACATATGTACAAGTCTATTTCCTATAGCACTATTTGTAATACC 

TAGCACTATTTGTANTATCTAAA GACTG TAAACAACGCAGGTGCCCACAAAGGGAAAATG 

GTTTGACAMCTTACGAACATCCTTTTAAAGGAGTACTAGACAGGTCAAAAAGGAATGAA 

GAATGTATATATTACTATGGAGTGAATCTTCAGGGTATATGACTAAGTGAAAAAATGCAA 

GGTGAAGTANTATGTAACATATGGTACAGTTTACTTAAGAGAGAAATACAGATGTATATA 

CACATTACCTAAAGCAAAATGAGGACCCTACTGGGCTGCCATCCCAGCTGGACTGCTGCT 

GTGGAGCTCAGCATCAAGTACT 

Sequence 2029 

CGCCCCGCGTCCGGAGAGAGCTGTCTTTGCAGTTACTAGGTTTCATCAAACTTGTTTTTT 

TCAGTATGGTAGGTTTAAAMTGGGGATACATTTTTGTTTTTATTTGCATTT^ 

TTTCTTAGGTTAGTTGGCTACTTAAATTTCTTTTTCTGAAAACT^ 

TTAAMTTCTTATTGACTTGCCTGMCTATTTGTAMTTACAGAAATTAGCCCTTTGTCG 

TATGTGTTGCAGGTGCTTTTCCAGTTTGCCAGTGGTCATTCATTTTGGTTTATGGTAC 

TTGATAGACCAGMTCTTTGACTTTTATTTAGTGAGATTTATCCATCTTTTTCTG 

CATCTTGGTATTATAGCATTAATATTCTCTTTTCT 

Sequence 2030 

CCACGCGTCCGGTTGGGACGGCACCAGGCGAGGTGTTGAGTTGGCTCGGCTCAAGGTTCT 

TCGGGGTGTGAGCTGGCATGAGGACCTGTTGGAAGTGGGATCCAGGCCTGGNGCAGNCTC 

CCAGCTGCCTCGATTTGTGCGTGTGAACACTCTCAAGACCTGCTCCGTTTATGTAGTTAT 

TTCAAGAGACAAGGTTTCTCCTATCAGGGTCGGGCTTCCAGGCTGGATGGAGTGCCCTGG 

CGCGATCTCGGCTCACCGCAACCTCTGCCTCCTGGGTTCAAGCGATTCTCCTGCTTCAGC 

CTTCTGAGCAGCTGGGATTATGAAGGGGT 

Sequence 2031 

NCCCCG CGTC CGGCTCMGGAGGTGATTAAAAGMTGTGTMTTTAGTATTTTTAAGATG 

TTTAGAGTTTTAAAMCTTACGTTGTTGCTGTCATTGTATTTAAATAGTNATAAAGAATA 

TAAACTGATATAAGTAA I 1 1 I I I I I AAATTCTTCAGATATAATGAAGATCTGGAACTTGA 

AGATGCCATTCATACAGCCATCTTAACCCTAAAGGAAAGCTTTGAAGGGCAAATGACAGA 

GGATMCATAGAAGTTGGAATCTGCAATGAAGCTGGATTTAGGAGGCTTACTCCAACTGA 

AGTTMGGATTACTTGGCTGCCATAGCATAACAATGAAGTGACTGAAAAATCCAGAATTT 

CAGATAATCTATCTACTTAMCATGTTTAMGTATGTTrTGTTTTGCAGACTTTTT^ 

ACTTATTTCTACATGGTTTAMTCGACTGTTTTTAAMTGACACTTATAAATCCTAATAA 

ACTGTTAAACCCAAAAAAAAAAAAAAAAAAAAA 

Sequence 2032 

CGACCNCGCGTCCGGCGTTCTACCCTTCCGGCCCGTGTTCTATCCGCCGCCTCCACCTTC 
CATNCGGCGCCGGCTTTCGGCGCGACGGTCGCCGCGTTCCATCGTCGCGCGGCCCTTCGG 
GCGCCCGAGCCCGCAATGTCGGGCCCCAACGGAGACCTGGGGATGCCGGTGGAGGCGGGA

GCGGAAGGCGAGGAGGACGGCTTCGGGGAAGCAGAATACGCTGCCATCAACTCCATGCTG 

GACCAGATCAACTCCTGTCTGGACCACCTGGAGGAGAAGAATGACCACCTCCACGCCCGC 

CTNCAGGAGCTGCTGGAGTCCMCCGGCAGACACGCCTGGAGTTCCAGCAGCAGCTCGGG 

GAGGCCCCCAGTGATGCCAGCCCCTAGGCTCCAAGAGCCCCCAACCGGGACCCAACCCTG 

CCTCCCTGGGCTAGGCTCTGGCCTGGGCACTCACCCCCTGGCTTAGACACCTTCTCAAGG 

GCTGGCCTTCAGGGACCCCTGGTGGGTCTGCTGCCTGGGCCACCCTTCTGCCTGGGCCTN 

CCCTTG 

Sequence 2033 

CGACCACGCGTCCGCTACCTCAAGGNCCTGGGCACCGAGCGGGCCTACAAATCCGCACTG 

GACTACACCAAACGMGTCTGGGGATTTTCATTGACCTCCANAAGAAAGAGAAGGAGGCG 

CATGCCTGGCTGCAAGCAGGGAAGATCTATTACATNTTGCGGCAGAGCGAGCTGGTGGAC 

CTCTACATTCAGGTGGCACAGAACGTGGCCCTGTACACAGGCNACCCCAACCTGGGTGCT 

GGAGCTGTTTGAGGCGGCNTGNAGACATCTTCTTCGACGGGGCCTGNGAGCGGGAGAAAG 

CTGTGTCCTTCTACCGGGACCG 

Sequence 2034 

GGGGGGGAGGGGNGNAAAAAAAAGNGANGGACAAANAAANAGAAAAAANAAGANANAAGA 

AAGNNANAAACANNNNAAAANACNNNGGNAANCGGAGAAAAGAAAAGAAANNNGNANANA 

GAANANNANGAGGGGNCAAGGNAGAAGAAAANNNGANAGAAGGGGNNAAAGGGGGNCGGG 

AGAAAAGAAGNAAAAAAAAAANAAAANNNGAAAAAANGGGAAAAATNNGGGGAAANNGGA 

AANNAAAGAAAAAAAAAANANCNGNGGNAAAAAAAAAAAGGANNNAAAAAANNGGGNAAA 

ANANGGNNAANGNANANANAAAAAGNGGGGGGGAGGGGGGAGGGNAAGGGNNGAAAAANN 

NGAAAGAAAAAAGAAAGAAANAAANGAANAANAGANGGAGANGNAAGGNAAAAANAANAA 

AAAGNNNNNAANAANANGAAAGGGNAAGNAAAANGGGNNNNAAAAGAAAAAAAAAAAGNG 

GNAAAAAAAAAAANNNNGGAAAAAAAAAANNNGGNNANNNANNCCAAAAAGAAAAAAAAA 

AAAAGGGNNNNAAGGGGAAAAAANAANAAANANAAAAAANGGGGAGGNGGCNGGNAAAAA 

AAAAANGNNAAAAAAAAAAAAAAANNNAAANAAAAANANANANNGNAANANNNANAAAAN 

AAAAAAAAAAAGAAANAAGNGGGNNGNAAAANACNNAANANNGANANGNANAAAANNGGG 

AANAAAANAAANNAAAAGNNANAANGAAAAAAA 

Sequence 2035 

CCCCCCGCGTCCGCGTTTTTATGTGTGTATGTACAAMCAAMTACCTTTTTGAAATTAC 

ATAAGTGATACATGCTTATTGTGAAAGAGTTGGATAATACAATATACTGTAAAGAAGATG 

MAACCACTCATAMTTCCMCCTGTAAATACTTTCGTGTTCATATTTTGGCATCTAATG 

TATC CTT TATGTATATTTA MTATATATTTTATTCAAGTATAGGATCATGTACCTCCCTG 

TTTTATAATTTTCTTTTTT AATTTT TCAGTGTATTGTG 

TACATCCCACAATGTTCATTTTTGTGCCTCATTATACCATGGAATAGTGACATCCTAGTA 

TGTTTMGACATTCCTTATTAATGMCAATTAGGCTACTTCCMTTTTTAATTATAAAGG 

ACATTTTGAAGGACTTCCTTGTACATATTATTCTATTC^ 

TTTCATGTAGTATATTTCCAGTTATTTAAATGAGAAAGTACATGGTTTATAACAAATGGT 

ATTTMCGTCCMGTCACTGCTATCTAMGGGGTAATTTTTMGGTATAAMTAATTTGG 

CTTATAMAMTCGTGGGAAAMTATNCTAGAMTATTTAANGATTAACTTCTAAATTGT 

AAATTGGCATATTTAATGATAGAATTCAAAAAAAAAAAAAAAAAAAAA 

Sequence 2036 

CGCGTCCGGMGAMTTGTGCACCCTCCCAAAACATACAMGTTTAAAAGTTTGGATCTT 
TTTCTCAGCAGGTATCAGTTGTAAATAATGAATTAGGGGCCAAAATGCAAAACGAAAAAT 
GAAGCAGCTACATGTAGTTAGTMTTTCTAGTTTGMCTGTMTTGAATATTGTGGCTTC 
ATATGTATTA TTTTAT ATTGTACTTTTTTCATTA 

MTTCCATAGTTTTTAATATCCCAGAAGTGAGACMTTTGAACAGTGTATTCTAGAAAAC 
MTACACTAACTGAACAGAAGTGMTGCTtATATATATTATGATAGCCTTAMCCTTTTT 
CCTCTAATGCCTTMCTGTCAMTAATTATMCCTTTTAAAGCATAGGACTATAGTCAGC 
ATGCTAGACTGAGAGGTAAACACTGATGCAATTAGAACAGGTACTGATGCTGTCAGTGTT 
TAACACTATGTTTAGCTGTGTTTATGCTATAAAAGTGCAATATTAGACACTAGCTAGTAC

TGCTGCCTCATGTAACTCCAAAGAAAACAGGATTTCATTAAGTGCATTGAATGTGGCTAT 

TTCTCT AAAGT TACTCATATTGNCCTTTGCTTGAATGCAATGCCCGTGCAGATTATGTGG 

CTGCTA I 1 1 1 I ATTTTCTGGGCATTACTTTNACACCNTAAANGGAGAAGCNAACATTTNC 

TTCTTCACTGACTGGCAATGGNCCTTTACTGCAATAGGAAGAAAA 

Sequence 2037 

CCCCGCGTCCGGCTGGGCTTAAGGGATCTTTCCCAGGTAGCTGGGACTGCAGGCATATGC 

CACTGTGCCCAGCTGCTCCCTTAGTCTTGACTATGT AI 1 1 1 II I II 1 1 1 I GGTTGTTTTA 

TTAGGATAGAGTCCTAAG AATGG CATTACTGGGTTAATAGGTATGAACATTAGATCTTTA 

ATACATACAGTCGMTTACTTTTCATAAAGCCATATAGCTTTTTATATTCCCACCGATAC 

TATCCCTATCACTMGTATTAACTTATTTTCATCTTTGCCAATATGATCATCCAAAGTGA 

GGCAGAGGTTGCAGTGAGCCAAGGTCACACCACTGTGCTCTAGCCTGGGTGCCAGAGTGA 

GACTGTGTGTCAAAAGAAAAAAAAAGGGGGGGGTGCCGGGTGCGGTGGCTCACGCCTGTT 

ATTCCAGCGCTTTGGGAGGCCGAGGTGGGCGGATCACCTGAGTTTGGGAGTTTGAGACCA 

GCCTGACCAACATGGAGAAACCCTGTCTCTACTAAAAATACAAAAT 

Sequence 2038 

GTCGACCCCGCGTCCCGGACGCGTGGGTCGGCCATGAACAAGTTTTCAAGTATCAGTTGA 

TrTATGATATAGGCTTATCCATTTGGTTATAAAATCATATGTTTATTACATAATCATTGA 

CAAATAGTTTCTGTATAATAACTGGCAGAGTAGCTCTAAAACATATGCAAGGAAATAAAT 

AAAGAA AAMC TTACMTAMGAGAGTAACTCATATTTTMCAGTTTTGTGAAAAAAATA 

G AAMT ATTTTATGTAGCTTATAGTACATATATTTTTTTACAACAGAAGAATCGCATTCT 

GATTT TCCAT ATGGATCATTTCCCTATGTTGCTAGACCAGTACACTGGCAACCTGGTCAT 

ACAGCTTTTCTTGTCAAGTTGAGGAAGGTCAAACCACAACTTAACTGAGTACTCCAGATG 

ACAGTAACTGACTTG AAGATG GAAAAATATCAAAATAGAACTTTATATTGAAAATCACTG 

CTTCCATAGATTGGCATTTTTAGCTATTACTATGACTTATATAACTTATACATATAATTT 

TGAAAATAACAACTAAAAG ATGTA TAACATAGCCAAAACTGCTTAAACCATCCATTTTGA 

CCACTTGTCTTGCAGNrrAGTTTTGACATTTTGTAGGTTMTGGATTCCAAATTGGTTTAA 

GTGGGCCATCTCATTCTTCACTTTCTGGNAANCCACTCCATAGATTTGGCTTTTCTTCAG 

GAAMTTAAGNTTCCTTTNCCTTTATTTGGATTGGANGGNCATTGGCCTACTGGAAAANA 

AATATGCCTTTTTAGGGTTAAAAAAAAAAAAA 

Sequence 2039 

GTCGACCNCGCATCCGACCACGCGTCCGCCAGACCTCACGTCAACCGGCTGCACCCCACT 

TTCCAGCCTGCGCCCCAGATCTGCAGCCTTCGCCCCTAGATACACCCGCCTGGTGATGAG 

GCGCTCCTCGCGTTCCTTCCGGGCTCCAGGTGTCCGTGAGCCTCCCTTCCGCCCTGGCCT * 

CCGGTCTCTGCCTTGCTCGTGCTTCTACCACCACCCTTCCCCTCCCAACCCGGTGGATCC 

TCTCGTCTCCCCCAGTCTCCAGTGCACCGGCTTTCCCTCGTCCTCTGCGCAGTCCATCTC 

AGCTCATCTCTCCAATTCAATGCCATCATCTCTCCTCACCATCTCTCGGTGCCCTGGAAT 

GT TTGCT GTCAAGATGTCCCCTGTGAAACCCACAAACGCTTGCGATTTGGCCTCCTTGTT 

TTATTTTGTGTAGTCCTACAACGTCTTGTTACTACCCCCTATTACAACACTTATAACTCA 

NAANGAAANAGNNNNNNNNN 

Sequence 2040 

CGTCCGCGGAGATCCGGCACACTGCGGACCGCTGGCGCGTGTCCCTGGATGTCAACCACT 

TCGCCCCGGACGAGCTGACGGTCAAGACCAAGGATGGCGTGGTGGAGATCACCNGCAAGC 

ACGAGGAGCGGCAGGACGAGCATGGCTACATCTCCCGGTGCTTCACGCGGAAATACACGC 

TGCCCCCCGGTGTGGACCCCACCCAAGTTTCCTCCTCCCTGTCCCCTGAGGGCACACTGA 

CCGTGGAGGCCCCCATGCCCAAGCTAGCCACGCAGTCCAACGAGATCACCATCCCAGTCA 

CCTTCGAGTCGCGGGCCCAGCTTGGGGGCCCAGAAGCTGCAAAATCCGATGAGACTGCCG 

CCAAGTAAAGCCCCTAGCTTGAGTCGACCCACGCGTCCGATTTAAATATTTGTCCCATTG 

TTTGTGATTAGGATGTAAGCTTTGTGGAATGTAATTAACCCTGCTTTACGAAGTCACCAT 

ATTATAATAGGAAAAACACTGCCTAGGAGGCAAAGAGATCTGAATTCCAGTTCTGATGCT 

GCCACTGTGTMGGAAGTAGTTTTATAAACCATGGGCAAATCATCTTGAGCTTTCTCATC 
TGTAAAGTTAGGGG
Sequence 2041 

TCGACCNCGCGTCCGAAAAACCAAAACCTGANTGAGATCTTGGAACCGNTGTGCGCCGGC 

CGNNCCTCTCCCANGGGACCANCCANCCCCGCGCGGTGGCCGACTGNATAGGCGGGACTG 

CGCTTCGAGGCTTAAGGACGNCAGATCGGAGGCATCGTGTGTTGTCTGTGCGGAGAAGCC 

AAAANNGTGATTACGTTTATTTGCMGACCGTTCATGTTGTTTTCAGTTCATGGTATGAT 

TAAMCCCGATCCTTTGTTACCATGCCCTTAGGTACGAMAMTMTTGTTTNGATATTT 

GGCAGTCACCCAAAAATATC CAAAA AGCCATGAAACAGTANAGGTAAACAAGTANGAAGT 

GAAAN TAATNTTCGTCCTTTGTTTTCTTTCTGGAGGTGCTCAAAACACCCTCTCAAACCA 

TTTTTCTCAGCATAGAACCAAGTGTGGNCNGGNTANCAGCTAATATTTACNAGGGNAGAA 

ACGMCCCTNGCGATATTTAGTCACTTTGTTNCX)NGGGANCACANAAAATNTTGAACAAA 

CACATGAGAACTGTCACCGATCTCTGTATTGATNACCANGGATACCCGTGAATTTTATGT 

MTATTMTCTNNGGNAGGCANGAIvn"mTTNNTAGGTAmGCCTTTTCCMGGT^ 

TTCCNTACCAAAGGAAMNGGTTATTTTAAAAACTTTTACCANAANAAAGGGGATGNCTT 

Al I I I 1 1 GGTCCT 

Sequence 2042 

NGGACTTGGTTTGMCGCGTTTTCCCCAMGTTTATGTGTTGGAAACTTGACCCCCAATG 

CAGCAGTGTTGGAAGGTGCCTACTAGGTGGTGTCTGGGTCATGGGGGTATGACCCTCATG 

GATGGATAAATGCCATGACTGAGGCGGTGGGCTCCTTATAAAAGAATGAGTTTGGGTGAA 

ACCTCGTCTCTACTAAAAATACAAAAATTAGCTGGGTGTGGTGGCACATACCTGTAATCC 

CAGCTACTCGGGAGGCTGAGGCAGAATGGCTTGAACCAGGGAGTCGGAGGTTGCGGTGAG 

CAGAGATCGCACCACTGCACTCCAGCCTGATGACGGAGCAAGACTCCGTCTCAAAAAAAA 

AAAAAAAAAAAAAA 

Sequence 2043 

GAGAAGCCTGGGGGTCCTGGCTGAACTGGGCTGGGTGAAGGGGGCCCCCTGACCCCCTTG 

GGGTCCGGGCTGGGCTGGGTGAGGGGCGGTTTCCGACCCCCAGCCAGGTTCCCAGGCAGG 

ATGAGCTGGGGTTGGGGTGGCTAGGCCGTGGGCCTTGGGAGCTGGGCAGTCTGGGCTGGG 

CTGGGCTGGGCAGGGCGCCACATGGAAGCTGGAGGAGCAACGGGAGCGCTGGGCGTGGGG 

TGCAMTTGCCCAGTGCCTTCTGTTTCCCAGGCAGCTCTGTGGCCATGGATATGTTCCAG 

AAGGTAGAGAAGATCGGAGAGGGCACCTATGGGGTGGTGTACAAGGCCAAGAACAGGGAG 

ACAGGGCAGCTGGTGGCCCTGAAGAAGATCAGACTGGATTTGGAGATGGAGGGGGGTCCC 

AAGCACTGCCATCAGGGAGATCTCGCTGCTCAAGGAACTGAAGCACCCCAACA 

Sequence 2044 

ACACTC^VTCAATTAG GTTTT ATTTTT"ATTXCTTT"CX5TCTACCCCCAG AAACAAGCCTGTT 

AAi ii i i i i iCCTTCTCCTCTGGCGACTGTGTGATGAATCCTTTCTTGCGTGATCAGGTT 

GCGGATAGACTTGTAAGGGTGTTTGCTGCATACAGTGTAAGCATTGTGACCGCCAATAAA 

CTTCAATGGTTTCTACTGAAAAAAAAAAAAAAAAAAAAAGGACGCGTCTACTTCCCACTG 

GGTCCCTCCCACAACACATGGAAATTCAAGATGAGATCTGAGTGGGGACACAGCCAAACC 

AAATCAAAAGGATATACAAAATAACCAGAAAACAATGAACAAAATGACAGGAATAAGTTC 

TCACCTATCAATAATAACTTTGAATATGTGTTAAATTACCTACCTAAAAGATAGAGACAG 

GCTTAATGGATAAAAAATGACTCAACAACCGTCTACAAGAAACTCACTTCACTTGTAAAG 

ACACACACAGACTGAAAGTGAAGGGATTGAA 

Sequence 2045 

GCCNCGCGTCCGTGAGAATACACAAGGGGGCACGCTTCCAGTAGATGTGTTGGGGAAGGA 

GGAGGGCAGAGGGGACAGGGGACAGGATTCAGCTTTGTGGTGGGTCCTGAGGGTTCCTAC 

CAGGGGTAGCCAGGATCTGGGAAACAGATCAGCGACTCTAGTCTGAAGTGGCTGCCTGGT 

TCGGGGGCTGCCTTCAGCAAGATTCAGGCAGGAGAGACGGAAATAGCCACCTTCCAGGCG 

TGAGTCCTGGAGATAAAAATGGATTTTAACCTAGGACTGCCGGGAGCTGGCCCTCCGCGG 

CTGCTCAGACTAGGGCTGTGTGTGCTGGCTCTCGCCTGTTTCCGGTGTCTAACTGGCTTG 

TTTCTCTTTATGGCTTGGCTTCATTCCGACCTGGGGTGGGGCCACATNCAACCCACTGCC 

CACTGGCTGTCCGTCTGGCCTGCCCCCGCGGTTTCAACCACANTGGTGAAACAANCCTTG 
CAAGATNTACAACTCGCAACACCGGGTCAAGCAATCAGCTGCATTCCGGACCGGTGTGNA

AGACCGAAGGGG 

Sequence 2046 

CCCCCCGAATATCTTATCCTTAACATTAMTTGMTTTTTTGCAAATGATCAAAAGGTCA 

TTCC GAGTAMTTCTGTTGTATAGTGCAGATGATCMGCTGAGTATTTGCCATGTTTTTA 

TTTTAGAAAAGAGATGTTGCTATAACATAAGTAAATACGATTCTCGTATGTGGCAGATAA 

ATTTACTTGTMTCTGCTCTAGAGTGAMTTATTTTTTACATATAAGCATTGTCATCAT^ 

CTMGG ATTATTGMTMTGMTATAAMTGTTCTTGTGTATTTGTGTATGTGTATATAA 

I GAAAGTTTCTTTATCCTATTGACCCTTCTCATAAACAGNAGCATATATATATTA 

TATGTAGTAGAATTTATATAGGMCATTGTCTTTTTCCCAGTAATGCTGATTCTAAACTA 

GTTATGTCAATTTCATGTMCATGACATTNAGAATAGTGGGGTGCTAAATATATTTAGAA 

ATGATTTCCAAMTTGNTGTATTTCTMCATAGAANGATATTTGTCATTTTAAAATAATG 

TAAAGAAAAAATGC 

Sequence 2047 

GCACCCCTCCCTGTTGiACACAGCCTGGATCCAGAGTTCAGCAGACCTTGAGACAATGAAA 

ACAAACTTAGTAATMTCATTTTTCMTCATTGCAGTMTTATTGATTTGGACAAAAATC 

MTTGACGTCAAMCCTTA AAGT GACGTTTCTCTGCCTATGGAGTGGGTCATTCTTTTAT 

TCCTTTAGTTTCATMTAAATTTTCTTTTACTTAAAMAACTTATAGTTTGATGM 

GAGATAT ATACC TCATCTCAAAGAATCTTCACACACACACTTATTAATTACAAAAGGAAA 

ATCAGTAATTTTGCAGTGGAGACATATGGCCAACTCCACCTTACCCAAGTGGCTGAAAGT 

CACTGCACCAGTAATGG 

Sequence 2048 

AGAAAAGCCNAGCCAACAGCTCTTAAAATCAGAAAAAACAANGGGAGTCCTTCCTTGTCT 

CNTCTGTGNTCNCNGGCCTTGTCTCTGAGACTNTCTGTGCCCNNAANCNNTNTNNTNGCT 

NTNANCTGATTCTANTTTTGNTNCCCATGGMTCTGTCCTMGACTGGGGNTTTTTGNCA 

NATGACAGNCTTGCCNGNACNCAMTATCATAACAGCATRNNNANCGANTTTTGCNGAT 

CAAGTMNATANTTGCNTGACMTGACAGCTTTTTAACTCTTTCAAAGTCACX)TAAAAGC 

TATTATTGCAGGAGGATTTANGAAGTCACATTCATTNAACACCCAAGTGCTATGGGTGAA 

NNATTCATGATAGCTTGGCCCACMGGTCATGMTTGAGGAGGGAATCTTGCTTTTCAAA 

AMNCMTGGMTGNTCCCNCCACTGAAAAAGGGNNATACGTTTTAATATTTTGGACCCT 

TCANAAAGGNTAANGAAAAAAACCCANGGTTCTTTCNAAAAAGTTAGNGAATAAGGGGGA 

ACTTMNTTTCATGGAANACAAGCCCATTNTTTNAAAAAAAAAAAAAA 

Sequence 2049 

CNTACGAACGTCTGAAACGTGGAGGAACCTTCAGTTCTGGGAACTCCCTGCCCCTTTCCC 

GGAAAATTCATGAGTAATCCACCTGTTTAGCATATAATCAAGAAGTAACCATAGGCATAG 

TATA TCMGCAGCCCACACTGCTGCTTTGCCTATGGGGTAGCCACTTTTATTCCTTTACT 

TTTTATTAAACTTGCTTTCACTTAAAAAAAAAAAAAAAAAAAAA 

Sequence 2050 

C GCNTCCG AAATCCAATCCTAATGAAAGAGATTGATAAGTGTGACTACAAAAGGTTTAAA 

AC i M i i ICATAGCAAATTATCTCAGAAACTAAATTAAAAGACAAGGGAGACCAGGTGCA 

GTGGCTCACGCTGTAATCCCAGCACTTTGGGAGGCCGAGGGAGGTGCATTGTTCTAGCCC 

AGGAGTTCGAGACCAGCCTGGGCAACATGGTGAAGCCCTGTCTCTACCCAAAATACAAAA 

ATTAGCCAGGCGTGGTGGCTTATGCCTGCAGTCCCAGCTACTTGGGAGGCTGAGGTAAGA 

GGATGGCTTGAGCCCAGGAAATCAAGGGTGCAGTGAGCTGCNATTATGATTGTGCCACTG 

CACTCTAGCCTGCATGTCCAAGTGAATCCTACATNAAAATAAAAAGTNCAAAAANANAAA 

AAATGTGCCGGCCCGCTAGACTAGTNT 

Sequence 2051 

CCACGCCTCCGGAAATGCCTCTCTCCAGAGTCGGACCCTCACCTCCTTCCTGGAACTGCC 
TTTGGCCCCAGAACCATGAGACAATCCCCACCCTGAGAAGCTNCGATCACTGGGAGGAGA 
GAGAAAGCCTCCAGCTTTGGGATTCAGGCTTCAGMGTTTTTAGCAGCCTTTGCTCATTG 
GAGAGGTGGGGAGGATAAGTCTATAAGGAATCCTATTTCCCCAGCTCTCCCACAGAGAGG 
ACAAMGAAGTCTTCACACCGTTGTGGMCTTTCCTGCAACTTCTGGATGCAGACAAGCC
TCAGAGCAGACTGTTCTGGCTCCAGNGAATATCGGCTGCCAAGCTGTGAGCATCCAGGGA 
TCCNCGTCTGCCTGGCTTTCCTGAAAGTCAGAAGGCGCCTTGGTCATACTGTGTGGGGTG 
NGTNGGATNTrNAGTTNTGNTCTCTTTTCTTTTTCCTTTTNTTM 

GCCAACACCCCTGACAGCMTTGTGCNGCACTTGGCTTAATTCACACCCTATGAATAATT 

TTTNATATTTCMCTTGGAAMGGTGGTTAAGAMCTTTT 

Sequence 2052 

GATGGACTGTGTCATNCAGGACGGCCCTGCTGCATTGGCACCAAGGGCAGGTGTGAGATC 

ACCTCCCGGGAGTACTGTGACTTCATGAGGGGCTACTTCCATGAGGAGGCCACGCTCTGC 

TCTCAGGTAGGTCTGCAGAGTGTCCGTCGTTCCCCTCCCCCAGCTACTGTGATGCTGATA 

TGCTGCTCTGCGCAGGTGCACTGCATGGATGATGTGTGTGGGCTCCTGCCTTTTCTCAAC 

CCCGAGGTGCCTGACCAGTTCTACCGCCTGTGGCTATCCCTCTTCCTGCACGCCGGGATC 

TTGCACTGCCTGGTGTCCATCTGCTTCCAGATGACTGTCCTGCGGGACCTGGAGAAGCTG 

GCAGGCTGGCACCGCATAGCCATCATCTACCTGCTGAGTGGTGTCACCGGCAACCTGGCC 

AGTGCCATCTTCCTGCCATACCGAG 

Sequence 2053 

NCGCNTCCGGGCAGAGCCCCCGGAGCCTGGCCAGCCCTTCCGGCAGCTCCAAAGCCACAG 

GCAAGCCCCGAGGCTGGGATGGCCGGCCCAGGAGGGAGGAGGACGACGTACCTCCCGAGG 

AGAAGAGGCTGCGGCTGGGGCTGGAAGGGGGAAGCGCACAGCCCGAGGACTGCNAAGGAC 

GGGGAGGACGCGCCGCGGCCAGGCAGGGAGGAGACCGGCACCCAGACAGGTGGCGACGGC 

AGAGGAACACAGTGGCTCACGCCTGTMTCCCAGCACTTTGGCAGGTCGAGGCTGGCGGA 

TCGCCTGGGGTCAGGAGTTCGAGACCAGGCTGGCCAACATGGCGAAACTGTCTCTGCTAA 

AAATACGGAAGTTGGCTGGGAGTGATGGCACGCACCTGTAATCCCAGCTGCTTGGGAAGC 

TGAGGCAGGAGAATCGTTTGAAGCGGGGAAAGCGAGGTTTGCAGTCAGCTTGAGATCACA 

CCACTGCACTTCANCCACCTGGGGTGACATGAGCGACACTTCTGTTTTCAAAAATAAACC 

GAA 

Sequence 2054 

CTGTGTAGGACAGACTCTCTTTGACTCCCTAGGATTTACCCCAGTGCCTAGCATGTTTCA 

CAGCTTAGAGGAAMCMCATTTGTTGACTGACTTTTGATCTCX)ATT1TTTGGTGAGATG 

CAGTGGCTTACACCTGTMTCCCAGCACTTTGGGAGGCTGAAGCGGGCGGATTACTTGAG 

GCTAGGAATTCAAGATCAGCCTGGACAACATGGCAAAAAATACCAAAAATAAAAAAAAAT 

AAATAAATAAAAAATTTAGCCAGACATGGTGGCAGGCACCTGTGGTCCCAGCTACTTGGG 

AAGCCAAATCGCTTAAACCTATGAGGTGGGAGGTTGCAGTGAGCCAAGATTGCACCACTG 

CACTCCAGCCTTGGTGACAGAGTGAGACCCTGCCTCAAAAAAAAAAAAAAAAAAAA 

Sequence 2055 

TCGACCCCGCGTCCGGGMTTTGGGGTGGAATGTGATGAGATTAAAATGTAGCTTTGGTA 

T AACT TCATGTGTATTTCAAAATATACTGAACGTCAACATGATTTGAATAAAGAAAATGT 

ATTTTCTACTTGMCCACATAACACTGTTATTTAAMCAGTTTTCTGCAGTCTAAAAAAA 

AAAAAAAAAANNAACAANNAATN 

Sequence 2056 

CGTCCGGCAGAATGGCTCCCGCAAAGAAGGGTGGCGAGAAGAAAANGGGCCGTTCTGCCA 

TCAACGAAGTGGTAACCCGAGANTNCACCATCAACATTCACAAGCGCATCCATGGAGTGG 

GCTTCAAGAAGCCGTGCACCTCGGGCACTCAAAGAGATTCGGAAATTTGCCATGAAGGAG 

ATGGGAACTCCAGATGTGCGCATTGACACCNAGGCTCAACAAAGCTGTCTGGGCCAAAGG 

AATAAAGGAATGTGCCATTCCCGAATTCCGTGTGCGGGTTGTCCAGANAACCGTAAATGA 

GGGATGGAAGATTCACCAAMTTMGCTATATNCCTTTGGNTNCCCTATGTACCTGGNTA 

CCACTTTNNAAAAANTTTACCAGACCAGGTCCAATGGNGGGATGGAGAACCTAAATCGNT 

TG^CGGCCGGANTCAAAATTAAGGTTNTTAAAATTTGCCAAAAAAAAAAAAAAAAA 

Sequence 2057 

CGCNTCCGGAGAGAGCCCAGGGATGCCTTATGGTCAGAACAAATTTATAGACAACAAAAG 
GGAAGTGACCGTGCAGAAATCAGAAGTGAGGTACAGAAACAGCTGGACTGATTACAGCTC 
MCATTTGCCTTTTTTGAACAAATCTGAACACTCAGCAGTGTATGAGTGGTTGACCGTAT

NGGCTGCTGTGATTGGCCAGACCTCAGCTATTGTTACAGGCACATACTCTTAAGTCAGGT 

TTTCMTCCTATCTGACTATAMGTTAGGTTACAGTTTGTCCTCATGGACTCAAATTTAG 

MGTATGGCGTCCTTCTCAGGCCATATTTAGTTCAGTTTAACAAGTGCATATGGCTTCTG 

ACAAAGGTGTGGCCCCTTTAGGACTCCAAAGACGCTGTCACTTACATGGTATTCAGGGAA 

GACACAGAGGATCTGTGAGCAGCCTGCAGCCAMGCT7TTGAGATCATATTGAGATTTTT 

TTGTANTATANGAGGAGGGGTTCTGGCTC 

Sequence 2058 

CGGAAGCATCGACCTGCGAGCTCACAGAGCTGGGAGCAGAGCACCCACGCACACCCCGAA 

TGGCTATGGAAGCTGCAGGGCGCCAGGGACACTGGGAGTCCCTGCTCTCATGGCAAAGCA 

GGGACGGGGGACTTAAAAGCCACCAACAGGAAAATCGGGGAAAAAAGGGAAGATGGTGGT 

AACAGTTGGACACTATTTCTTGGCAAAACCGTGGAAAAACACGTTCTACACCAGCAGGTG 

GCAAATTGTGGCCGCCATCTGTGTTTGCAAATAAAGTTTA 

Sequence 2059 

CCCCTACAATGAGCTGTCCCGCCTCAGTGGCCTGCGAACCCTCAACCTCCACAACAACCT 

CATCTCCTCCGAAGGCCTGCCTGACGAGGCCTTCGAGTCCCTCACCCAGCTGCAGCACCT 

CTGCGNGGCTCACAACAAGCTCTCAGTGGCCCCTCAGTTACTGCCCCCGTCCCTCCGNGT 

CGCGGATCTGGCTGNCAACCAAGT 

Sequence 2060 

ACCCACGCGTCCGCCGATTTTCCAGGTGCCGTCTGTCACCCCTTTCTTTGACTNGGAAAG 

GGAACTCCCTGACCCCTTGCACTTNCCGAGTGAGGCAATGCCTCGCCCTGCTTCAGCTCG 

CACATGGTGCGCGCACCCACTGACCTGCGCCCACTGTCTGGCACTCCGTAGTGAGATGAA 

CCCGNTACCTCAGATGGAAATGCAGAAATCACCCATCTTCTGCGTCACTCAAGCTGGGAG 

CTGTAGACCGGAGCTGTTCCTATTCGGCCATCTTGGCTCCTCCGCTCTATTTGCAGTTCT 

TAAAGGGCTATTGTACTCTCTGGGATTGTACGAACTTGGACTGNATTGGAACTGCAACAG 

AAAATCTTCCAGGCAAGTGCCA 

Sequence 2061 

CCCTTTCGAGCGGCCCGCCCGGGCAGGTACTTTCATTGAGTGTACGAGGGAAAAAAGCAT 

GTATTGGGCCACCGGAAGACAAGCTAATAAATAGGCTGGAAGTAATATTCTACCAGCAGG 

AACTCAACAGCTCCAGTTAAATGCTTTGATATAGNGGCTCCTTTGCAGAGCCAAAACAAG 

ATTTATTAAATTTCCTTCAAACTGTTTATCTTTAAAACAAATATMGGTT^ 

TGCTGMGCAAATGTGMTGCCAAAGACTATGTTTTGCAGTTTTGCTTTCCTCCCAATAA 

ATATTMTGTATGTMTTCTAGAGGGTAAAMTGTAMTAGGTTTGGACAATATTTGCAC 

CCTTGTTTGTGTTATGAAAAAMTTTTTCCMGGCGAGCTAGAGAGMAGATGTTTGGCA 

TGCCAMTTMCTTGCATGTTTGTTAAAMAACMACACATGTTTTTGAAAAGAAACCAG 

ATCTGMCGTGTATTTGTTGAAGTTTTGCAAAA 

Sequence 2062 

CCCTTT CGAGCGNCCGCCCGGGCAGGNAC I I I I I I I I I 1 1 1 1 1 1 I I I I I 1 1 I I » I I I 111 
M< II M l M I MMIll lI llll tl lll MlHt ll M MM MM I HHMI I UN 
TTTTAAMAAAAAANNAAAANNNTNNNNTTTTTT^ 

TTNTTTNGGGNTTNNTNNAAAAMNNNANNNAAANCCCNNCCNGNNNTTTTTAGGGNAAA 
AAMANTTTTNCNGGGNTNAAAAAAMTTNm 

ANNAMNNANCCNN NNTTTTTT NNNNNGGGGCNCCCCCTTTTTTTNCCCNTGGNGGGGNG 
GGGNAAAAAGG GGNTTTTTT TNGGANCCGAAAAAAMCGGGAMNTTATCCCTTTTTTG^ 
GGGGCCNTAAAC I I I k I I I I NNCCNCNNTTISTTTTTTTAAAMNCCCCCNNCCCTTTNTCC 
CTGNTGGNNCCCTTTTTGGCCCGGGGAAACCCNN l I I I I M I II H I I I I I 
Sequence 2063 

AAGGGAAAATGTCACGTANACTAGATCAGGGAACAAAATCCTCTCCTTGTGGAAATATCC 
NATGCAGNNNGNTGATACAACTTANTATCTTATTGCCTMNAAAAAAATTTCTTATCATT 
GTTTCANAAAAGCAAMTCATGGAAAATTTTTGTTGTCCAGGCAAATAAAAGGTCATTNT 
AATTTAGCTGCAATTTCAGTGTTCCTCACTAGGTGGCATTTAAATGTCCCCTGATGTCAT 






TAAG C ACCATC CAAAAAGTCTGCTTCATMTCTATTTTCAAGACTTGGTGATTCTGANAG 
TTTTGGTTT7TGNGACTTTGTNTCTCANGMAAAAMM 

TATMTTCAATTTAMTATGNTGNGGCGTCTCATCCAGGATNGGATAGGTTGTCTTCTAT 

TTTCCATTTTACCTATTTAC 

Sequence 2064 

CCCTTTCGAGCGNNCGCCCNNNCAGGTACAGACTTAGAMTTATCTMAGATTTCATCTT 

TTTACCTCATATTTCTTAGGAATTTAATGGTTATATGTTGTC 1 I I I I 1 1 CCTATGTCTTT 

TGGCTCAAGCMCATGTATATCAGTGTTGACTTTTTCTTTCTTAGATCTAGTTTAAAAAA 

AAAACCACATMCMTTCTTTGAAGAAAGGAAGGGATTAMTMTTTTTTTCCCTAACAC 

TTTCTTGAAGGTCAGGGGCTTTATCTATGAAAAAGTAGTAAATAAGTTCTTTGTAACCTG 

TGTGMGCAGCAGCCAGCCTTAAAGTAGTCCATTCTTGCTAATGGTTAGAACAGTGAATA 

CTAAGTGGMTTGTTTGGGCTGCTTTTMGTTTCTCTTMTCAAAATTACTAAGATGATA 

GAATTCAAGMCTT GGTAC ATGTATTACTTGGTGGTATCGATAATCATTTAAAAGTAAAA 

GACTCTGTCATGCATTTTTCCCCATTTC 1 1 1 1 1 1 1 1 I CCCTGTCTCCGGGGCCAACCCAA 

GTGGGTCTTCATTTT 

Sequence 2065 

CCCTTAGCGTGGTCGCGGCCGAGGTACNCGGNGTCCAAGATGGCGGATGAAGCCACGCGA 

CGTGTTGTGTCTGAGATCCCGGTGCTGAANACTAACGCCGGACCCCGAGATCGTGAGTTG 

TGGGTGCAGCGACTGAAGGAGGAATATCAGTCCCTTATCCGGTATGTGGAGAACAACAAG 

AATGCTGACAACGATTGGTTCCGACTGGAGTCCAACAAGGAAGGAACTCGGTGGTTTGGA 

AAATAACATCTGGGCCTGCTGGAGAAAAAGAAAGATTACAAACTTCGTTGCATATGACTA 

CCGTAAAAAACAAGAATACCTCAAAGCTCTTCGGAAGAAGGCTCTTGAAAAAAATCCAGA 

TGAATTCTACTACAAAATGAC TCGN GTTAAACTCCAGGATGGAGTACTTTA A i M I I I I I 

TNTTNATAI^TITNCCAGGAACATTTTCTMTTATGTTATATAAATGGGTATGTGATATGTG 

NGCTATTTGTGTGCTAATGTCCTAAGTGAAGTTCTGCAGACCATCTGGGTCAAAGTGCAT 

TTCGCATGATCCAAAANATGAAGAACCACTTGTTTGTCACGGGAGACNAGGGAAAAAAAA 

A 

Sequence 2066 

CTTAGCGAGGTCACGNNCNANGAACGCGGGGNGIVfrCAGGAAGATNTCTGAAGAGTGCAGC 
NGCCTGAACCGAGCCCTGCCNAACAGCTGACAATTGCACTGCAACCATGAGTGA 
Sequence 2067 

CATGCAGMNTCCTCGCTGGAGTTTCATAAGGCCAATGAGTGCCAGGAGCGCCCTGTTGA 

GTGTAAGTTCTGCAAACTGGACATGCAGCTCANCAAGCTGGAGCTCCACGAGTCCTACTG 

TGGCAGCCGGACAGAGCTCTGCCAAGGCTGTGGCCAGTTCATCATGCACCGCATGCTCGC 

CCAGCACAGAGATGTCTGTCGGAGTGAACAGGCCCNGCTCGGGAAAGGGGAAAGAATTTC 

ANGCTCCTGAAAGGGAAATCTACTGTCATTATTGCAACCAAATGATTCCAGAAAATAAGT 

ATTTCCACCATATGGGATTCCCAGACCATGANGCCAAAGTAATTCCTAATCCCCACACAC 

AGGMTGGCATGGGACCTGNGATTTTGAGTTTTCAAGGGGCCGTAAGNTTrTNATTCTTC 

ACACCTNCAAATTACCCGACCCCAAAAAAAAAANAAAAAAAAA 

Sequence 2068 

CCCTTAGCGTGGTCGCGGCCGAGGTACTTNTCCGATTTCAAGAACTGATGAAATTAGAAA 

AAACACCTACAGAACATTGGATAGCCTGGAGCAGACCATTAAACAGCTCGAAAATACAAT 

CAGTGAAATGAGTCCCAAAGCCCTAGNTGATACCTNATGTTCTTCCAACAGAGATTCTGN 

AGCAAGTTCATCCCACATAGCCCAAGAGGCCTCTCCCCGACCCTTGCTAGTTNCGGATGA 

AGGTNCCACTGCCCTAGAGCCCCCTACGTCGATACCTTCAGCTTCACGTAAGGGCTCCAG 

CGGGGCCCCACAGACGAGCAGGATGCCTGTCCCCATGAGTGCCAAGAACAGACCCGGAAC 

CCTGGACAAACCCGGCAAGCAGTCCAAACTGCAAGAACCNCGCCAATATCGNCAGGGCTA 

ATGGAA^NCTMGAMTCTTGGNNGGGGACTNTTAAAGCCTACTTTCCCCTACTTACCT 

GCTTCTAAAGATTCCAAGGCCNTTCTTCCAAACTTTTGGG 

Sequence 2069 

CCCTTTCGAGCGGCCGCCCGGGCAGGTTCATGGATNNGAGCAGCTTCACCAACCCCTGCA
AAGTGACTCTGAAGAAGACGACAAGCCCTGCTCCAGTCACACCCGGAAGCTGACTGGTCC

ACGCACAGCTGMGCATGAGGAAACTCATCGCGGGACTAATTTTCCTTAAMTTTANACT 

TGCACAGTAAGGACTTCAACTGACCTTCCTNAGACTGAGAACTGTTTCCAGTATATACAT 

CAAGTCACTGAGAGAACATCACCACCCTGAAGCCAGAGACTAACACTGCAGGACTCAGCA 

GGACTATTTAAGAAACAACTGAGGCATCAGACCAACTTTCCCCACAAGTCCTCGGATCTT 

TCCTGCCATGCTGATGCCATATATTCCAACGTGATCAACCTGGCTCCCCAGAAGGAGGAC 

GACTTTGCTGTCTACACCAACATGCCCCCTTTTCATCACCCCAAGAGGACATTGCCAGAC 

CAAGTGGGAATATGTCTTCCATTGTATTTCCAACTGATGGGGAAAGCCTANATGAAGATG 

CTCMGMGNGGGGGGTCMGACCCTGACCCCAGCNTGMTCTTTGGCATTACCCTTTGC 

TTTAAATTTAATGTGGTG G NG NTTT AAAAAAAAAAAAAA 

Sequence 2070 

CCCTTTCGAGCGGCCGCCCGGGCAGGTA C I 1 I I I I 1 H I I 1 1 1 1 1 I I 1 1 1 I I CNGCGGGG 
NGNNCTACTTNANMTCTTTGGCNGGTTTNNCNGTTTTNGGTTTTCTNANCNCTTGGNCT 
GGTN CATTT GGTTTNGMNMTCNGTTNCTTCNGATT^ 
MNGGTTTTNGAAAN CTTTNN CCTTOT 

MTTGGGGTTNATCATTTTTTCTAGGCCNGMNGTTTNGNNCNTNCCTNTTTCANAATCT 

ANATTAATAC 

Sequence 2071 

CCCTTTCGAGCGGCCGCCCGGGCAGGTNCNGGTTANCAGACCCACAACACGAAGCTCCTG 
CCTTTTAAGACTACAAAGAGGCAGCTCAAAATTAGACTGCACAGGTAAGCGAGGAACTGC 
AGTCTAAGCCTGGACTCTGCCTTCTGCCCTCCCCCCGCGTACTCAAGCAATAAAAT 
Sequence 2072 

CCCTTTCGAGCGGCCGCCCGGGCAGGTNCTN I I I 1 1 I i I 1 1 INTTTNGCNGAGTGAGCTA 

CTNTAGGATCTTNTGCTGGTTNTACAGTTTTTGGTTTTCTTAGCACTTTG^ 

TNNGNTTNGMGAATCTGNTTCTTCTGATTTTTTAACANAAGGTTTTTTGACAAATC 

TTTGAAMCT1TTACCTTCTTCTGTTGMGTTGTTGG 

GTTTATCATTTTTTCTAGGTCTGAAAGTTTTGCGCATTCCTCTTTCAGM 

TAACTAMAATCTTAAMCTTGNTTTTTGAGAATATTTACTCTTCTGACAAATTCTTCGA 

TTAAACTTTCCTCAGATTTTAAGTTAGCTAAAACAAAANTTCCTTGTTTGNGTT^ 

TNTCGAA TGCT NACTCTGTATCTTTCMGTTITNCMTTTTTTCGATGTCTTAGCATCAA 

AAACAGATTTTAAAACGTCTTCAAATTACTTTTAAATCTGTTCCTGCAGCTAAAACNGTC 

Sequence 2073 

CCCTTAGCGTGGTCNCGGCCGAGGTACGTGCTTATACAAGATGTCAATTATGTGGTCGTC 

CACATGCTGTATTACGTAMTTTAAAATTTGTAGMTTTGCTTCCGTGAACTAGCTCACA 

AAGGACAAATACCAGGTATTAAGAAAGCGAGTTGATAATATGATAATCACAGATCCAATA 

GCAGATATGATCACAAGAATCAAAAATGCCTTCACACGTAAACACAAAAATGTTATTATT 

CCTCATTCTAAGAAAAAAGAAAGAATCTTACAAATCTTCTTAGATGAAGGATATATAAAA 

GGATTTACTGTATCTGGTGMGTTAAMMGAMTTAATGTTGAGCTTAAATACAAAGGA 

AATACAAGTTCAATTGNTGGAATTAAAAGAGTTTCCAAGC 

Sequence 2074 

CCCTTTCGAGCGGNCGCCCGGGCAGGTGGGCAGGTACTTCAGCAAGTCCTCTTTCTCCTC 

AGCAGTAAGCTCAGCCGGCAGGTGCCTGACCAGAAGGGTTCGGTCGCCCCGAGGCGGGGA 

AAGCGAGGAGGAGCTCGTGCATCCCCTTGATATCGCAAGCGGCTGCTCGGGAGCTGCCAT 

TTTCCTTGGAGAAGCAAAAACAGAAATCGTGGGAAGAAGTCTCAGTCAAAATCGCGGCAT 

CMCACMGCTGGGAGAMTATTTTTTCCGCCTCGCGCTAAGGATTCTGGAAACCAGGAA 

ATACCGAGAAAGAAAGTCACCTTCTCGCGAGAACTGCGCCACCGAAAAGCGGCAACCCTT 

CGAAGACTCTTCGGGGAAGGGCGCGGTGCTAATGATTTAAATTCCAAGGGGTCTNCGGAA 

AGACTTACAAAGCCAAAAATTGGCCCAAAAGATGTGCGANGGTTAACACAAGTTGTCAAT 

CAAAGAAAGGAAACAGGAACCCCAACCCCTTTAAGGA 

Sequence 2075

CATTMTATACTTTGCACCAGCAAAAGCGATTTCCAACATATGTGTTTTGGAGGTAATTA

AGTMCTCTGTATAAAAATAMTGCACTTTTCCCTCCTTTCCCCAGTGAATGGAAAACTT 

CCATACTTTCAAAATAATAATAAAAMATAATTTTTAAGAGCAACAGCCXJTCAACTCTTT 

GCTGGTGCCTGCCATACTGCCTTTCTTCACTCCATTCTTAGCTCTGCTAGTTTCTTCTTG 

TATGTCATGATAAAAAGGGAATGTGGGTGTGTAACTTTTGTGTATGTCCCGTTTCCAAAT 

TTCCCTCTCCAAAAAGCCAACCAAATAAACAAACAAACAAACGAAAAAAACAGTGCAACA 

AAACACAAATAGCATTCCAACAGTT 

Sequence 2076 

TTTCGAGCGGCCGCCCGGGCAGGCACATAAAACATTATTCCTTCCTTGGCCTAAAAACTC 

ATCGCCACCTACATTAAAGCTAATATGCCTGATTACTGTTTTTAGAGMCTTATTTTATT 

AGGGCAGTTCCAAGCTCAAAAATACGCTAACTGGCACCTTGTTAGCtACATAAAAATGCA 

CCCTAGACCCGAMCTTACTAGACTCATTATAAAATTTTCTTTAAGGTGTCCACGCAGTC 

CCTGGTCAC ACTTGAAG CAG TCCG GAGAAATATCAGCCCTACCCCAGTAATCCCCAGAAG 

GAACTTACAC 1 1 1 1 1 1 1 1 AATCTTTTCCTACAACTTCATATTTTATAAATAAAAAGACAA 

AAATGTCAGGCCTGTGAGCTGAAGCTTAGCCATTGTAACCCCTGTGACCTGCACATATCC 

GTCCAGGTGGCCTGCAGGAGCCAAGAAGTCTGGGAGCAGCCCGAAAAACCACAAAGAAGT 

GAAACAAGCCAGTTCCTGCCTTAACTAATTAACCCACCTTACGACATTCCACCATTATGA 

CTTGTCCACCATTATGACTTGTTCCTGCCCTGCCCCAACT 

Sequence 2077 

CCCmCGAGCGGCCGCCCGGGCAGGTTTANGTCANAGTCTTCTNTTCTNTTCTNNNTGA 

GATGGAGTCTTGCTCTGTTGCCAGACTGGAGTGCAGTGGTGCGATCTGGGCTCACTGCAA 

TCTCCACCTCCXJGGGTTCAAGCGATTCTCCTGCCTCAGCCTCCCGAGTAACTGGGACTAC 

AGGGTGCGCGCCA(XAAGCCCAGCTCATTTTNGTATTTATAGTAGAGATGGGGTTTCACG 

ATGTTGGCTAGGGAT GGGT CTCGATCTNTNGGTCAGAGTCTNTTNCTGTAAAATATCCTT 

GGGTAAAAGAAGCAATTTTANACTGTAAACTGATGNCAANATGCTTTAAGGGAAGAAGGC 
N 

Sequence 2078 

TCCCTTNCTTTCTCGCACGTTCGGCCGGCTTTTNCCCGTCAAGCTCTAAAATCGGGGGGG 

CTCCCTTTAGGGGTTCCGMTTTAAGTGGCTTTACGGGAACCTTCGAACCCCAAAAAAA 
Sequence 2079 

CCCTTTCGAGCGGCXJGCCCGGGCAGGTNCAGGGTCTGTCAGAAACTGTTGGAATCTTACA 

TAAAGTCAAGTCTCAGAAATGTCCGATGCTTCACCATATTCTTATATTCTATGCAATTGT 

TGTCTGTGCACTAATCATCTCGACCTTCTACATGAGATACAGAATTAATACTCTGGAGGA 

GCAGCTGGGGTTACTAACCTCCATTGTGGACACCCATAATACTGAACAGGCAGCACCATC 

TGGCCTGAGGTCACAAGTACCTCGGCCGCGACCACGCTAAGGG 

Sequence 2080 

ACCNTATAACGGCCGCAGTGTGCTGGAATTCGCCCTTTCGAGCGGCCGCCCGGNCAGGTA 

CGCGGGGNTGGTTCCAACTTTTCTGCTNATCTGGGAGGTGNTGGGCGCGGACAGTCNAGA 

TGTCAGAGAAAAAGCAGCCGGTANACTTAGGTCTGTTAGAGGAAGACGACGAGTTTGAAG 

AGTTCCCTGCCGAAGACTGGGCTGGCTTAGATGAAGATGAAGGATGCACATGTNCTGGGA 

GGATAATTGGGATGAATGACAATGTAGAGGGATGACTTCTCTAATCAGTTAACTGAGCTG 

AAACTAGAGAAACATGGGTTATAAGATGGGAGACTTCATAGCXJATCCAGAAGAAGTGCTG 

AAGTAAACCTAAAACTTGACCCTGCTNAAATACATTGTAGGGGCAAGAAGAACCCAAGGA 

ATGGGGACACT 

Sequence 2081 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACGCGGGGNAAGTGTCGGCGCCGCCACTGTCCG 

GCCACAGCCTAACGCTCTTNGCTGTCGTTTGNGGNCTCGCGCAGGGCGGCCCCGGNTCTG 

GTGTTTGGCNGTCGGAATTAAACAACCACCATGTCCGAGCAAAAAGGCAAAGACCAAGAC 

CACCATAGAAGCGCCCTTAGCGTGCAACATCCAANGNGTTAGNCATGTTNGACCAANNCA 

CAGATTNGAGGAGTTCAAAAGAGGCCTNCAACATGAGTTGATCAAGAANANGAGATGGCT
TCATTNGANAANGGAAAAATGCCNCAAGGNNNNGCCTNNGCTNACTCTAGGGGAAATAAA
CCCCNACCTGANTNNGAAAATCTTGAATGGCCCATGGATGAAATGAGGCCCCCA 
Sequence 2082 

CCCTTAGCGTGGTCGCGGCCGAGGTACTGTGATAThTrCACATATTTTTGAGAAAAATTCC 

CAAGCCAGGCGAATGTGGATTGGAATAAAGACATAGGCAGTGTATACCACCATAGCAATA 

ATGGTTAGTAAGATGGTGTTAAACATAGATCGCTCCCAGGGCTCTAAAACAGCACAGCAG 

CTMTGATTTGGTATTGATAGTAGAGCCAGGAGAAATATTCCTTCACACGCCTCAAATCC 

ATGGTTGGCTCCTTCAAGCTGCAGTAAGTTTGTCCTAAGAAAGTCCAGGTCTGGTTCTTC 

AGCCTTGCTCCTTCGCGAAATGATCCTGTGTGGGTTAGTTCTCCTCTCTGGGTTGCTGTT 

TCCTCATCTCCCAGTTGGGTGTATCTCCCTGCGGCTTAGGTGAGCGCCGAGGCTTTGGCT 

CCTCCCCAGCTGCTGCGAGCTTCCGCTGCCCCCGCGTACCTGCCCGGGCGGCCGCTCGAA 

AAGGG 

Sequence 2083 

CGGGCAGGTACCAAAGCCTTN TGNC CCCATTTTCCATCATACGAATAGTATTCCCTGTTG 

CTAAGCCGATGATACATTANCCTTTTCCCAATAGGNGNGAGNGGATGGACTGAATGGAGA 
A 

Sequence 2084 

CGCTCACTGCCCGCTTTCCAAGTNCGGGAAAACCTGTCGGTGCCAGCTTGCATAAATGAA 

ATCGGCCAACCGCGCCGGGGGAAGAGGGCGGTTNGCAGTAATTGGGCNGCTCATTCCAGC 

TTCCTCCGCTTCACTGGACTTNGGANGGCNCTCGGGTCCGTACNGGCTGGCGGGCNGAGC 

CGGGGNATCAAGCTCAANTTCAAAAGGCCGGGNAAATACCGGGATAACCCACCAAGAANT 

CAAGGGGGGATAAACGCAAGGGAAAGGAACATGGTGAAGCCAAAAAGGG 

Sequence 2085 

NCNGANCNGGGAGNGGGGAGAGCCCGCGGAGGNGCCCCAANCNCCCNAAAGCGAGNCGC^ 
NNACGCGCGCNCACGGGCCGCGAAAGACAACGCAAGACGGGGAAAACCCCGCCGCNACCC 
AACANAACCGCCGNGCAAGCACAANCCCCANAG 
Sequence 2086

TTTNNAGTTTTNGGGANCCACTAGTTCTNGAGCGGCGAGGTACTTATCAANAATGGGGTC 

CNTAMGACATAATCTTTGCCAGTAGATGCAGAGTTGCTGGTAGNTTATGACTAATTCCA 

AGACCGTCCGAAATCCCTGGGCTGTGTTGAAATGTGNISTTTCATGCTCCCTCGCTCCCAAG 

CATAGACCGNCAGGAGCTCCAGGGCATACTGAGGNGGCAAGCTTCCAAGCTTCTTCTTAC 

AATTTTGGACCTGCCCGG 

Sequence 2087 

CCGGNCAGGTACACTCGGMATGGMGGCGANGTTTGAGTCTTTCTTCATCTCGGTATGG 
AAATCCACCTGCAGCTGTGGGTCCATGACGAAAGGGATGATCGGTGTCCCTGTGATTATC 
ACACAAGAACCAGTAGACAAGGACACAGGCCGTGTGTGTGGTACCT 
Sequence 2088 

CNGGACTCACGCGGCGGCGGGAAATGTNATNCGATCACAATTCCACACACATACCNCCGG 

GANTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnAnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnAnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnATCAGGGGGATAANGCAGGAAAGAANATGNGGAGCAAAAAGG 

CCAAGNAAAMGGCC 

Sequence 2089 

TTTNGGGGACCCACNANTNNGGNGAGCGGGGCCACCGCGGNGGAGCCCCAACCCGCCCNA 

CAGNGAGGCGCANNACGCGCGCCCACTGGCCGACGANNNACAACGACGNGACGGGGAAAA 

CCCNGGCGGNACCCAACAAAANCGCCNGGCAGCACAANCCCCNNTCGCCAGCGGGCGGAA 

NAAGCGAAGAGGCCCGCACCGAACAGCCCCTCCCAACAAGAGGCGCAGCCNGAAAGGCGA 

AAGGGACGCGCCCNGNAAGCGGCGCACAAAACGCGnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnAGCGCCCGCNCCCTNCGCGNANCAACCCCTNNC
CNTCNCGCCACGGACGCCCGGCTNTCCCCGNCAAGCNCTAAAAACGG6GGCCNCCCACAA

AGGGGGCCGANANAAGAGCNNNAACGGGNACCCNCGACCCCAAAAAAACNNGGAAANAAG 

GGGGGAAGGGGNCAACGCAAGNGGGGCCAANCGCCCCGGANAAAACNGGAGANNACCGCC 

CCNCNGAACGGANGGAAGACCANCGGCCCNAAAAAAAGGGGNCCCACGGGGCCAAANCAG 

GGAACAACACACAAACCCCAAANCCGGGGCCANNNCCNNTGGAANCAAAAAAGGGANANG 

AGCCGAACACCGNCCCAATGGGGGA 

Sequence 2090 

NATTGCGAANTGGGCCGCTCTTCNCGCTNNCTCGCTCACTGACTCCGCTTGCGCTCGGTC 

CGNNCGGCTGCCGGCGAGCGGGTATCAAGCTCACTCAAAGGCGGGAAANACNGTTATTCC 

ACAAGAANCAAGGGGGAATAAACCGCCAGGGAAAAGAAACAATGGTGAACAAAAAGGCCC 

AGCAAAAAGGGCCAAGGAAACCCGAAAAAAANGCCCNCGGTTGGCTGGCGTTTNTTCCAA 

TAAGGCTCCGGCCC 

Sequence 2091 

CCCTTANTTTNGNCNTTNNCGANGNACCACACACATAGGTAGCCNGCATTCATGGAACAG 

GCACCX3TGGGCTGGGCTGCACCACACCATCTTTCCATGTGTTATGTCTTTCTAGAGACTT 

CTTGAAMTTGGTAGGATTATCATATCATATGTTCTTGGAAACATCTGTTGACTATTTCT 

GTACATCATGGCTCGGACTTGGGTCAAGCTCTTGGCACCAATGTCCTGGCATGAGTGTTG 

GATGCCAGCAATCAGGTAAGGGACAAATTTGTGGATTGACCCTTTGTCCTGCACAGCACC 

AGACACTCCCTGGGCCACTTTGATTTTGTCAAGCTTCACTGAAATATCTGTTCTGGCTGC 

TGAGGTGCTTGTCCCATGGCATCGAAGAGAACCCATACCGCGATATTTCTTTAGCNCGGA 

TCCCATCGGAAAAAGAAAGTANTCACCAGGGGGCCCTAAGTGGGTGGCAGCCAGGAAGAG 

AGCCCCATCATGGACTGNNGGAGGCCCCAAGGGGC 

Sequence 2092 

GGAGCTCCCCGCGGTGGCGGCCGAGGTACATTTTTAMGAGTTGTTTTTTGGCCGGGCGC 

NTTGGCTCATNCCTGTAATCCCAGCACTTTGGGAGGCCGAGGTGGGCGGATCACGAGGTC 

TGGAGTTTGAGACCATCCTGGCTAACACAGTGAAATCCCGTCTCTACTAAAAATACAAAA 

AATTAGCCAGGCGTGGTGGCTGGCACCTGTAGTCCCAGCTACTTGGGAGGCTGAGGCAGG 

AGAATGGCGTGAACCTGGAAGGAAGAGGTTGCAGTGAGCCAAGATTGCNCCCCTGCACTC 

CAGCCTGGGCAACAGAGCAAGACTCCATCTCAAAAAAAAAAAAAAAAAAAGTACCTGCCC 

Gnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnG 

TTMTTGCGCCCCTTGGCGTAATCATGGTCATAAGCTGTTTCCTGTGTGAAAATTGTTAT 

TCCGCTCACAATTCCACACCAACATACGAGCCCGGGAGCATTAAAGTGTAAAAGCCTGGG 

GTGCCTAAATGAGGGGAGCTAACTCAACATTTAATTGCGGTGCGCCTCACTTGCCCGCTT 

TTNCAATTCNGGGAAACCTTGCGTGNCCAGCTTGCANTTAATGAAATCGGCCCAC 
Sequence 2093 

CCCCGCGTCCGCCTCGNAAATTGTTGATGCTCT7TCCCCTCCCCGAGGTCTCGCATNCAA 

ANCCTGGTGGGCTGGCCTTGTGTGGCTGCTTCTCCAGGCCTGGTCAGNACCCAGCAGGCT 

CAGGGTCTGCTCCTGATGCTGNGCTCTGGGACAGGCACGCCACTGTGNGAAACACTAAGC 

NAGGTAATCGAGCATTTNGTGATCACAGACTCCAGCTTCCTGGTCCACCCAGCATGTAGT 

CAGCACTCTGACCTTNACACCAGAGCTCCACAGCGGCTAGGAGTTGACTTCCTGTGTCAT 

GACCTCAGGAAATAAATTTCCTTGACTTTAAAAAAAAAAA 

Sequence 2094 

TTCTGCTGAGA CGCG TGTGGCTNCCTCCCCGCAACANCCAAAATGNTGAAGCTGATCGAG 

AGCAAGGAAGCTTTTCAGGAGGCCCTGGCCGCCGNGGGAGACAAGCTTGTCNTGGTGGAC 

TTCTCTGCTACGTGGTGTGGACCTTGCAAAATGATCANGCCCTTCTTCCATTCCCTCTGT 

GACAAGTNTTCCAATGTGGNGTTCCTTGAAGTGGATGTNGATGACTGCCAGGATGTTNCT 

GCANACTGTGAATTCNAATGCNTGCCAGACCTTCCAGNTCTATAAAANGGGNCAAAAGGN 

GGGGGNNNTCTACNGNGCTAACAAGGAAAAGCTTGAAGCCTNTATTACTGAATATGCCTA 

ATCATGCTCTGAAMGTGGGACCAGCTNCK5MGCTGNTTNAMCCTCGTACCNTTNTTAA 

TTTGCTAAAAACTATGAAAGTGTGGAGAGGCTATCCCAACTGNCATCTGATTATTAGTGA
CAATAAAAAAANTAATTCTACCCCTTNANAAAAAAAAAAAAAAAAAA
Sequence 2095 

TGTGTAGCACXTGNGGNGTCCTTGNGTGATTATTCTTGTNCGAGGTACTTAGGGCMGTC 
ACATGCCCTCCATCCCNTGGCTCANAGATGAAGAGTAAATCCAAAACATGTGCCTCGCTC 
TTGGTCACTAACTGCTGNCCTG 
Sequence 2096 

TCGAGCGGCCGCCCGGGCAGGTACTTTNTTAATGCCTTNGTTGGAGTCCTNATCCTCATC 
TTTAAAAAAAAAAAAAACAGNTTANCCTMGCCANATTCACTTTTTTTAGTTNACAAA 
GGATTAANTNGCCACANTGTGATTT 
Sequence 2097 

ATTNNCCCTTAATCATCTCACGCCCCATGTATGATTCTCAAAGNGCCTAGCGTGANCANC 

NGTCCCTNAGACCACACCAATTTCTTNATGTCNCNCTCAAGAAAGCCAAAATGACAATNA 

TAANGCCATCTCAANCNCAATANCCTACCANAACCACCCCNCGGNCTTATCTANACTTCA 

ACTCMACTCTCTGCCTCCTTACT^CTGGGGAGCTTNAAAACCANNTNACTNATAACTT 

TAAAAACCTNTCTNTAAAATNTCANAAACCACANCCTCACCATTNNCACAACCACCCCCA 

ACACCAAANANNTTCCCAAAACAACC 

Sequence 2098 

CCCTTAGCGTGGTCGCGGCCGAGGTACACCAAGACCAATTGCTAAAATCTTGGATTATGG 

AAAATTTAAGTATGAAAGAAAGAAAAAACAAAAAGTTGAAAAAGAAAAACAATCTTTCAC 

AAACMTAGAGAAATTCGTTTATCTTTTGGAATCMTTTAAGNGATATAAAAATCAAAGC 

AAAMAAGCCAAAGMTTTTTATTAGATAACGACAGAGTAAAAGTGGCTCTTCGTCTTAG 

AGGGCGTGAAMTACAAGACCTGMCAAGGTAAATTAATTTTAAATTCTTTTTTTGATGA 

AGTAAAATCGATTGCAAMTTAAGTAMGAAATGCAATCAGTTGGTAATTTTTTA 

Sequence 2099 

NGNCCTTNCGAGCGGCCGCCCGGGCAGGTACAAATTGAGCTCTCTATTCATAACCTCAAT 

GTAT GTATTCCTGCCCATTMTATACTTTGCACCAGCAAAAGCGATTTCCAACATATGTG 

TTTTGGAGGTMTTMGTMCTCTGTATAAAAATAAATGCACTTTTCCCTCCTTTCCCCA 

GTGAATGGAAMCnCCATACTTTCAMATAATMTAAMAMTMTTTTTAAGAGCAAC 

AGCCCTCAACTCTTTGCTGGTGCCTGCCATACTGCCTTTCTTCACTCCATTCTTAGCTCT 

GCTAGTTTCTTCTTGTATGTCATGATAAAAAGGGAATGTGGGGTGTGTAA 

Sequence 2100 

NCCTTAGCGTGGTCGCGGCCGAGGTACACTGGAGGCTGGAGCCTGCAGATGGCATGGCTC 

TGCGGCTCACCTTGCTGCAGTTGGTGGTGGTGACAGAGACTGCAGCTTGACTGTAGTGAA 

TTTGGAAATTATCTGTCTGGAAGCTCTGAGTTTATCTTGGGACCTCAAGAGGAGAGGATC 

ACCCAACTCACAGCAATCAAACTCCAAATGGTGCTATAAACTGAACCACACATGGACACG 

TCAGTCTTCCGAGGACCCTTAGATCAACCCCAGGAGGAGCCCTAGCTGCTGTTCCCCATT 

CGACGCCCCTTTCCAGCAGG 

Sequence 2101 

NAGGGGCGGGAATTTTGGGNGGGCCCCCCTTTCTTANMTGCATTGCTTCCGNANGGCCC 

GGGGNCCCCCCCCCCAGTGGTGGATGGGATATTCTTNCCAAAAATTGGGGGGGGCCCTTT 

TTNNGGNCCNNAAAAAACCCCNGGGGCCGNCCCGGGGCCMGGTTACCTTGGACTTGGAA 

MATTTGGGGCNTTTTNTTTTGGGGGNNCCTTTTNCCCNNNNNNNNNNNNNNNNNNNNNN 

NNNANGGGCCCCCGGGGTTTNTTNCCNTTTTCCCAMANAGGCCCCNTTTNNNGNGNGGG 

GNGGGTTGGGGGNAAAAAAANNNNNNNN I I I 1 1 II 1 1 I NNNNNNNNNCCCCCCCCCCCCN 

TTAAAAAANAAAAAAAAA 

Sequence 2102

CCCTTTCGAGCGGCCGCCCGGGCAGGCACTT A I II II I 111 II I I II | I | I I | ll CTTTT 

rn 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 m 1 1 n i m n i n i m i n 1 1 1 1 1 1 1 1 m 1 1 1 m i n 1 1 i i 

TTrTNNTNAANAAAAAAAAAANTTTTTNTTNMNTNGGGNCNNAMCTNTTTAACNAANN 

AAAAAAAANNNTNNTAAAANGTTNTCNAAGNNGGGNNNNNCCNNANAGGNANAAAANGAA 

MNGNNTNATTTTTTNTTNAAMAAAAANNriTTNNTAAAANTGTTGNNGGNGGGGGTAGG
TTAAAAAAAAAA
Sequence 2103 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACTCTGTCTCTGTAGTCTCTCCATTCTAAAGTT 

TMTTTGGAAAGGTCTGTCCTGAATTATCAAAGTAAGAATACTCAGATTTCCATAAGCTC 

TTCTCACCTCCTCTTAGCCCMCTCAMCTCCATCAGACCTTCTCACCTTGATTTTGAGC 

TGGMTGTTTCAATGAGCAAAATAAATTAGACAAANGTTAAAAAAAAAAAAAAAAAAAAA 

NGTACCTCGGCCGCGACCACGCTAAGGG 

Sequence 2104 

CCCTTAGCGTGGTCGCGGCCGAGGTACCTGACCCCGGTCCTCAAGGAATCAAAGTTTAAG 

GAAACAGGTGTAATTACCCCAGAAGAGTTTGTGGCAGCTGGAGATCACCTAGTCX)ACCAC 

TGTCCAACA TGGNA NTGGGCTACAGGGGAAGAATTGAAAGTGAAGGCATACCTACCAACA 

GGCAMCAATTTTTGGTAACCAAAAATGTGCCGTGCTATAAGCGGTGCAAACAGATGGAA 

TATTCAGATGAATTGGAAGCTATCATTGAAGAAGATGATGGTGATGGCGGATGGGTAGAT 

ACATATCACAACACAGGTATTACAGGAATAACGGAAGCCCGTTAAAGAGATCACACTGGA 

AA 

Sequence 2105 

CCCTTAGCGTG GNCGCGGCCGAGGCAC I I I II I I I I II I » I I I I I 1 1 1 I I NG C I I 1 1 1 1 1 

I II I I If I i I M I I I 1 1 I NTTTTTTNTTNTNCNTTTCN A I I I I I l ATNCI l I I I 1 1 1 1 1 1 

TTTTNMTGGCCAGGCTCCCAACATTTNAAAAAAACTGCNCCCCCCAATGGGTGAACAAA 

GTAAAGAGTAGTAACCTAAAGTTCACCTGAGTAAGCCACTGNGGAGCCTTAAGNGGNGAG 

GTCTTCCAATTTNANAGNGATGNGNCTTCAACTTGTATNATNATTTTANGCGGAAAAACA 

TAA 

Sequence 2106 

TCGGCGTCGCGACCCCCGAGGACCTCCTCTNCTCGCTCTGTGGCATACACTAGTCCTGGG 

CACTCAACCGCGGAGAGCCCCCGACCCCGGGGTAGCGGCTGAGCCTCAGCCGGGACCGGN 

ACCGGANCCCGCGCGGAGCATGTNATCCGGGCTGGGGGCAGCTGGNACAGTGGGCTGGGT 

TGGCCCTCCT 

Sequence 2107 

AATTTGTGTGTTTGTGTrTGTTTGGGGTTTTTGTT^ 

TTTTCTGGGATTTAAATTATAGGTGTATTTTCCTATTCTCTTGAGAAGNAGACTAAACAG 

TCTTTGCAATGATGACGGATGCACACAGMNAMACATTAGMGACATTACTTTCTATCC 

TCTCATGTGGTTGANCATTCTTACACGCCAAATGACTAMTTGGTGTTTCNTAGGAAGGA 

GCAGCTGTCACTTACAATGTGAAAATATTAATGTTTTAGGCCAGNGGGCCAAACCTTCAA 

GGGGGCCTNGTTGGNCAATTTATNGTCCCCTNATNCTTTTNAAAMTTTGAANGGTTCAN 

NNAAANTTGTNNAACCACCAACAAAACNCCCCCTTTNANCATNGNGGGGGCAANGGGGTT 

GGGGACCTTNGNGACCTTAANTGNATTGGCCAATTACCGCCTTGGGGCCTTGNTNAANGC 

CTTCAACNCCNAMANMTTGGCCCACCMTTGGGGGGTCTTTTNGTNANACCTTNTTTG 

NACCMCCMCMCCTTMNCTTNGANCTTCACAAANGGCTTGGGGCNAGGGTANCAACA 

ACNTTCAAGGmATNCCGGCTTCNCAMNGGGCNCTTNMCCTT^rfT^GGNGMTTTGA 

AGGCTTAAAAACCTTGGCCATGGAAATCCCANCTTTAAGGGCCCAATCCCCTT 

Sequence 2108 

ACCACGCGTCCGAGCTCGCTCAGCACTCCCAGGTCCTTAGCACTCX^CAGGTCGTAGCTGG 

CG CAGTCAG TAGGAACTGTAACTATGTCTCTGATGCACCACGTGTTTAGACACAGCACAG 

TCCTTTTTTCTGTTCCTACGGTGGMGTAGTTTCTCTTTGGGCATGCTGACAGCACTT^ 

TCATAGCCTCACCGATGAGCCCTTTCTGCGGGAGTGACTCCATGCCTGTATACAGAGTAT 

TTATACAGATGT TTTAGCA TCTTCATATGCGGTGTTAACCCCTAGTTCTGTACAGCATAT 

TCTGTTCAAGTATTTTTTTACMGCTTGTGCTGTAGGCACATGCCTTCTGCTGCAGAAGT 

GGACACCCGTGGCACACCCACCCCGCCCCAGTGGGGTGCCATGCCTTCCTGGGACATTGC 

CACTTCTGCCCTGGAACTCATGCAGGTACGTAGTAGCTGCTATTGCCAGA 

Sequence 2109 

NCGCCTATCACATAGTCAAACCCAGTCCCTGGCCACTGACAGGAGCTCTGTCAGCTTTCT
TAATAACGTCCGGCTTAATCATATGATTTCACTTCTACTCTATCACCCTACTAACACTAG

GCCTACTAACCAACACACTAACCTTATATCAATGATGACAGTGACATCGTACTGAGAAAG 

CACATACCAAGGCCACCACCACAATACTCCCGTCCAAAAAAAGCNCTTCCGGTACCGGAA 

TTMCAACAT ATTTTA TCANTTCTCNAAAMGTTATTCTTTTTNTTTATTCGNGCCTTCT 

TCTTGAGCAN 1 1 1 1 1 ANCCACATCTAAAGGCCCTCAGCNCCACAACAACACCTGGCTTTA 

AGGNGGGGCNCAATTTGNACCCANCCCANACAAGGGCGATTCANCCCCCCCCTGNAACCC 

CCCCATTAGTAMGGTACNCCCCTNNCTTAAAAACMCCCTTCTTGGNNATTTAACTrTG 

NCAATCCTTGGGGGGGTNAAACAAANTCAACCCTGGAAGGCTTCACCNACAAGNCCCTNC 

ATTAAAATTAAATNGACCCCGAAAAAANCAAAACCAAANTCCCAAAGGCNCCCTGCCTCA 

TTCACCAATACCTAACTTAGGGTTACCCTTAT 

Sequence 2110 

CGTCCGGGACCTTTATGTCTTGTNAAGATGTCTAGGCCTGGCCGGGCGCGGTGGCTCACA 

CCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGTGGATCACGAGGGCAGGAGTTTGAG 

ACCAGCCTGATCAACATGGNGAAACCCCGTCTCTACTAAAAATACAAAAATTAGCCGGGC 

ATCGTGGCACATGCCTGTAATCCCAGCTACTCGGGAGGCCGAGGCAAGAGAATNGNTTGA 

ACCCAGGAGGTGGAAGTTGCAGNGAGCCAAAATCACGCCACTGCACTCCAGCCTGGGCAG 

CANAGTGAGACT CCGNC TNAAAAACAAAACAAAACAAAAAAGCAAAACCAGATGTCTAGG 

CCMTGATMTTATTTTTGATGCATTGTGGATTANGNTCTTTTGTTAACCCCACTGTCTT 

GGGGAATGATGCXJTGCTGGGAMTTGAGTTTTTTGACTGAAACATGGAACCTTNCTGCTT 

TTTTTCTGGNTCCTATGAAGGTTTTGGAACATNTGAAAACACAAAAACTCACCTTGAAAT 

TTGAGCAGGTCGATGATGGCAAAAAATTATTT 

Sequence 2111

GCGTCCGCTGATCTGCTTTGGGACGGCCTTTATATACTTCCTCCTTTCCAGGCCTTCCAC 

CACCAGTGACCACTATTCGACATCTGGCCCACTCTCAGTCATCCTCCTGCTTATGCTTGT 

CTNCTCCTTGAAGGCTTCCCACTGCATGTAGGACAAAGGTCAGATTTTGTAACAGGCCAG 

GCCTGGNCTTNATAGTCTGGNATCCACTAATTTATGGTCTNAGTCTNATCCCTTGGAGGA 

TTACCTCTGNCCTTNGNAAGCTCTGTGCTCCNG 

Sequence 2112

TTCATAACAATTCTCCTACAAATCACCTTAATTCTGACATTCATGGCCACAGAACTAATT 

ATATNCTACATTCTATTTGAAACCACCCTTATCCCCACCCTAATTATCAGGTACTAAATG 

AGGCAGCCAAGCAAAACGCCTTAACGCAAGNACTTACTTTCTATTCTACACACTAACCGG 

CTCTCTACCCCTACTCATCATACTAAGCGCACACCTACAAACAACACAGGCTCACNTAAA 

CATCACAGCTANCTAACACTTCACGNGACCAAAAAAACNTAACAACCCACCTGGGNNCCC 

CACAGCANTTACCCTGGGCCTAGCNANTGGCATATATAAGCGCTTTTATGGACCAAAAAA 

TACCCCCTTNTTATNGGGGTCTACACNCTTATTGACTTCCCNCANAAGNCCCATGNTTTG 

AMGCCCCNCATTTGCAAGGGGNTCAAATANGTCCTTTGCACGNCAGNTACCTTCCTTAA 

AMCTAGGGCNGGCTATGGGGCAATAAAATACCGGGNTTTAACTCCCCATTTCTTCAAAN 

ACCCCTTAAACGGGAAATTANCATAAGCCNTACCCCATTTCCCTCATTATTAATCCCTTA 

TGGGGGCNATMNTCATAACAAAGGCCNTAAACANTGCCTCCCGACAA 

Sequence 2113

TTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTGCTACTTTTTAAACAATTTCAACTGCA 

G CTCTC TTTCACTMGTNAGATGGGNAMGCATGCCATTTCTGTTTNCTTGNGATTTTTA 

CTTTTTAGAMCACACATGCTTCCACTGCCATCTGACACTTCTCCACACGCTTTCAT^^T 

GTAMCCTGAATTCTATTTNGAGTACCTATCAAATACTTTCTGGAGGNGGGGCACGCTCC 

GCTCGGTCATGATGCTGATCCACTTGGGAACATCAGTTCCTTTCCTCTTCACTCCAGCGT 

CATAGAGATCCCGAGCATCTTGGNNAATCAGTTCATAATCAATGACAGAGCCATCCTCTG 

CTCTTCTACCCTTTGCCA 

Sequence 2114

GTGGGGGGGGGGGGGGNAANCTCGTTGGGACNCGCCCCNAATTNNNANNNAACCNNNGNN 
CNNNCTNGAGGGCGAAGGNATNNATAAGCTTGAGGGGGGGGGGCTGGANACCGANGNATC 
CACTAGTTCTAGAGNGGGCCGCCNACCGGGGTGGGNGCTNCAGCTTTTTGTTCCCTTTAG
TGGAGGGGTTAATTGCGGCGNTTGGGCGNNAATNATGGGNCATTAGGTGGTTTCCTGGNG
TGGAAATNGTNATCCGGGGGGGGAGGTNCANACAAAATAGCNGAGCCCGGNNAGNATAAN 
AGTGTAAAAGCC 
Sequence 2115

GCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCTGTTGTGTCCCTTTCTCTTCAAAGAT 
CCTGAGCAAAACANNGATACGCTTTCCATTACCTNATGGGGGGGNCNNGGACCCAGCTCT 
TGGCATTGCTANGGNNGGCTANNATCNNGCCACNTGAGGNTGTGGANNN 
Sequence 2116

GTAACTCACATTAATTTGNGTATGCGCTCACTGCCCCGCTTTCCAGTCGGGAAAACNCTG 

GTCGTGCCAGCCTGCATTAATGAAATCGGCNCAACCGCCGCCGGNNGAGAGGGCCGGNTT 

TGCGTATATGGGCAGCTCTTCCGCTTTCCTCGGNTCACTTGACTCGCTTGCGCTCGGGTN 

CGTTTCGGGC7TGCGGGCGAAGGCGNGTATTCAGGCTCACTCAAAGGGCGGGTNAATACG 

GGTTATTCCAACAAGAATTCAGGGGGGATAANCGCANGNAAAGAACCATTGTTGAAGCCA 

AANAGGGCCCAAGCAAAAA 

Sequence 2117 

ACCTGTCGTGCCAGCTGCCATTAAATGAATCGGGCCAAACNCGCCGGGGNAGAGGGCGGT 

TTGCGGTATTGGGGNCGCCTCTTCCCGCTTTNCTCGCTCACTGACTCGCTGCGCCTCGGT 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnCTTCACTTCAAAGGGCGGGTAATTACCGGTT 

TATCCACCAGNAATCAGGGGGNATAACCGCAAGGGAAAGAACATTGTGAAGCCAAAAGGG 

CCCANCAAAAAGGCCANNGAACCCGTAAAAAAGGGCCGCNGTTTGCTTGGCCG 

Sequence 2118

CCGCGGTGGCGGCCGCCACGCTGGTTTTGCATCTTCAGGAGACGCTCGTAGCCCTCGCGC 

TTNTCCTCGGCCAGTTCGCGGAAGAAGTGGCTCACGCCTTCCAGAGCCACATCATCGCGG 

NCGAAATAGAAGCCCANAGAGAGGTAGGTGTAGGAGGCCTGCAGGTACAACTTGTTGGCC 

TACATAAAACACCTAGATGGTAACAACGAGGCAGCCCTGGAATGCTTACGGCAAGCTGAA 

GAGTTAATCCAGCAAGAACATGCTGACCAAGCAGAAATCAGAAGTCTAGTCACTTGGGGA 

AACTACGCCTGGGTCTACTATCACTTGGGCAGACTCTCAGATGCTCAGATTTATGTAGAT 

AAGGTGAAACAAACCTGCAAGAAATTTTCAAATCCATACAGTATTGAGTATTCTGAACTT 
G 

Sequence 2119

GGAGCTCCCCGCGGTGGCGGCCGAGGTACCTAACCTACCTTTAAGACTGGGATAACTATT 

GGAAACAATAGCTAATACCGGATATAGTTATTTATCGCATGATGAGTAATAGAAAGGAGC 

TTCACAGCTTCACTTAAAAATGGGGGTGCGGAACATTAGTTAGTTGGTAGGGGTAATGGC 

CTACCAAGACGATGATGTTTAGCCGGGCCGAGAGGCTGTACCTGCCCG 

Sequence 2120 

TGCCCGGGCAGGTAC 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 GGG ATGGAGTCTTGCTCTGTT 

GCCCAGGCTGGAGTGCAGGGGCGCAATCTTGGCTCACTGCAACCTNTTCCTTCCAGGTTC 

ACGCCATTC TCCTGC CT CANCC TNCCAAGTAGCTGGGACTACAGGTGCCAGCCACCACGC 

CTGGGCTAAl mi iGTAI I 1 1 I AGTAGAGACGGGATTTCACTTGTGTTAGCCAGGGATG 

GTCTCAAACTCCAGACTCTCGTGATCCGCCCACCTTTGGCCTCCCAAAGTGCTGGGGAAT 

TACANGGCATGAGCNCACTTGCGCCCNGGCCAAAAAAACAAACTTTTTTTAAAAAAATGG 

TACCTNGGCTCGCTCTAAGAACTAGGGGGGATCCCCCGGNGCNTGCAGGGAATTCCGATA 

TCAAAGCNTTATCGATTACCCGGTCTGAACCCTTNGAGGGGGGGGGGGCCCGGGTACCCC 
AGCTTI I I IGGT 

Sequence 2121 

GGTACCTTGTCTGGAGAATGCAGTGACAGCACCGGCCCATGCTTGAGAACCCANGCGGCT 

GTGCAGAGGGCAGCCGACCACTATAGCCAGCAGATGGCCCAGCAACTGAGGCTCCCCACA 

GACACGCTCCAGGAGCTGCTGGACGTGCATGCAGCCTGTGAGAGGGAAGCCATTGCAGTC 

TTCATGGAGCACTCCTTCAAGGATGAAAACCATGAATTCCAGAAGAAGCTTGTGGACACC 

ATAGAGAAAAAGAAGGGAGACTTTGTGCTGCANAATGAAGAGGCATCTGCCAAATATTGC 

CAGGCTGAGCTTAAGCGGCTTTCANAGCACCTGACAGAAAGCAT

Sequence 2122

GGCNAATTGGAGCTCCCCGCGGTGGCGGCCGCCNGGCCAGGTACCCTTGGAAGATGGGAA 
AGGTGAGGGAAATATNNGAAGCAGGGTCAGAACATCCACTAAGAACATAGCACCTNAGTA 
NAGCTTACATTATATGAGCCAGGGTAGAGTTANTACTGAAT 
Sequence 2123 

TACTA TAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTA CI I I I I U M » I I 1 1 1 

TTTTTGCAACTCGAGACGTTAGTTTACTCCAGAGTCATGGAGACAGTGAATGGAGTGTGT 

GTGTGTGTGTGCGTGTGCGTGAGTGAGAGAGAGAGAGAGGAAGAAGGGAGGGAGGGATGG 

TTACAGGAGTGTGGGAAAAGTCAACAGGAAAAGGAGGGAAGGTTCGCTGTNAACTGACAG 

AAACTGTGACATCACAGGCATGTGCGGAGTCCCCGCCCCCCCAGGCACTGTGCCAAGGCC 

AGTACCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGGNCTGCAGGGAATTCT 

GATATTTAAAGCTTATNNATACCGACAACCTCNGAGnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnCGCTTNGCCGTANTCATNGCCAATAGCNTGTTT 

NCTGTGTGAAAANTNGTTTATTCCCGCTCACAATTTNCACACAACATACNAACCCGGGNA 

GCATAAAAGTGTAAAAG 

Sequence 2124

GCGAATTGGATCTCNCCGCGGTGGCGGCCGAGGTACCT77TTAAATCTAGCCCAGTATAA 

ACATTA GCCT GCTTAATATTTAGACATTTATAGGTAGAATTCTGAGCACTCAACTCATGT 

TTGGCATTTTAAAGTAAAAACAAGTGTGACTTCGAGGACCAAAGAAATTGTCAGCTATAC 

ATTTATCTTTATGAACTCATTTATATTCCTTTTTAATGACTCGTTGTTCTAACAT^ 

AGMGTGTTCTTATAAAGGTCTAATGTATCCACAGGCTGTTGTCTTATTAGTAAATGCAA 

AGTMTGACTTTGTCTGTTTTACTCTAGTCTTTAGTACTGGTTGTCAGGATTCAGCCGAA 

TGGCTTGCCTCAGAGGGTCAATGGCGTTCTGAGATGGTGGCCAGTTGTCCAGACGGTAGC 

TTGCTATTGCCAGTCCAGAGGTGAATTCTGGGTTCTTTGGCT 

Sequence 2125 

GCGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGGTGAAAGTTCTTGATGAGGG 

TCTCAATGGCCCTCTCCACATCACTGAATTCCTGAGCATCCTCTGCGTTGGCTGACCGAC 

ACTGTGCCATGGTGCCCACTGTGTCTGGTCCTTTGGTGAGAGTTCTGTTGTCCTATAGCT 

GGCCCCAGAGGAGCTGATGGCTCATGATCTGTTGGCAGCCGCTGAGACAAGACAGGAGGC 

CCCGCGTACCTGCCCG 

Sequence 2126 

CCGCGGTGGCGGCCGAGGTACCCGGTGCGCATAAGAGGAAGATTTCTGAAGAGTGCAGCT 

GCCTGAACCNANCCCTGCCGAACAGNTGANAATTGCACTGCANCCATGANTGAGAACAAT 

TAGAATNCCTTGGNGGGCAGCCTACGGNANCTNAAATGCCATTTAACCTGGAACTTGATG 

Sequence 2127 

AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCACGANGTCATATGTTATTTACAATT 

GGGTTTGTGTGGGATGGGAAGTAGGGCGGATGAGCCAGTACTTNTGCAATGAAGATGCAA 

TAGTCNTTGTCCTNTCCCACTGTCNCCTNTTTCCTCACCCNATGGCAGCTTACATGACCT 

ATTCCCAAAGGGTCCACCGAGNCCTGAACTCAGCTTCATCACCAACATTCCTCGCCTTNA 

GNAAGAATTCAACACTGTATAAGGGAGTNGAGGCANAGACTTGGGNCAGGGNGAGGGTGG 

NNAACACNNAAGCACACTNTCTTGTCATCAACCCAAGTTCAGAGACAAGGCCTCCNCANA 

TGNGGAAGATGATGCCCTTAGACACCNCAACCTGNTTGTCNNCNTTCNTNGAAGCNCAAA 

GCAGCCTGNATCTCAACTGAAGAGAAGGGG 

Sequence 2128 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGGGGAATGGAA 

TGGAATGGAATGCAATGGAATGGAATCTTCCGGAATGGAATGGAATGGAATGGAACGGAA 

TGGAACGGAATGGAACGGAATGGAATGGAATGGAATGGAATGCAATGGAATGGAATCTTC 

CGGAATGGAATGGAATGGAATGGAATCAACCCGAGTGCAATGGAATGGAGTGGAATGGAA 

TGGAATGGAATGGAACAACCCGAATGGAATCGAATGTAATGGAGTTGAATAGAATCAATC 

CGAATGTAATGGAATGGAATGGAACCGGAATGGAATTGAATGGAATGGAATGGAATGCAA
TGGAATGGAATCAACCCGAGTGCAATGGAATGGAGAGGAATGGAATGGAATGGAAGGGAG
ACTACCCGAATGGAATGGAATGTAATNGAGTGTAAGGGAATTGAATAGAATCAATCCCAA 
TGTAATGGAATGGAATGGAATGGAATGCAATGGAATGGAATCTTTCCGAATGGAATTG 
Sequence 2129 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACNCGGGGNACTGAAANTCCA 

CACGACANAATAGCCAGATCTCAGAGGAGCCTGGCTAAGCAAAACCCTGCAGAACGGCTG 

CCTAATTTACAGCACCCATGAGGAAAGGCCACTTANGGATGCAGCAAGAAGGAGCCATCT 

GCAATCCAGGAAGAAATTCCTTGCCAGGAACCAAATTGGTTGTCACCTTCATCTAGGACT 

TCTAGCCTCGAGAACTTACAAATGGTGATGATCATCAGGTCAAGGATAGTC 

Sequence 2130 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGAACTGAAAATCCAC 

AAGACAGAATAGCCAGATCTCAGAGGAGCCTGGCTAAGCAAAACCCTGCAGAACGGCTGC 

CTAATTTACAGCANCCATGAGGAAAGGCCACTTAAGGATGCAGCAAGAAGGAGCCATCTG 

CAATCCAGGAAGAAATTCCTTGCCAGGAACCAAATTGGTTGTCACXJTTCATCTAGGACTT 

CTAGCCTCGAGAACTTACAAATGGTGATGATCATCAGGTCAAGGATAGTCTGGAGCAATT 

GAGATGTCACTTTACATGGGGAGTTATCCATTGATGACGATGAAATGCC 

Sequence 2131 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTATCATTAATGTAT 

TATATACACTGATACTTTAAAACTTGTGTGGAAAAAACTAACTTATAATTTTGTATCACA 

CACCCTGGATATGTGTTCTGTTTCTAAGCGACATTTGTGAGAGATTATTGTAAAATGAGA 

GCGAGCAAATAAMCTTAATTTAATCTTTGCAGATACATACTTATGGGAAATTTGAACAA 

ATGAGTG AAACTCT GTGTTTTTAGTAGGCTGTGATAAACATTTCCGGGCACTTTGCAAAA 

GGACTTTCI I 1 1 1 1 GCCGGGNGCTTTAATNANTTAATAAAAA 1 1 n I \ I AAAAGTTAAAA 

A AAAT NGTGGNAAANAAAAC I 1 1 1 1 1 1 1 1 1 1 I NTTTTTAAAAANNAGGNTTNNANNAAAC 

NTTTTrrrNTTTNG CCGN ANNAAANCCCCCCCCCCCGGTTTNCNGGGGAAANAAAAAAAAT 

NNGGGCCCNC CNTTTTN TTTTTTTCNGGGGGGGGGGGGGGGGGGGGN 1 1 1 1 1 1 1 1 1 1 I GN 

GMMGNGNTGTTTTTTNNCCCCCCCCCCCCCTCTTTTTAANANAAAAAAATTAATTGGG 

GGNNN 1 1 I II 1 1 1 I I INNATTNNNNACCCCCNCNAATTNGGTNTTTTTTANNTNAANANC 

CCCCCNGGCCNTTNNTNATANATGCCCCCCNCCCCCCC 

Sequence 2132

GAGGTGGTTACATTCGTCGAAGGACACCAGCTGCGGAATTTGCGGNTTTGGCAGATTGAA 

ATCATGGCNGGTCCAGAAAGTGATGCGCAATACCAGTTCACTGGTATTAAAAAATATTTC 

AACTCTTATACTCTCACAGGTAGAATGAACTGTGTACAGAATCCATTCTCATTCTTTACT 

TGCTACATTATGACCATGAGGAGGGCANAGTAGAGGTGAACTCTCTGTATACTTGCTGAA 

AGTCTTCTTGTACCT 

Sequence 2133

GCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTCAGTAAGAGATGGG 
GTTTCACTATGTTGGTCACTATGGTCTTGATCTCTTGATCTCGTGATCTACCCACCTTGG 
GTTCCCAAAATGTTGGGATTACAGGTGTGAGCCACTGCACCAGGCAAACTGCGATCTTTT 
AGTGGTGCCTCTTCTCTCTTTTGACTTAAGGATGTTGTCCCTTAAGGAAACCTGGAGGCT 
ACTACTGTGATACACTACTTGAGAGATGGATTGTTGCTCTTTCTTCTACAGTCTTACANG 
GAGT AGATTA TAAAGACGGAAGATGTTACCATTTGCNTTAATTGTTGGAAGCTGANAGCT 
TT TAAl 1 1 1 IGGTTNCMCTGTTTTTGNGGGANNTCCCGNAAAAATTTTNNNNNAATTTT 
TNI 1 1 1 1 1 INGGGAAAAGGGGGCNCNTTTTTAAAAACAGGGGGGGGNNGAAAAAATTTGN 
GGGNGGGGGGCCCCNNCCAAAAAAAAAAMAAMGGGGGTTTTTTNTTNCCNGNNNCCGG 
GGGGGGGGNNGGGNCNNTTTTTAAANCC I 1 1 I 1 1 1 1 1 GNTTNCCNCCNAAAAAANNAACC 
CCTNNNGGGGNG IIIII llllll 
Sequence 2134 

C CGGGC AGGTACAAGAGATAGAAAGACCAGTCCTTGCTGAAAGACAAGTCTGAATGCTCC 
AC i 1 1 i iCAATTCTCTCTCCATTCTTCAGTAAGTCAACTTCAATGTCGGATGGATGAAAC 
CCAGACACATAGCMTTCAGGAAATTTGACTTTCCATTCTNTGCTGGATGACGTGAGTAA
ACCTGMTCTTTGGAGTACAGGACAATNAAGACTACTCCTATNTGCGGAACAACTAGCTT

TCTATTTAGTTCTAGAATGTTGAMCTGACCGATTGGCTGACATAAMGTCACATTTTAC 

MAAMGTGTCTCCAMTGCTTTGACTAGGGGAAAAACCCCTTTTCAATTAGAGGGAGCC 

ATTNTGCMCAMTTTCCCACAAATAATTCGCTTATTCCMGGGGCMNGGCACCATTTG 

ATATNGGGAMTTTTTTTGTTTTTNGNGCCAAMTTTAAGGGNA 

Sequence 2135

CGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCCAAGACTCAGCACTAGTCTGATG 

ACCTGCTAATTCACTGACAGCATAGGGCTGTCTGTTGTTTTTGCGCAAGTTGGTGTGAAC 

AMGTTCACMTATCTGGTCGMTAGGAGCCTTGAATACAGCAGGCMAGTGACATTTTT 

GCCAGATGACTCCCCCTTTTCGGAGTACCTTGTTCAAAAAACACCGCTGAGTCACTTCCA

GGTGCTGTTMGTTTTCTTTAGTGAAGATGTCTATACCAGAGGGAGCATAGTTCCAGATG 

ATTTCCTCAGCGGCAATGTAGTAGTGTCTMCATGCTTCCCACGGATATTATCCTTTTGA 

TGAAGACTTTGTTACACTCCTGGACCTGGAAAAAGGCTTGCAAACCGGC 

Sequence 2136

TNGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCTGAATGTAGAAA 

CAATGAGGATGGACCTGGTTTAATAATGGAAGAACAGCACAAGTGTTCTTCGAAGAGCCT 

TGAACATAAAACACAGACACCTCCTGTGGAGGAGAATGTAACTCAAAAAATTAGTGCCTG 

GNAAATTTGGGCTGATGAGCTTTCTGGATCCTAAGCCACCTACCGMTTWn'GGANGGTT 

GGCTGNNGGTGTGGGGAAACACAAGTCTTTTCCAATTTTTACAAAACGAA(}CAATTGACC 

CCAGGGACTTNTTTGGTTTATTGGGTGGGG 

Sequence 2137

CCGCGGTGGCGGCCGNNGTNCNCGGNGCCNGAAGAGGAAGATTTCTGAAGAGTGCNGCTG 

TCTGAACCGAGCCCTGCCNAACAGCTGANGAATTGNACTGCAACCATGACTGAGAACANT 

AAGMNTCCTTGGAGAGCACCCTACGGCAACTAAAATGCCATTTCACCTTGNAACTTGAT 

GGAGGGGAGAAAACT 

Sequence 2138 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGNCNGGCCAGGTACCANATGAANNG 
NNAAGACAAGGCCATNCNCCACTTTATAGAGGGNGTNAAANTAAACCANAGNTCCNGGGA 
GAAAGAAANG 
Sequence 2139

CCGGGCAGGTAC I I M I ! 1 1 1 1 1 i I I 1 1 1 1 I 1 1 I GTTTNATCATATTTAGGTTATTTATT 
AATGAMGGTATISTTGANATTTTCAGGMTACAMTTTTGCACCCTGATGACCTCAAATGC 
GTGCAACAAGATGTTTAATACANGAAAATAACACAAAAACTGTTGTTACAGTGGTTAGAA
TTTTTMCTTTAAAAMCCATGMTTTGTATTGNT^ 

ATATATACTTMGCTTAMTTAATTNCMCANGGNMACATTTTCCAACCCAGAGGTGTG 
GCCTGATGTTGGGGTTCMNTCTGGACTTNTATTTTTTGGGTANCACACTCMCTTTTGA 
ATTGTNTTAAGGGNTTATTTCNANCCATTCTAACTCTANGAAAAAATNTTNAAATTNCGT 
TTCCCAMGNCATTANCCGNGGMTTT^ 

GAGGTTAAMGGAAGGNGTNTMTATCACTTMTTATCCCMCATTTTCTAAAGGGGGGG 

GGAAAAC 

Sequence 2140

GCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTTATCAGCTAATTGGGCTCCTTT 

GMATGCTGAGGATCTGCTTACGCAGGTTAACTCTTTGAGGAAGGGGGTNGGGTAAGTAG 

CCCTTMTGTCTTGTAMTCMGGGGCCAGATGGCGTTCGTCAGGCTTTCCCAGCTAAGG 

GAGAGTCTACTCATATGGGAMCMGCCTAGGTMTTAAGGAGACAAAAAGGGAAAATTT 

AAAAATAGGGTTAGTAAAAACAAGGTTAAAATAGGGTTAGTAAAAAAACAAGGTCAGGCA 

TTACAAAAGGATTCTCCTGTCTCAGCCTCCACTGGGATTACAGGCTCGCACCACCAAGCC 

CAGTTMCTTTTGTATTTTTAGTACCTGCCCGGGCGGC 

Sequence 2141 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTA CI I I I U I I I I 1 1 M I I I I I I 1 1 
NTNAGCAAGCACGTGCACTTTATTGAATGACACTGTAAACAGGTGTGTGGGTATAAACTG 
CTGTATCTAGGGGCAGGACCAAGGGG
Sequence 2142

TGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCTTACCCTTGTCCCTCTCCTAA 

AGGGAGCACAAGGAAACTGAAGAGACTGAAAAAGAAGAGAGTTTGTAGCTGCAAAAGAAT 

AGGGATAGCAAGGAAACCCAGAACTGCATTCCCCTAAGTGGGGCCATCCCATGTGATTGA 

ATTGTCCATAGCTTGCCTATGGTGAGAAATGTGCATGCTCCGTGAGCTGGTCTCTTGAAA 

CAGGACTTATGCTTCCTCTATATTCTGGTTAAATTTTCCAAACACATAAGTTCACTGAGC 

ACAGATTTCTTATCCAGAGACAAGTAGAATCTAACCGCAGACTGTTGGCAGAGTTTCCAG 

GCACTTAGCCATGTTCCCTTCCTGACTCAAATCCCCAAAGGCCTTCACTCTCACTGAGAA 

TCACACTACTGTCCCATAGATAAGGCAGGCATTGAAGCACCTGTCGTGATCCTCTAGGGG 

GGAGMTGAMGGTTATTTTCCTGCATTGCATCATCATAGCTTTTAATA 

Sequence 2143 

AGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGNGGNGCNNNGCTCTACAGAATAGAGGCN 
ATNCTTTAGCTTAAGCCTGTCTGCTGACCAGAGAATGGAATTNTGCGTGGNCTCANGGAA 
CAAAAGGAAACTAGGCAGGGAAGGGGAAGAAAAGTGC 
Sequence 2144 

GGCN AATTGGAGCTCCCCGCGGTGGCGGCCGNGGNACTTN I rTTTTTTTTTTTTTTTTTl 

TTTTATMCTGATMTTCTTTATTACTAMGGTTTATTTACATAGNGTTTANNGCNTAAT 

AAAAAATAAATTACAATACAAAAGTGCT^^^TAGGAAGGAGACACTAAACAACAGGCCCA 

TNTTACCCCTTGCTNTTTNGCCAAGACATAAGCTACAAATTTTGCCGGAAAAACTGNCCA

TACATTTTTAACTACTTCTTCATTATTCTTATGGACCATCATCCAGGACATNTGTTTGAA 

GAAATATCCAGTTATMTATTTTCAAAGGNTNGAATTGNGAAGAAAAAAATATAAAATGT 

GA TTAAA G GATNThT rAGCCTTCAGATGTAATTTACNGGGTTTAAAAATTGGCCCCTTTAA 

AA CTTTT GCTTTTTTAGACNAGNNNTAAAAAGGCACMT^ 

GATTTTCANAACNGGGGGNTCTTNTGNGNMTNCNTACCANTTTTTTT 

Sequence 2145 

AGGCACN AGANGANTTTNC I I I I I I I I I I I I I I I I I 1 1 NCTGNNATACAAAGAGCAGATT 

TTTATTGAACTTGGGCNATAACTATATTNCCATACMTNTAMTATTCATGAATAGTTTC 

CCAA GTCTGGAG CGACCACATAGGGAGAAAATGTAMTGTCTCAATTTTTGGTTNCACNA 

AAATG! I I I i I 1 1 ATATCAAMTTTGNTTNTNAAAGCTTGGNGGATTAGCTTAAAAAANA 

AAAAAMGTTTCCTTGAAATCNGGGAMCMGACATTTTAAATGAATCAANCAAAATTTC 

AAATTAAAAAAATTATGAAMTATTATCCTCATTAGTTTCATTTAGTGCCCATGGAAANT 

TAATTATTCTCTCTGC^^TGATCTTTGGGGGGACMGTTTCMTGGAAAGCCTGTCAG^IT 

TAANTTCATTAAAAGTTT 

Sequence 2146 

GCTCCCCGCGGTGGCGGCCGAGGTACAAAGGAATAAAAATGTTACAACCTAAAAAAACAA 

AACACCGTAAMCATTTAGAGTAAAACATGACAAACGTGCTGCTTTTAGAGGAAAAGAAG 

TTTCATTTGMGMTTTGGACTTCAAGCTACAACATCTCAATGAATAAGTGCAAGACAAA 

TTGAATCATGCAAGGATTGCCGATTACAAGAAGAATGGGTCGTGAAGGACACGTTAACAT

AAGMTCTTCCCTCATCTATCTCTTACTAAAAAGCCTATTGGAGTTAGAATGGGATCTGG 

TAMGGTTCAGCTGAAAMTGAGTAGCTGTTGTTAAAGAAAATACAGTTATGTTTGAAGT 

TGGTGGAGTTAMGMGAAATAGCTTCGTGATGCACTTCGTTTAGGTGGACACAAACTAC 

CNTGTTAMTGGAANGAATAATTNNGCANGGGAGGAATAATGGAATTCAAAGATTTAAAG 

AAAAAMCT ACTG AGCATTTTAGMCCGTTATTAANTTGMTATTAMTCAGAGCTTTTT 

ACCCCTTCGTTTTAGAAATTAAACCTCAACCAACTNGGATCATNCCTCNAAAATTCAAAC 

MGGTTAGAAMNGGTTTTNCCCGGMTTTTTACNNCMTTTAGCCACCCAAAAAAAAAA 

Sequence 2147 

TCCCCGCGGGGCGGCCGAGGTACATNTGCCTGNCTNCCTNCTGTCCTTCCTTTTTATTAT 

AAGGATACATTTATAGNACCCCATAGAAGGAAAAGATAAATTTCATAGGCTGNTAAAAGA 

GGCTAGGCCTAAGTTATAATGCCTCCTCCTCACAGNCCAATTTNCCCAAGGGGCNTTANC 
ACCAGAGCAGNTTTTCTAGCTTGNGGACAATNCCNNCAGGCTTGAGTGATAATGNCCCNG

TNGCGGTAGCTCTCCACTTGNTNAAGGACCAAANACACCTTAGCAG 

Sequence 2148 

GGGCGA ATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTA Ct 1 1 I I I 1 1 1 II I I I I 
mum I GAC TT TATA TATAATACTTTATTTTATACMTTCAGTATATCATGTTAACATA 
TTTCACTT TTAATTT TATGATATGTGTGACATATTTTTMTTTTATGG 
TATCATA ATTTTTTTAAAGTTTGTATCTTTC^ 

m m M iCCTGTAGCTTCTGA GMC ACATTTTTAGATACCCGGCTTCTAGTTATACCTGA 
AGCTCCACAGTGTAGACATGTTTTGGCCMCCTTGTTTTATCGGTGTATGAAATTTGTGC 
TMTAGGATGGATCCMTTGTCATTACATTAGAAAACTAAATGGGAAACATTTCTT 
Sequence 2149 

TCCTATAGGGCGA ATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTA CI I I I I I 1 1 
II 1 1 I I II II I I n GACT T TATAT ATMTACTTTATTTTATACAATTCAGTATATCATGT 
TMCATATTTCACTTTTAATT^ 

ATTATACTATCATMTTTTTTTAMGTTTGTATCTTTCATTMTMGGAGGTTCCTTA^ 

AATGGAI I I I I I I ICCTGTAGCTTCTGAGMCACATTTTTAGATACCCGGCTTCTAGTTA 

TACCTGMGCTCCACAGTGTANACATGTTTTGGCCAACCTTGTTTTATCGGNGTATGAAA 

TTTGTGCTAATAGGATGGATCCAATTGTCATTACATTAGAMCTMTNGGGMCATTTTT 

TTTTACATTNGGGGCAACNTACCAAAATTTT 

Sequence 2150 

CCGGGCAGGTACACGGGCAAAGGGGCTTGAGAAGGCCCGGNGGCGAAGCCGAAGAGAAGC 

AACTGTGCCCCGGAGAAGAGAAGCTCGCCCATTCCAGACTGGGAACCAGCTTTCAGTGAA 

GATGGCAGGGCCAGAACTGTTGCTCGACTCCAACATCCGCCTCTGGGTGGTCTTACCCAT 

CGTTATCATCACTTTCTTCGTAGGCATGATCCGCCACTACGTGTCCATCCTGCTGCAGAG 

CGACAAGAAGCTCACCCAGGAACAAGTATCTGACAGTCAAGTCCTAATTCGAAGCAGAGT 

CCTCAGGGAAMTGGAAAATACATTCCCAAACAGTCTTTCTTGACACGAAMTA7TATTT 

CMCMCCCAGAGGATGGATTTTTCAAAAAAACTAAACGGAAGGTAGTGCCACCTTCTCC 

TATGACTGATCCTACTATGTTGACAGACATGATGAAAGGGAATGTAACAAATGTCCTCCC 

TATGATTCTTATTGGTGGATGGATCMCATGACATTCTCAGGCTTTGTCACAACCAAGGT 

CCATTTCCACTGACCCTTCGTTTTAAGC 

Sequence 2151 

CCGGGCAGGTACGCGGGGNANTGCNANANACNCAAANCNNGNTANTACANTGCATCAAAC 

ATGTTCMGATTNNCCMTTGACGGGATTGGATTNAMGATATNCCACCACTTTTAGCAA 

GATGGNGAAGTGCTAAATNACACAATTAATCAACTGGCTGAGTTAGCTAAAGATGCATAT 

GTTATTATAGGTCCANACGCMGANGTTTCTTNCTTGGGACACCTACTGCANCTISrTTTTA 

AAAAAACCTTTTATTATGGTAAGAAAACCTAAA 

Sequence 2152

AGGTACAGCCTCTCGGCCCGGCTAAACATCATCGTCTTGGNNGGCCATTACCCTACCAAC 

TMCTMTGTTCCGCACCCCCATTTTTMGTGMGCTGTGMGCTCCTTTCTATTACTCA 

TCATGCGATAMTAACTATATCCGGTATTAGCTATTGTTTCCAATAGTTATCCCAGTCTT 

AAAGGTAGGTTAGGTACCTGCCCG 

Sequence 2153

AGGTACACTTCTAGCACCTAGCAGAGAGAGGCTTCACTACATNATGCTTCCTGACATCTC 

TCCCTTTGMGAGCAGTCAGACTCCTGCTTTGCTCTTCAGACTTAATTTGGGGGTTTAAC 

AGGTGAGGTTGCTGGGGGAACTCTTTTACAACATCTCTCTGAAAGAATCCGGGCTGCCAG 

TTTCATTTGGTTTGGGTGTCAGTAGCATGATGGAAAGACAAAAAAACACAACTTGACATC 

TQCAGAAATGGGTTCAAATTTTACCTGCAACTCACCMTTCTGTGGCCTTGGTTCAGCAA 

TTAMCTCCCTAAMTTCAGTTTTTTCTTTGTAAAATGGGGTTATGAACAGTACCAAATG 

AAGTGTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCA 

AGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAA 

MTGGAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAA 
ATGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCA

GCATCAAGCTGGAATGGGG 

Sequence 2154

GNCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGCCAGGTACTGTGCACGTNGGTTAN 
AGGCTGCGTGGCAGGANNGNGNTCAGATTNTCCCCTGCACNGNMTTGGGCTTTNAGGGG 
GAAATGGTGGGGCCATCT 
Sequence 2155 

GAGCTCCCCGCCGTGGCGGCCGCCCGGGCAAGGTACANGTGTGTGTGGGTGAAATGGAGA 

TTTGGAATTGAACTCTCTGCCTGTAAATGTTCCCCAAATAATTGTTGTGTGTATGATACC 

GTGTATAATAAAAGTATTCTTGTTAGAATCTGAAAAAAAAAAAATGTAAAAAAANNNTAA 

AATAAAGGTTCCTTGGCCCGCTNTANAACTAGTTGGATCCCCCGGG 

Sequence 2156 

CGGCCGCCCGGGCANGGTACGCGGGGACACMGACATCATCTTGAAGGAAGGATGGCTTT 

GGCCAGACCMGACCGAGACTTGGAGACCTGATTGAGATTTCTCGCTTTGGCTATGCACA 

CTGGGCCATCTACGTGGGAGATGGCTATGTGGTCCATCTGGCTCCGGCAAGTCACTGGTG 

CAGTCACGACAGTAGGTGTGGCAGCAGGCCTGCTGGCTGCCGCAAGCCTTGTGGGGATCC 

TGCTGGCCAGAAGCAAGCGGGAAAGGCAATAAATCCAAGAAATTGTCCCAACAACCACCA 

ATTNT TACG GAGGMTATTATTTAGC^ 

ACTGNTTTTNTTGGTTCATGGMTCm 

GTNTTTTGGAMTTGCACCTTrfTTATNGAATTN^ 

CCNAAAAAAAAAAAAA 

Sequence 2157 

CCGGGCAGGTACCATTTGGAGTTTMTTGCTTCGCTCCGATGAATGAATTCCTGGCCAAT 

GCACCAAAATGATACGGCTCCGATGACTGGAGGAACACCAGGGTCCTTGGTCTCGCACCA 

GTTTAGATAAMTGACACAGACACACATGTMTGGTTTTAAGGAGTGGAGAGTTTATTAG 

GCAAGAAGGAAGGAAGAAGAAAACAGCTCCCCCATACAGAGACAGAGGGAGGGGGGATTA 

GAACAAACAGAAACTTCCCCGCGTACCT 

Sequence 2158 

CCGGGCAGGTACAGCGTCATATAGGCTTTGCCTTTAATGATCTCTTACGGTTAGAAAACA 
CAATAAAAACAA^CTGTTCGGCTACTGGACAGGTTGTATATTACCAGATCATCACTAGCC 
AGATGTNACG TTGG CACATTTGAAGTCCTTFATTGAMTTCATAAATAAAAGMTTGTTC 
TTTCTTTGTGGTTTTAATAAGAGTTCM 

T MTM TCCCTTTTAAATTTTTTATCNTGTTGCTGTTTACCCT^ 

TATmAGAmGCCTMTGNCCAmTCAmCMGGNAMATTGCCCAAAGAGGGGTAT 
TTCC CCTT NGGG GGAAA ANNGGGGGNCCNTCTTTACCAAGTGGTAAAA NI M I NI I CCC 
TCCCTTTTMCCCTTTTTGCTTTMTCATTCCMNTGN^ 
MTNCCCCTTTGGTTGMGNGGCMNGTTTTGTTTGGMCCTTGGAAGTTT 
Sequence 2159 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCCATCCTACCCGG 

CCTGGGCTGACCCATGG GGM GGCTGGCTMTTTCAGTGCTTCTGCTTGGTTGTTCAGGG 

CCATTTCAGGTTTGGGTGTTTTCTGGGGATGTTAACATGGGATTCAGGCTCAACTCACAA 

GAMCTTTTCCATCTCATGATGGATGCTGTTGGGCATGTCCAATGTATGACTTCATGAGT 

TACACAGATGCTMTTCGTAGGGGCACTTGGMTCACATGGTTGTTTTGTGTCCCATGGT 

CAAGCATTCTATCTTATCAGGGCCTACAGTAACATGCCAAAAGTTGCTTCCAACATATTT 

CTCTGCTTTGGATG 

Sequence 2160 

AGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATCCCCAGTCGTGGCCCTCTGGACAA 

GTGGCGGGCCCTGCACTCATGAGGGCTTCCAATGTGCTGCCCCCCTCTTAATACTCATTG 

TCMTTTGAGAAAMGGACATATGAGTTTTTGCATTTATTMTGAMCTTCCTTTGAAAA 

ACTGCTTT GMTTA TGATCTCTGATTCATTGTCCATTTTACTACCAAATATTAACTAAGG 

CCTTATTM7T7TTATATAAATTATATCTTGNCCTAAAAAAAAAAAAAAAAAAA 
Sequence 2161

ATTGGAGCTCCCCGCGGTGGCGGCCGCCCQGGCAGGTACCCACGTAAAGACGTTAGGTCA 

AGGTGTAGCCCATGAGGTGGCMGAMTGGGCTACATTTTCTACCCCAGAAAACTACGAT 

AGCCCTTATGAAACTTAAGGGTCGTTGAAAANAAANGANTCANAANTNTANAAAAAAATN 

ANGNNCCTN 

Sequence 2162

NCCGGNCAGGTACGCGGGGCACAGCGGCTTNCTTGATCCTTGCCACCCGCGACTGAACAC 

CGACAGCAGCATGNCTCACCATGAAGTTGCTGATGGTCCTAATGCTNGNGGTCCCTTTNC 

CMCNACTGTTTACGCANGGCTTCTGNCATGCTCCCTTTATTTGGAGAATTGNGANTTTT 

TCAAGNACAATTNCAATTNCACCAAGTTGTTCTAATNACCTGAAATATCCAACAGANACN

NTTTTTNMNGMGTTTrCANTMNACCNNACMNTGGCCNACTTTACNAAAATTGGCCA 

TTATTATTGAAATNTGTAANNGNANATTGTTT 

Sequence 2163

GACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGGGACAACACAAA 

TTACTGCAACTCCAACAACTATGGCAATAACAATCACAGCAATGACCCAAACATCCAAAC 

TGTCMGTATTCCTTCCTCAGGTTTAGGATATCCTGGATMTTTGAGACTGGAGGCTTGT 

MGTAGGCCTAGGACCTGAGGCACTGGACGCTGGAGATTTTGTAGTGGMGAAGTCGACA 

CTTTMGACACTTTGGAACTGGGGGATCCCAAGTACTTGTGGAAGGTGGACACCATCACA 

TCGAGGGCCTTCTCCAGAGGGCACGCCATGACAGCAGTCAGGATCAAACCAAGAAAGAAG 

AAGCGCTTAGGAGAGAGGGGTTGTAGAGAGGGGGAAAGAATTCCCGCGTACCTTGCCCCG 

GGCGGCCCGCTCTAGMCTAGGTGGGATTCCCCCNGGCTTGCCGGGAATTCCNATTTTCA 

AAGCTTTNTCGGATNACCCGGCGACCCTTNCAAGGGGGGG 

Sequence 2164 

ACTATA GGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTA CI I I I 1 1 1 I I I I I I I I I 

TTTTTTGCGAGATGGNGTCTTGCTCTGTTGCCCAAGCTGGAGNGNAGGGGCGCAATCTTG 

GCTCACTGCAACCTCTTCCTTCCAGGTTCACGCCATTCTCCTGCCTNAGCCTCCCAAGTA 

GCTGGGACTACAGGNGCCAGCCACCACGCCTGGCTMTTTTTTGTATTTTTAGTAGAGAC 

GGGATTTCACTGTGTTAGCCAGGATGGTCTCAAACTCCAGACCTGGNGATCCGCCCACCT 

GGGGCTCCCNAAGNGCTGGGATTACNGGCATGAGCCACTGGGCCCNGGCAAAAAAACAAC

TTTTTAAAAAATGGACCCTGGCCCNGGGCGGGGCCGTTTTTAAAAACTAANGGGGGATNC 

CCCCCNGGCCTNGCANGGGMTTTCCMNNATCAMGCTTTTNTTTGGATAACCCGCCNN 

CCCCNNTNNGGGGGGGGGGG 

Sequence 2165 

AGGTACAAGATAGTCATCTCAGTAAAAGGTCTATTATCTAACTTGCCAAACTTGTTTACT 

GAGAGCCCTAAGGAACTAAAACTGCCATAATGCCGNGCACAGCTTGNAAAGCANTTAGAG

TMGCMGATTAGTTTTTCCTCCCTTCCAGTTCCTCAGCAGGCCTGGCTGAAGGCCCANG 

AGGGAAGGAAATATAAGAACCAACAATAAAAATAGCAANTNGCAATAAGGAAGAATGCCA 

TCCCCATGGTAGGCANCACNCAATANATTTNTGGAAACCCACCTNTTCCCGGGATCANGG 

CCTTTCCATTTGCTCACCGGATGCNTNACCGCCTGGGGCCGGGCCCGNCTTCTAAGAAGN 

TAGNTGGGATCCCCCCGGGGCTTGGCAAGGGMMTTTCTAATMTTCAAAGCCTTTAAN 

TNGGAATACCCCGTNCGGAACCCCTTCCGAGGGGGGGGGGGGGCCCCCGGGTTACCCCCA 

AGGCTTTTTTTGGNNTTCC 

Sequence 2166 

CCGGGCAGGTACGCGGNGACGMGTTCCGGTCCAGGTCTCTGACTTCGGGCTTGTTCGCT 
GGTGGCNGTCGGAGCCGAGCCGGACTGGTCAGGATGANCACGGACGTGCAGNTCGCCATC 
TTCGACAACATG 
Sequence 2167 

AG GTACT CCTGGGCAACAGCGAGACTCCGTNTCAAAAAAAAAAAAAAAAAAAGAAACCAT 
TTATTTTAAAMTGATTAGATTGCTATGCCTCAACTCATAGAAGATGAACCCTTCAAGAA 
AACGTGAAG NAGGA ACCGGGNGGGCCANNAAATGAAAACAGGCAAGTANAGNTATTANTT 
NGGAAAAACATTTTNTCAACACCMATGTTAAAMGACTTTCCTTTTGNTAAAACCTGGG 
ATTATGAGNAAGMCTTTTTCMTTGGGGTTANlsnTC

GCANCC ACCTT MNAAMCTTGCCATTAGAAAAMCCTGNAAMGGTTTATTGTTTCCCA 

GNATMCTTrrCCGTTGTTTTACCCAAATT^ 

Sequence 2168

AGGTACTCCAGGGCTTCATTCATATTTCCTTCCMTTTGTAGACGAACCCAAGGAGGCTC 

MGCTTT CCAGA TCTMTGCCTTTCTCCGMGTTTCCTTAAMCCMTTTCTTCAAAGAA 

TTTGATACTTTTATCCCTTTGTTAATGATAGCCCTGTTCTATTTTT 

MTGGATAATTTGCATTGACTGTCAGATTTCTTTTGAAATTCCTGNAAACNCGACNCATN 

AGTTGGAAAATTGNTATGTCTCTGCATTGTTTTCTTCTACCACNTGGNTTTCATCGCNAT

TAAACAA I MINI I GAAAAAATTTCTCTTTCAAGCCTTTNTCTGTGGATTGGCCTGCCT 

TCTAATA TTACCMTTTC CTTGGCCANGGGTTCTTAGGATGTAGCCCACCCTCAAAAATG 

TGGGGCC I I I I I 1 1 I 1 1 CCCACCTGGCAAGAATTCAAANAATCGGAAAAATAATGGGGCC 

NTG 

Sequence 2169

CGAGGTACATTTTNAMGAGTTGTTTTTTGG 

CCAGCACTTTGGGAGGCXJGAGGTGGGCGGANCACGAGGGCTGGAGATNGAGACCATCCTG 

GCTAACAAAAGAAGAAANCCCGTCTCTACNAAAAATACAANAAAATTAGCCAGGCGNGGG 

NGGCTGNGCACNCTAGTAGGCCCCAGGCCTACTTTNGGCAGGCCTGAGTGCAGGNAGTAA 

TGGNCCGCGCANCCCTGNCAAGNGAAANTAAGGNTNTGCNGGTCGGAGNCCCAAAGNANT 

GCGGNCCCCTTGCCACNCCCAAGGCNCTNGGGCCAAACCAGAAGGCNAAAGGANCTCCCT 

ATTCCTCNMNAACNCMGATATACCNCAANNGCGAGGGAGGTTTNNGGTTNCTTACCTC 

ANTTGGGCNNCAAATTNATAAGGGNTCAATANNANCNAGGNNTAACCAATAAANNACCCN 

CACAGGGGNNTCCTTTAACCAAACCCNTAAAANNCACCTGGGGTCAAGTCCNAAATAAAA 
A 

Sequence 2170 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGCCAGGTACCTGCTGGGAGAAGTT 

TCTGAACAACNATATCTGGGATACACAGAAAAGTNTGGAAGANGAGAAAGAAANGCCTAA 

ATNGGMTGAGATCCMGACTAMCGCNAGAGCTAGATTGAGCCX3CATTTGAAANCTCCT 

TCCCNTTGGGGCNTTGGCAGAGGGGGAGAAAAGGCTTCAAAGGAACTNGGTGGCATNANC 

ACCCCCCTCCCCCAATGAGGACACCT 

Sequence 2171 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCACTGAGCTCAAGA 

CACACTTGGTGCTTGCAGTGCTGATACCTGCTCTTTGCCGGCCCCCGCGTACCATCCCAT 

GTGGAATCTGTGAGTGTCCTCTTAAGTAGCGTGGGCTAGCCAATCTGCTCGTTNATGGGT 

GTATTTGTAAACTCCGAATTCCATATGTAATANGGATGCAAGTCTAAGCGTTTCATGTGG 

ACATAAATGTATCTAMTMMCTTTCCCTAGCACTGTGGCTGACCTCACCCTTACTTTT 

ATACTTTAGTATGAAACTG ATGAG AACTTTGGTAGNGAGT A I MUM I ATATATATACA 

TATATATGTATCTATCTATATATATATCTCMGCATCTTTCAGGTCTTTGTGTNGTGGNT 

TTThnTAMGCCCCTGTTGTAAAAAAAATTACTATTGTGGGATGGGCAGTCTCTCACATC 

ACAGATGTNGAMAGNATAATTTTTATANTNNGTATTTNCAAANAAAATAAAATCTGGGG 

AMGGTNNCATTCCTNTACTGGNGGNCCAAAAAATCAATNGGTTTTGTNTGNCCAAAAAA 

AAATATTAAMTTAAAMTATANTTNTNTTGAACTTAAAAAAAAA 

Sequence 2172 

AGGGTACATTTTTAAAGAGTTGTTTTTTGGCCGGGCGCAGTGGCT 

AGNACTTTGGGNGGCCGNGGNGGGCGGATCACGAGGTCTGGAGTTTGAGACCATCCTGGC 

TAACACAGTGAAATCCCGTCTNTACTAAAAATACAAAAAAATTTAGCCAGGCGTGGTGGG 

CTGNNAACCAGTAGTNNCAGNTACTTGGGAGGCTGAGGNANNGANAATGGGCCNTGAACC 

TGGAAGGAAGAGGTTGCANTGAGCCAAAAANTGCGCCCCTTGCAACTNCAGCCTTGGGCA 

ACAAGANCAAGACTCCCATCTCAAGAAAAAAAAAACAAATAAGNAAAAGTACCTTGCCCG 

GGCGGCCCGCTCTANAAACTAGNGGATCCGCCCGGGGCTGNAGGGAAANTCTATANCAAA 

GCTTANNGAATTCCGCCNACCTNGGAGGGGGG 
Sequence 2173

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTATCCCAGCCCCCCAGGTTCGC 

CTMTAGTCTTGGCTACTTCCCTACAGCTGCTAATCTTAGCGGTGTCCCTCCACAGCCTG 

GCACGGTGGTCAGAATGCAGGGCCTGGCCTACAATACTGGAGTTAAGGAAATTCTTAACT 

TCTTCCAAGGTTACCAGTATGCAACCGAGGATGGACTTATACACACAAATGACCAGGCCA 

GGACTCTACCCAMGAATGGGTTTGTATTTAAGGGCCCCAGCAGTTAGAACATCCTCAGA 

AMGMGTGTTTGAAAGATGTATGGTGATCTTGAAACCTCCAGACACAAGAAAACTTCTA 

GCAAATTCAGGGGMGTTTGTCTACACTCAGGCTGCAGTATTTTCAGCAMCTTGATTGG 

Sequence 2174

CCGCGGTGGCGGCCGAGGTACGCGGGGACTCGCGTCGGTTGGCGACTCCCGGACGTAGGT 

AGTTTGTTGGGCCGGGTTCTGAGGCCTTGCTTCTCTTTACTTTTCCACTCTAGGCCACGA 

TGCCGCAGTACGCGGGGGGGTGAAGAAGGGGCCGGCCTTCAAGCAACAGCGACGCAAGAT 

GGCAGCCACCACGGGCTCGGGAGTAAAAGTCCCTCGCAATTTCCGACTGTTGGAAGAACT 

CGAAGAAGGCCAGAAAGGAGTAGGAGATGGCACAGTTAGCTGGGGTCTAGAAGATGACGA 

AGACATGACACTTACAAGATGGACAGGGATGATAATTGGGCCTCCAAGAACAATTTATGA 

AAACCGAATATACAGCCTTAAAATAGAATGTGGGACCTAAATACCCAGAAGCACCCCCCT 

TTGTAAGATTTGTAACAAAAATTAATATGAATGGAGTAAATAG

Sequence 2175 

CCGCGGTGGCGGCCGAGGTACCTTAAAGTCCTCCTGGCTCTGAAGCTTCATAAGATGCGT 
GAAGAAGGTGAAGAGG AAGGNGAAGGAGTCGAGCGGCCGCCCGGGCAGGTA C I I I I I II I 
* 1 1 I I M I I I I i I I I I I I GG GGTT TATCATATTTAGGTTATTTATTAATGAAAATATATG 
ACATTTTCAGGAATACAAATTTTGCACCCTGTGACCTCAAATGCGTGCAACAAGATGTTT 
MTACAGAAAATAACACAAAAACTGTTGTTACAGGGGTTAGAATTTTTAACTTTA 
Sequence 2176 

CCGCGGTGGCGGCCGAGGTACTITCTTACAGTCTTCAGGAMTTCATTAAATCAGTGCCTC 

CAGTTCCTTTGGCTTCCAGTTTTGMGGGTCTTCAGAGGTCTTATTCTCCTTTGGCT 

GGCTTGCAGGAATCAGGATGTACTGTTCCTGTTGGCCGAGTGGAGACTGGTGTTCTCAAA 

CCCGGTATGGTGGTCACCTTTGCTCCAGTCAACGTTACAACGGAAGTAAAATCTGTCGAA 

ATGCACCATGAAGCTTTGAGTGAAGCTCTTCCTGGGGACAATGTGGGCTTCAATGTC 

Sequence 2177

CCGCGGTGGCGGCCGCCCGGGCAGGTACTTGTAGAGTGGTGCTGCTTTAATTCATAAATC 

ACAAATAAAAGCCAATTAGCTCTATAACTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAGGAAAAAAAAAAAAAAAAAAAAAAAAAGGTCCGGNNCANTCGNG 

GNGGCNTAACCNCNCTNGGGGNGGNTNTAGGGNTGGAAAACNCTNAAAATNNNTGTTTTN 

ANNGGGNTAAAGNNTTGAAATTTTGCCAGNTTTNTAAAGTCATTAAAGNCAAAAGTTTTT

TTAAGGGGTN I I I II I I I ITAANAAAACCCATTAACNTTCCATTGGGCCTTTANAAANGG 

CCCCCGGGGGTT TTNGG NNAMNTTNnTTTT^^ 

GGGGCCCCCMNTTTTTNGNTGNCCCCNCCCNCCCCCANTTGGGGGGGNNGGGNGGGGAN 

ANCCCCCNTTTTT7TTTTT 

Sequence 2178 

GGGCGAATTGG AGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTA Cl I I I I I 1 1 I I I I I I I 

m i i i 1 1 1 1 I I I I GCCMTTGTTATTTAGTTTTTATTTCATAATCATAAACTTAACTCTG 

CAATCCAGCTAGGCNTGGGAGGGAACAAGGAAAACATGGAACCCAAAGGGAACTGCAGCG 

AGAGCACAAAGATTCTAGGATACTGCGAGCAAATGGGGTGGNGGGGTGCTCTCCTGAGCT 

ACANMGGA ATGATC TGGTGGCMNAMTTAACCANTGCTTTTATTATCNANGCACTTNN 

CACTATGCACTTTTTCTGAAATATTTTTGTMCACTTTm 

Sequence 2179 

CCGGGCAGGTACAMTGATGAAACGGAMGACAMGGAAATTTTCCATTTTTGAAGAAAA 
AGTGTTCAGTGTTAATGGAGCCNGGGGAAACCACATGGACTTTGGTCAGCTCTATCAGTT 
CTTAMCACCAAAGGATGTGGGGATGTTTTCCAGATGTTCTTTGGTGTANAAGGACAATG 
ACATCAAGAGTNGTTGAANGTATCTTGCCACTGNTGGCCTTTTG A I 1 1 11 1 NTCCCACTT

TTTCTTGAMGATTMGTMTTTTATTTTAGTTCCATTCTAGMTGTTGGGGAGTGNGGC 

ACAAGAAAAAATANTATANCTGAAATGCATCTGTTAAAAATGTNATGATTGNAAGCATAA 

CTGAGTTTCA 

Sequence 2180 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGTAGCATGCCAGGAAAGGAAGAGAC

TGGGATGGTTTTTTATCTGTCGCTTTCTTAAATCAAGGGCCGCCGGGCCGGAGATGGATG 

GAGGGACCGGGGATTTGGGAACTCGAAAACGAGCTGAGGGAAGGGAGCCTGTGGAAATAG 

ACTGGAGTCTGGGTAGTGTCGTTTCCTAGAGAATGGTCTCGAAGTAACTTCTCGGTAAAG 

TCTTCACGGMTTTCCAGACCACGCTTGCCX)ACTGGGAGGCTTTTAGGACCCGAGACGTG 

TGCAGGCTTTTCCAGCCAAAATGAAGTTTAATCCCTTTGTGACTTCCGACCGAAGCAAGA 

ATCGCAAMGGCATTTCAATGCACCTTCCCACATTCGAAGGAAGATTATGTCTTCCCCTC 

TT 

Sequence 2181 

1 1 1 1 1 1 I I 1 1 I I I I NGTTACATAAATTAACCCATTTATTATAGGCCAGTGATGTCTCAAA 

GAGTAGAGGAGCGTCTACTGGTCTTTCMCTCCTTCAGTCTTCTGATGGCGGACTTTACC 

GNGACAGCGGAAGTGGTATTGNACCTGATTTTATTTCCAGTTTTCATCCGAATCCACTGG

GGMTGGGACGATTTTGCTTTTGTTTCTTGGCCAGGAATCGCTTAATCCT 

Sequence 2182 

AGGTACTCATCGGCCAGCACGGAGATGCACAGGTTAAATGGTTTACCATCCTGAAAGGGC 
ATATTGNGGCATGTCACCTCATACTGCCAAGCCCCATTCACGCGGCTGTTCATGACCACC 
CAATGACCAAAGTACCTGCCCG 
Sequence 2183

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGGGCCCGAAGCGTT 

TACTTTGAAAAAATTAAGAGTGTTCAAAGNAGGCCCGAGCCGCCTGGATACCGCAGGTAG 

GMTMTGGMTMGGACCGCX3GTTCTATTTTGNTGGTTTTCGGAACTGAGGCCATGATT 

AAGAGGGA 

Sequence 2184 

AGGTACGCGGGGACTCGCGTCGGTTGGCGACTCCCGGACGTAGGTAGTTTGTTGGGCCGG 

GTTCTGAGGCCTTGCTTCTCTTTACTTTTCCACTCTAGGCCACGATGCCGCAGTACGCGG 

GGGGGTGAAGAAGGGGCCGGCCTTCAAGCAACAGCGACGCAAGATGGCAGCCACCACGGG 

CTCGGGAGTAAAAGTCCCTCGCAATTTCCGACTGTTGGAAGAACTCGAAGAAGGCCAGAA 

AGGAGTAGGAGATGGCACAGTTAGCTGGGGTCTAGAAGATGACGAAGACATGACACTTAC 

AAGATGGACAGGGATGATAATTGGGCCTCCAAGAACAATTTATGAAAACCGAATATACAG 

CCTTAAAATAGMTGTGGACCTAAATACCCAGMGCACCCCCCTTTGTAAGATTTGTAAC 

AAAAATTAATATGAATGGAGTAAATAGTTCTAATGGAGTGGTGGACCCAAGAGCCATATC 

AGTGCTAGCAAAAATG 

Sequence 2185

CCGCGGNGGCGGCCGCCCGGGCCGGTACGCGGGGAACTGAAAATCCACAAGACAGANTAN 

CCAGATCTCAGAGGAGCCTGGCTAAGCAAAACCCTGCAGAACGGCTGCCTAATTTACAGC 

AACCATGAGGAAAGGCCiACTTAAGGATGCAGCAAGAAGGAGCCATCTGCAATCCAGGAAG 

AMTTCCTTGCCAGGAACCAAATTGGTTGTCACCTTCATCTAGGACTTCTAGCCTCGAGA 

ACTTACAAATGGTGATGATCATNAGGTCAAGGATAGTCTGGAGCAATTGAGATGTCACTT 

TAC 

Sequence 2186 

CCGCGGTGGCGGCCGCCCGGGCAGGTAC 1 1 1 1 I I I I I 1 1 I 1 1 I I I I I 1 1 I I I t I NGGGNA 

AAGAAACCTTTAATGAGGATTCAAGGTTAATAAGGAAGACNCAGAGGGCCAGCACTCAGC 

CCCAACCTNCTCACGTGTACCAAGAAAAATAAAGAAGAGGCTGCAGAATATGCTAAACTT 

TTGGCCAAGAGAATGAAGGAGGCTAAGGAGAAGCGCCAGGAACAAATTGCGAAGAGACNC 

ANACTTTCCTNTCTGCGAGCTTNTACTTNTAAGTCTGAATCCAGTCAGAAATMGATTTT 

TTGNGTAACAAATAAATAAGATCAGAGTCGGTAAAAAAAA 
Sequence 2187

CCGCGGTGGCGGCCGCCCGGGCAGGTACAAAGACTATTGTAGAGACTATCCGGGTTAGTT 

TGCAAGGGAAGCCAATGATGAGTAATTTGAAAGAAATTCACCTGGTGAGCAATGAGGACC 

CTACTGTTGCTGCCTTTAAAGCTGCTTCAGAATTCATCCTAGGGAAGAGTGAGCTGGGAC 

MGAMCCACCCCTTCTTTCAATGCAATGGTCGTGAACAACCTGACCCTCCAGATTGTCC 

AGGGCCACATTGAATGGCAGACGGCAGATGTAATTGTTAATTCTGTAAACCCACATGATA 

TTACAGTTGGACCTGTGGCAAAGTCAATTCTACAACAAGCAGGAGTTGAAATGAAATCGG 

MTTTCTTGCCACAAAGGCTAMCAGTTTCAACGGTCCCAGTTGGTACCTCGGCCGCTCT 

AGAACTAG 

Sequence 2188 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCCAGGACAAA 

TCTCTCCACTTGGAAGGAGATCCCAATCCTTCTGCAGCCCCAACATCCACCTGCGCACCT 

AGGAAAATGCCCAAAAGGATTTCAATATCCAAACAACTGGCTTCAGTGAAAGCTCTGAGG 

MGTGCTCAGATCTGGAAAAAGCTATTGCCACCACTGCTCTGATTTTCAGAMTTCTTCT 

GACTCTGATGGTAAACTTGAAAAAGCTATTGCCAAAGATCTGCTGCAAACCCAATTTAGG 

MTTTCGCAGAGGGACMGAAACCAAGCCAAMTACAGAGAGATCCTTTCTGAACTTGAT 

GAGCACACAGAAMTAAGCTAGATTTTGMGACTTCATGATCTTGCTCTTAAGCATCACT 

GTCATGTCAGATCTGCTACAAAATATACGGMTGTAAAMTTATGAAATGAACAGTTTTA 

MTATGCTGTATAAAATAATGGCAAAAGACAGTGTTATTAAAATGTTTCCATCTCAAAAA 

AAAAAAAAAAAAAAA 

Sequence 2189 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAAAGTTGAAAGTCACA 

AMCCATAGMAAGTCTCAGTTTCCTACTGTAGMTTCTTGGCATTTCAGTAGTCACAGT 

TAMCTTGTTCTAGGTTTT TC^ 

TAAAAATTCTTTCTTTCAAC I I I II I G ATTATAAGCCTTGCTTGCTTCGATTTGGGATTT 
AGGCATCGTTGTCCTTTATTTTCTTTC^ 

TTACTTGGGTACCTAAAACCCGAAGAACCTTCTGTAAGAAGTGTGGCAAGCATCAGCCTC 

ACAAAGTGACACAGTATAAGAAGGGCAAGGATTCTTTGTATGCCCAGGGAAGGAGGCGCT 

ATGATCGGAAGCAGAGTGGCTATGGTGGGCAGACAAAGCCAATTTTCCGGAAGAAGGCTA 

AAGACCACAAAGAAAGA 

Sequence 2190 

CCGCGGTGGCGGCCGAGGNAC I I I I I I 1 1 1 I I I I II I I I I I I I I II IC i I I I I I I I II I I 

TGCNANCACANAGGTCAGCAACAAGTTTATTTTGCAGCTAGCAAGGNAACAGGGTAGGGC 

ATGGTTNCATGTTTAGGTCAACTTCCTTTGTCGNGGTTGATTGGTTTGTCTTTATGGGGG 

GGGGNGGGNNAGNNGAANTCCAATNTGAATCCCACAGAAACCAGGGGCTGTANAAACANA 

ACCTGANAAGAACGAGCACTCAACCNAGCTNACTNNGGTTCANNANAAAAAATNCCAANA 

AGGAAAGCCAA 

Sequence 2191 

CCGCGGTGGCGGCCGAGGTACAAAGATAAGTCATCTCAGTAAAAGGTCTATTATCTAACT 
TGCCAAACTTGTTTACTGAGAGCCCTAAGGAACTAAAACTGCCATAATGCCGTGCACAGC 
TTGAAAAGCAATTAGAGTAAGCAAGATTAGTTTTTCCTCCCTTCCAGTTCCTCAGCAGGC 
CTGGCTGMGGCCCAGGAGGGANGGANTTTTNANACCCCCTTTAAANNTNNCTTTCGCNT 
TAAGAAGAGNCCCCCCGNGGGNGNCNCCC I I 1 1 J 1 1 1 I GGAANCNCCCTTTTCCCCGGGA 
ATNAGGG I 1 1 I I Ml I I II I INAAAAANTCCCCCCCNNGGGGGGGNCCNNCNNTNTTTTT 
I I I I M I I I INGAANCCCCCCCCCCCCCNGGTTNNNNCNNTCCTTCTTTNACAAAAAAAA 
AAAAAATTNCCNCNCCCCCAAAAAAAAANNNN I I I I I I I I NTCCCCNNCCCCCNNNTTTT 
TANAAAAAAAAAAAAAAN I I I II I 1 1 I I 1 1 I II 1 1 1 I M I N 
Sequence 2192 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGCCGAGA 

GAAGCAGTAGTCAATAAAGAGAGTGCCGTATTTCGCAGATTGGAGCTGAGCTGTGGCTGC 

CAGAAGATAGCGAACGAATGGAAACTGAAAGTGGAAATCAGGAAAAGGTAATGGAAGAAG 
AAAGCACTGAAAAGAAAAAAGAAGTTGAAAAAAAGAAACGGTCACGAGTAAACAGGTGCT

TGCAGATATTGCTAAGCAAGTGGACTTCTGGTTTGGGGATGCAAATCTTCACAAGGATAG 

ATTTCTTCGAGAACAGATAGAAAAATCTAGAGATGGATATGTTGATATATCACTACTTGT

GTCTTTTAACAAAATGAAAAAATTGACTACTGATGGGAAGTTAATTGCCAGAGCATTGAG

MGTTCAGCTGTTGTAGAGCTTGATTTGGAAGGCACCAGAATCCGGAGGAAAAAACCTCT 

GGGGGAAAGACCAAAGGA 

Sequence 2193 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGTCCATGGGTCCACAA 

AATCCTCTTCAGCTTCTGTGGCATCTGGGCCATGATTACTGGTAGGTGCTGGGTTCCCTG 

GAGGACAGTCAGCCTTGTMTCCTCCCCCGCGGCAGCTTTGTAGCTCATTTTTATGACAC 

CMCATCTTTACTGCAGAATCAATTTCATCCCAGGTTGTGGAAGACACTGCAGAGGTGGC 

CACTCGTCACGCT^^ 

TTTTTTCAGMGCCAATGTTCTCTAMTCTGCAGCTTCATTCCACAGCTTTACAGAATCA 
TAATCTCTTGMTATATTTCCAATGTTATTAAAAAATAAAAMTCATACAAGATATATTT 

AGCACATTAAAACTTAAGAGGTTACAGTATAACTGTCCAGACCTCCAGGTACCTGCCCGG 
GC 

Sequence 2194 

GTGGCGGCCGCCCGCCAGGTACATTTTTAMGAGTTGTTTTTNGGCCGNGCGCAGTGGCT 

CATGCCTGTAATCCCAGCACTTNGGGAGGCCGAGGTGGGCGGATCACGAGGTCTGGANTT 

TGAGACCATNCTGNCTAACACAGTGAATCCCGTCTCTACTAAAAATACAAAAAATTANCC 

AGGCGTGGTGGCTGGCACCTGTAGTCCCAGCTACTTGGGAGGCTGANGCAGGANAATGGC 

GTGAACCTGGAAGGAAGAGGTTGCAGTGAGCCAAGATTGCGCCCCTGCACTCCAGCCTGG 

GCAACANAGCANGACTCCATCTNANANAAAAAAAAAAAAAANCAANGAGNNGTTTTCTNA 

TGTTCATTATAGGGCATTACAGTTACATNGNCCGAAGGTCTTACAACTAATCACTGGGTA 

GCMTANATGCTTCNGGCCCACATGATGCTGATTAGTTCNCANTTTTCATNCAGTTGACA 

ATATTAACCCCCATTNCTCCCTCCCTGCCCAAGGNTCATAAANTNGTGACTGCCTAACAA 

CCA AAATT NGGMGGCCCANTCTTNATTTTCANTCAGACTTCTTGGAACCTGAAAGATTA 

AACNTrTTGGCTAACCCTGGAMTATCTTTTTATCTCACTTATAGCNTTNMGGCCATTG 

NATNAAMCNTGATTTCTTGGAAAGGGCCATACCCATAMNn"CMGGGGGGTTTTTCCT 

AAAAGTAAANGNTTCCAAATNAANNCCAAACANTTTCAACCCCAAAAAAAAAAAAAAA 

Sequence 2195 

AGGTACACTGAACACCAAGGCTCTCACTCTTGAGTCTAGGGCACCATATATATAAAGGGA 

GTTCGGAACTTGTCAAGGCAATCCTATTGTTCGTAGTCAAATCACACTGAAATCCGGAGG 

CCTGGTGTGAGMCCTACNAGCGGACACCGGGCATTTAATATTTTTTGCACACCCACACA 

GCCAGGGCCAAAGTGGTCAAGGCACTCTCCTAACACAGACAGGATCTTCTGAGTTGCAAT 

TCTTTCTGMGGMCATTTTTCACTTGAMTTCCATCAGAAAATTTCCTGAGATCTTGTG 

AGCGCTGAGGTTTCTGGTTTCATCTAGATCCAAAAACATGTCCAAATCACATCCTAACTT 

CCCAAAAGTGTTGACTGAGGAGCCAAAGGGTCTGACTATGCAGTCTGGAAAATACCCCGG 

CGGCCATGTCTTCAATAAGAGAACAGGTGAGATATCGGAGCTTAAGTGTTCTCCTCTGTT 

AGCTGGMCTCCTTCMGMGAGTGTTCMGCTTGATCCGTCTATACTTTCTGCATAACA 

AAGTMTTCAAAAAGCTGCTTGTTTGAACCGTGGCAACTGANTACTTGACCCGTACCTTG 

CCCCGGGCGGGCCCTNTAAGAACTAGTGGATCCCCCGGC 

Sequence 2196 

CGCCCGGGCAGGTACAGTGGTCCTTTTCAGAGTTGGACTTCTAGACTCACCTGTTCTCAC 

TCCCTGTTTTAATTCAACCCAGCCATGCAATGCCAAATAATAGAATTGCTCCCTACCAGC 

TGAACAGGGAGGAGTCTGTGCAGTCTGACACTTGTTGTTGAACATGGCTAAATACAATGG 

GTATCGCTGAGACTAAGTTGTAGAAATTAACAAATGTGCTGCTTGGTTAAAATGGCTACA 

CTCATCTGACTCATTCTTTATTCTATTTTAGTTGGTTTGTATCTTGCCTAAGGTGCGTAG 

TCCAACTCTTGGTATTACCCTCCTAATAGTCATACTAGTAGTCATACTCCCTGGTGTAGT 

GTATTCTCTAAAAGCTTTAAATGTCTGCATGCAGCCAGCCATCAAATAGTGAATGGTCTC 

TCTTTGGCTGGAATTACCAAAACTCAGAGAAAATGTGTCATNAG'GAGGAACATCATAACC 
CCATGGANGGAAAAAAGCCCCCCAAATGGGTGGGAACTGGATAATAAGCACTNATGCTTT

AAGAATTGGGCACACTTCTCACCTAAGGTGAGCGCATTGNGNCCAGGGGGTGCTTAAATG 

CTTACATACCTCCAACTGGAAATGGNTMGGGAAGAAGATTGATNCCAATTTNAAAAAAA 

AATTTAAAANCCCANTTTNAAAAAAAAAA 

Sequence 2197

CGCCCGGCAGGTACGCGGGGGTGGAGAGAGGCCTCTAGACTTCAGTTTCAGTTTCCTGGC 

TCTGGGCAGCAGCAAGAATTCCTCTGCCCCCCATCCTACCATTCACTGTCTTGCCGGCAG 

CCAGCTGAGAGCAATGGGAAATGGGAGTCCCAGCTGTCCTCGGTGCCTGCTCAGAAGCTG 

GGTTGGTTTATCCAGGAATACCTGAAGCCCTACGAAGAATGTCAGACACTGATCGACGAG 

ATGGTGAACACCATCTGTGACGTCCTGCAGGAACCCGAACAGTTCCCCCTGGTGCAGGGA 

GTGGCCATAGGTGGCTCCTATGGACGGAAAACAGTCTTAAGAGGCAACTCCGATGGTACC 
T 

Sequence 2198

CCGGGCAGGTACCAAAATTGTAAGAAGAAGCTTGGGAAGCTGCCACCTCAGTATGCCCTG 

GAGCTCCTGACGGTCTATGCTTGGGAGCGAGGGAGCATGAAAACACATTTCAACACAGCC 

CAGGGATTTCGGACGGCTTGGAATTAGTCATAAACTACCAGCAACTCTGCATCTACTGGA 

CMAGTATTATGACTTTAAMACCCCATTATTGAAMGTACACAGGAGGCAMGTGTTTC 

ACATTATAGACTTCACTTCCMCTCCTTGGAATGTTCATTTCTTTGGCTTACAGGAGAGA 

CTAGACAGGAAGGCCAGGCAATGCTTAGGCAACTAAAATGAGGTTGGGGGTAATGCTAAC 

GTCACCCTCACAGGGATGGCCACGGGGACTGTTATTCGCAAGCTGGTTTTCTAGGCCTGT 

TAGCTGGAAGCATGGTGAGCACCMTTTCTGGGACGCTCAAGGCCGTGTCGGGGCTTCAA 

GTCATCTTCACCACACCANGTACCTTNGGNCGNTTCTANNAACTANTGGGATNCCCCCGG 

GCTGTAGGGAMT7TNANTTTNMGCCTTATTCGATTACCCGTTGANCCTCTGGGGGGGG 

GGNC 

Sequence 2199

CCGCGGTGGCGGCCCGAGGTACAAGATNGNCATCTCAGTAAAAGGTCTATTATCTAACTN 

GCCAAACTTGTTNACTGAGAGCCCTANGGAACTAAAACTGCCATAATGCCGTGCACAGCT 

TGAAMGCAATTAGAGTMGCANGATTAGTTTTTCCTCCCTTNCAGNCTNAGNAGGCCTG 

GCTGAAGGCCCANGANGGAAGGAANTNTANNANCCANCANTAAAAATAGCNATATGCAAT 

NNNAAGAATGCCATCCCATGGAGCACACCA 

Sequence 2200 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACACAATTTTTAGAAT 

AGTTGATGTCTTTAAGATGGATATTTTACMTTTCAGTAAAAAAAATACAACATGAAGCT 

GCTGCTGTCACAGATCACTGTAGTAAAAAGATATAAATGCAATACCATGTCGTAGAACAA 

TATATATATCCTCTGATATTTTACAAACTTTGTACTGTGTCCMCAGCTGAAGGAATTTG 

AGGGGAAGACTTTAGTGTCAGTCACCAAAGAAGGCCTGGAACTTCCAGAGGATGAAGAAG 

AGAAAAAGAAGCAGGAAGAGAAAAAAACAAAGTTTGAGAACCTCTGCAAAATCATGAAAG 

ACATATTGGAGAAAAAAGTTGA 

Sequence 2201 

GCTCCACCG1TTTGGCNNCCGAGGCCCCTTGTCCTCTTCTTCAGTGACTTAAAACAATTC 

CAGGATCAGAAGAGMGCCATTTTGACATCCTCGATAAAACTGGGGATNNGCTGNANTTC 

TGTCTGTTCACGMGTGGTTGAAAMCMTTTCGAGATCCAGAAGTCTTGATGGGTTCAC 

CATCCAGGTGTTCACAAAAAATCAGAGAATCTCTTTCGAGGTGCTGGCCGCCTTCAACGC 

TCTGAGTAAGCATTGCTGGGTGTCAGGAGAGAAAAGCCAAAGAAGCGGGTGCCAGACAGC 

TCTGTGCAACCTCTAGGCCATGAGTGGGATAGATACCACTGCTGCTTTAAAAAATGGGAG 

ACCATAGACCCTCAGGAGAAAAGAATCCCTTCTACCCTGGACTCGCTCTCTTCTCTGGAA 

CTAACTTCTCCCCCATACCCTGATTGTCTTTGGAGAAAATGTTCTGGATTCTAGAATCTA 

AGGCAGAGCCTTTTAAGCCATACTGTACCTGCCCG 

Sequence 2202 

AGGTACAGACAGGGTTTCTTCATGTTGGTCAGGCTGGACTCGAACTCCTGGTCTCAACTG 
CCTCAGCCTCCCAAAGTGCTGGATTATAGGCATGAGATACCGTGCCTGGCCTCCATCACT 



GATTCTATGTATGCACATGACTCCTTTTTTACTTTTACTTAGGAGGCTTAAMCACMTT

ATTCATTCAATATAATTATAMGCAGAGTCTTTGTTAATTTCTGAACTTGGTAAAAACAT 

Nil I ATATTTTATCTATAGCTCATTMTTTAGGTATCAAAAATGTGTTTTAGTTTAGGT 

AGCCCMGCTGCCMTTGTTTGATATCATCATCCTCTTCTTCTGCCCTCTGGGAAGATGC 

TGCTCGGGATCGAC 

Sequence 2203 

CCGGGCAGGTACTCCAAAGATTCAGGTTTACTCACGTCATCCAGCAGAGAATGGAAAGTC 

AAATTTCCTGMTTGCTATGTGTCTGGGTTTCATCCATCCGACATTGAAGTTGACTTACT 

GMGMTGGAGAGAGMTTGAAMAGTGGAGCATTCAGACTTGTCTTTCAGCAAGGACTG 

GTCTTTCTATCTCTTGTACACTGTATGCATCATCAGTCCAMGTTTTTCAGAGTTCTATG 

TGACTGGTTGAGGAAACACTGCTTGAATTTCCCCATAGCCTCTGCAGCGGCATCACTGTC 

TATGMCTCTTGAAGAAGCTCTTTGTATTCAGGTATAGATATGTCGGAATTGATGGTCTT 

TTCAACCATGTCCTCCAGGAGTTTGCAGCCAGAATCTGCATAGCAGTGCAGGAGGAGGGC 

CGCCAGCATGAGGACCATCAGCAGCTTCATGGCGAGGCGGCTGCTGTCTGTGTTCAGTCG 

TGTGTGGCAGGGATCAAGGAAGTTGCTGTCCCCGCGTACCTN 

Sequence 2204 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGGTTGNCAGGATTCAA 

GCCGAATGGCTTGCCTNANAGGGNCAATGGCGTTCTGAGATGGTGGCCA 

Sequence 2205 

ACACTACTATAGGGTGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCTTGGTTTAT 

CAATGGGACGTTCCAGCAATCCACACAAGAGCTCTTTATCCCCAACATCACTGTGAATAA 

TAGCGGATCCTATATGTGCCAAGCCCATAACTCAGCCACTGGCCTCAATAGGACCACAGT 

CACGATGATCACAGTCTCTGGAAGTGCTCCTGTCCTCTCAGCTGTGGCCACCGTCGGCAT 

CACGATTGGAGTGCTGGCCAGGGTGGCTCTGATATAGCAGCCCTGGTGTATTTTCGATAT 

TTCAGGAAGACTGGCAGATTGGACCAGACCCTGAATTCTTCTAGCTCCTNCAATCCCATT 

TTATCCCATGGAACNCTTAAAACMNGGGTTGGTTTTGGTTCCTGAAGCCCTTTTTTGCT 

GGGGMTGGGGCACCTTAATGGAAAATTTAAAGGGGAAAANCCCTCAGGCCTGNNGGGGG 

NGGGGCCCCTTCACAGCCTTTTNCTTAACTNAGAAANCGGGGAAAATTGNAAACCATTGG 

GGGAGMMTGGACNACTTCNCACTNTTGGACNGGTTTTTTTCCAAAANTGTNAAAACAA 

GAATTCTTTATTAATGGAMNGGGGTTTTACCCCCCTTTTAAATT 

Sequence 2206 

CCACTATAGGTTTATGGGAACTCCCCGCGGNGGCCTCACTTTTNACCTTCTGGAANCCGC 

CCACCACTTTCAAGCTCACTATTGAATCCACGCCGTTCAATGTCACAGAGGGGAAGGAGG 

TGCTTCTAGTTGTCCACMTCTGCCCCAGCATCTTTTTGGCTACAGCTGGTACAGCTTTC 

TTCGTCCTCCATGCTAAGAGATGTAAAAGCTTAAGGGTCAAACAATACCAATTGTATAGG 

CTTCAAAAACCATCTMGTTAGGGCATTCTlsrTAGTTTTAGCTAAGATACACCTGGAACAC 

TGACAAGTCATCACTTACATAGMTMTGTGMGTAMTTTTTTGAMAATAMTTTTAG 

TGGAACAATCCTGAAGGATAACACCAGAAGAATAGCAGGTTACCAGTAAGGGGTCAGCCA 

ATTTGTTCCAGTCACTTTTGAATCCATGTTCTATMTCTAAAMTTATTCTCTTTCCCTA 

AGCTGAGAGCTTCCTATCATGTCAGTATCTATGNTATGAAG 

Sequence 2207 

GGCGCCGAAAGCGAAGAAGGAAGCTTCAGCCCCTCCAAAAGCTGAAGCCAAAGCGAAGGC 

TTTAAAGGCCAAGAAGGCAGNGGNGAAAGGGGNCCACAGCCACAAAAAGAAGAAGANCCG 

CACGNCACCCACCNTCCnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnCTNGGGCGGAAAACAANGGCAAAAGCNGGNNCC 

CGGGGGGGAAAANGGGNAACCCGNNNCACAAAANCCACNACAACAAACCGAGCCCGGGAG 

CAAAAAAGNGGGAAAAGCCCCGGGGGGGGGCCCAAAGGAGGGGAGGCCAACCCCANAATA 

AAANGGGGGGGGNGCCCCACNGGCCCCCNTTCCAAACCGGGGNAAAAC 

Sequence 2208 

GGCGAATTGNNGCTCNTTCGCGGTGGCGGCCGAGGTACTGAGAGAGGTGGTGGACATACT 
TACAGAGCAACTGCGTGACATTCAGCTGGAGAATTACACACCCAAGGAACCCCTCACCCT 

GCAGGCCAGGATGCCTTGTGAGCAGAAAGCTGAAGGACACAGCAGTGGATCTTGGCAGTT 

CAGTTTCGGTGGGCAGATCTTCCTCCTCTTTGACTCAGAGAAGAGAATGTGGACAACGGT 

TCATCCTGGAGCCAGAAAGATGAAAGAAAAGTGGGAGAATGACAAGGTTGTGGCCATGTC 
CTTCATT 

Sequence 2209 

AMTCCCGCGGCGGCCGNTCTAGMTTAGGGGAN^ 

Cnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnAGGGNAAAGAGGGNNCCNGGGNGAAAA 

NGGGGAACCGCGCACAAANCCACACAACAAACCGAGCCGGGAGCAAAAAGGGGAAAGCCC 

CGGGGGGCCNAAAGNGGGGNGCNAACCCAAAANAAAANGNGGNGGGGCNNAAAGGGCCCG 

GNTTNCCAGGGCNGGGGAAAAACCCGGNCGGGGCCAACNNGGNAAAAAAAAGGAAAANGG 

GGNCCAACCCCCCCCNAAAAAAAGGGCCCGGNNNNGGGGGAAAAAGGGGGGGNGGCCNNN 

NACCGGGNNANCCCCGGGGNAAAAAAAAANAAANNNNNNGGGGGCCCCGGGGGGGGGNNN 

NGNNNGGGGGGGGGANNGGGGGGGGNNNAANGNNNAAANCCNAAAANGGGGGGGGGGAAA 

AAGGGGGGGNNNCCNCAAAAAAAAANANGGGGGGGAAAACCCCGGGGGNAAAAAAAAAAG 

GGGGGGGAAAAAGGGCCCCCX^NAAANGGGGGGGGGAACCCCNAAAAAAGGGGGCNNNNNN 

GGGGGGGGGNNNNAAAAAAGGGGNCNNCCCCCCCGGGGGGGGG 

Sequence 2210 

AANCCCCGNGGGGGCGGCCGNGGNNCCCCCGACGGNNNCAACCCCGNNGTANAAAACCCC 

CAGGAACAGAAGAGAAGGGGNCGGGNCAAGNNCGAAANCGNGGGGGAAAAGCAGAAGACC 

CGACNGNCCACGAAGNGGCGCGAAAAACAAAANCGAGAACCAGAAGGCCCGGGAGGGGAA 

CACCANCCAGGGGNCCACAAAAAAACAGAGAAGCCCNNGCGAGGGGCNGGCCGCCNGCAA 

CGCGCNGAGCNNAAAGGAAAACCCCAGCCCCCGGAACCANCGAGAGCNCAAAAGANCCNG 

GGAAAANACAAAGGCCAGNCCNGGGGAGAANNGCAGNCCGCANCACNGAACCCCAGCAGA 

NAGGGNGNNNGGACAACCGGGCCCGGAAAAANA 

Sequence 2211 

TACTATAGGGC GAATTGGAGCTCNCCGCGGNGGCGGCCGCCCGGGCAGGGNACTTTTTTT 

u » i 1 1 i i i > i iGGNGGl 1 II 1 1 1 n 1 I I 1 I I 1 1 1 I U 1 1 1 1 1 I > 1 ] I ) H n ) 1 1 1 1 I I 

I 1 1 H I I n 1 1 II I I II I I I I I I I I 1 1 i I | 1 1 1 1 NAAAAAAANAGCCAATTTTTNNAAAN 

NNAMNNTNCNNGGNrTAMNGGNNNNNGNNAAAMTTTGGGGGGNAAMGGGNTNNTTTN 

ANCCCACTGNGNGGGGCANCCCTTANAANNNGGGGGAANNCCCCNGGNAAAAAANGGGGC 

NGGGGNCCCNNNTNNGNANNNTTCCCNGGGGGGGGGCCCCANNAAAANNNNNNNCCCCGG 

GGNGGGGGGNNCCCCANTTTNNCCNAAAAANCTANGTCCCCGGGNCCNNNAAAAAAAAGG 
GGGC 

Sequence 2212 

CCGGGCAGGTACCCGGGTGGTTAAACTTCGCAAAATGCCTAGATATTATCCTACTGAAGA 

TGTGCCTCGAAAGCTGTTGAGCCACGGCAAAAAACCCTTCAGTCAGCACCGTGATGAAAA 

CTTGCGGAGCCAGCATTTACCCCCGGGACCATTCTGATCATCCTCACTTGGACGCCACAA 

GGGGCMGGAGGGTGGTTTTCCTGMGCAGCTTGGCTAGTTGGNCTTATTTAACTNTGTN 

GACCTGG ACCCT NTTGGTTCCCTCAATTCNGAGGTTCCCTCTTACCGAAGNAACACACCC 

ANGAAAAATTTTGGTCCATTTGCNCACCTTTTCAACCCAMNAATTCGGATNTTCAAGGC 

AMTTGTTANAMTAMTCCCCCAAAAMCMTCNTTMCNTGGMTGGCCTTTACTTTT 

CA 

Sequence 2213 

CCGC GGTGGCGGC CGCCCGGGCAGGTAC I I II M 1 1 1 1 1 1 1 1 M I II I M I NACCGGAAA 

TTGTCTTTTATTTTTCTTTGCTGGGGTTCMGGTCATTTNTTTMTCAAACCTC 

GGATATAGAGTCATCAAAAGAAAAANCATTCTTCTTCCTCTCTGTTTCCCTAAAAANAAA 

ACAGTCACAGGAGACTTTCCTTTTTTCTTAATTGTTCTCATCTCATCTTTMTTA 

GCTGCTGTTATGTGTTTATGTTTCCC I 1 1 I I I I I I AAGTTTTGAGGGTAGATCGNGTCAC 

TGGGCGTMGANACTGAGAAATCAGGTAGCTTCACTGGCTGGTAGGGGACTTTTGCTTTT 
GAAACGTCACAGGCTGTCTTCCTTTGGATGTAATG6GACGTCCTGATGACCCA 
Sequence 2214 

ACTTAGGGCGATTGGA GCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTA CI 1 1 1 1 1 1 1 1 1 1 

I 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 NATTAGGGTTGAAAAGGTAAGTCCTATATTANATACATAGGG 

GCTGGTAGTCTTATCCTCTNTATAMTACNCACTTTTTGNCACTGCCCCTTCXX)TGCCAT 

GCAGAGTCCCCATGGCTGAATCTCCCTATGTCTCCAAGNGNGCGTCACTTGCTGTCCTNT 

GGGCTGGGTCCCATGNGCTGNGCCCACAGTCTT 

Sequence 2215 

CCGGGCAGGTACMTACACTAGAMCCMCATMTGTATTTNTTTAAAACCTGTGTGAAA 

AAATAAATGTTCCACCAGTAGGGATAGGGGAAAAGTAACCNAAAGAGAGAANNGGAGAAA 

GGGMTGCTGGTTTATCTTTTGTANGATTGTMTTCGMTGNGAGAAATTTTGCAGTATT 

TTAGCCACTTATTAAGGGAA 1 1 1 1 1 1 I 1 1 1 1 GTAAAA^JTGGAAGACCTGANACNTCTGTT 

CAAAATGGCTTTTCANTGGAACCCTTGNGThJTTGGAGACCGGTTAGNGAAAAGGCANACA 

AAAACCGTGGGTAACCCTGGTGGACCTAAAGGGGCNCTGNGTGCCAAGGGNACCTTGGGG 

NAAATTGGTCCATTNGATTA^AGGAATGGGTGGGGGGTTTTTTTCCCCCCCCTTTTTNA 

GNAAANTGGTNTTGNGGATTATTTTAAAGNTGGANTTTTAAAAAACACCNI 1 1 1 1 1 I 1 1 A 

AACTTCCCG G AAA 

Sequence 2216 

C CGGGC AGGTAC 1 1 I 1 1 1 1 I 1 1 I I II 1 1 I 1 1 1 1 1 1 1 1 1 1 1 ATACATTAAATATTTATTAA 
T GTTT TGGGAATGTTGTGTATCAGTAATTATACCTAGTAAAAAGAAAGATCCACATCCAA 
ATTTTCCAGAAAACCGGGTGGCCAGGTNNGGGNGGAAGAAGAGCATTTTAAAAACCAAAA 
CCAAAACAAGGGA AAAGC TAAAAAGGAGTTTNTTCCAAATANTCCTGAATTCATTAGGAG 
GGTTGGMTTNACATTTTTTCAGGNCTNTATTCCCCCCAMGTTGAAAT7TTTTAAGGGG 
AC CTTGGA NAAAGNCTTTATTACCCCCAThfTGAAAAACAATGCCTGCCAATGGATTTACC 
CTIsnrmCACGGTTCTTCCAGGGTTCCCAANTNACCGNTTGTCCTGGTNMTCCCTTTT 
' TGGGCCC I I 1 1 M 1 1 1 1 1 I 
Sequence 2217 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGAGAGCTGTGTGCATGCT 

GAGCAACACCACAGCCATTGCTGAGGCCTGGGCTCGCCTGGACCACAAGTTTGACCTGAT 

GTATGCCAAGCGTGCCTTTGTTCACTGGTACGGGAAGGATTCCTGCCAGGGTGATTCTGG 

GGGTCCGCTGGTATGTGGAGACCACCTCCGAGGCCTTGTGTCATGGGGTAACATCCCCTG 

TGGATCAAAGGAGAAGCCAGGAGTCTACACCAACGTCTGCAGATACACGAACTGGATCCA 

AAAAACCATTCAGGCCAAGTGACCCTGACATGTGACATCTACCTCCCGACCTACCACCCC 

ACTGGCTGGTTCCAGGAACGTCTCTCACCTAGACCTTGCCTCCCCTCCTCTCCTGCCCAG 

CTCTGACCCTGATGCTTAATAAACGCAGCNACGTGGAGGGTCCTGATTCTCCCTTGGGT 

Sequence 2218 

CCGCGGTGGCGGCCGCCCGGGCAAGGTAC 1 1 I 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 GGGATGGAGT 

CTTGCTCTGTTGCCCAGGCTGGAGTGCAGGGGCGCAATCTTGGCTCACTGCAACCTCTTC 

CTTCCAGGTTCACGCCATTCTCCTGCCTCAGCCTCCCAAGTAGCTGGGACTACAGGTGCC 

AGCCACCACGCCTGGCTMTTTTTTGTATTTTTAGTAGAGACGGGATTTCACTGTGTTAG 

CCAGGATGGTCTCAAACTCCAGACCTCNTGATCCGNCCACCTTGGCCTCCAAAAGNGCTG 

GGGATTACANGGCATGAGCCANTGGGCCCCGGCCAAAAAANAAAI^rnTTTAAAAANGGC 

NCCNNGCCCCTnTAAAAATTGGGGNGNCCCCNNGGGGGNGGGGGAAAATNNAAAAAAAA 

TTTNTNNCCCCCCCCC 

Sequence 2219 

GGCGGCCGAGGTCGGCCCACTGGGACTGAGATACGGCCCAGACTCCTACGGGAGGCACGC 

ATGTAAGGAATTTTCCACAATGAGCGAAAGCTTGATGGAGCGACACAGCGTGCAGGATGA 

AGTTCTTCGGAATGTAAACTGCTGTTATAAGGGAAAAAAAAAAAAAAAAAAAAAAAAAAN 

GTACCTGCCCG 

Sequence 2220 

CGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTT 
TTNTNNTTTNTTN I I I 1 1 I I I I I I I CCCGGCCATATTCCATTNTTTNANAACTTGAATTG 
CNCTGCCATCCACAMGGCTNGCNCGGTTTTATCCACATTAAAATCAAACTGTNGGAGCN 
CCAGGACTATTTCATTATTGCTTTTGTNGGGMCMCTGATCTMCTGCATAMTCTTTT 
TCCTTGACATTCACATGAGThCTTGGNGTTTCANCW 

NMTTTCNG GAAMNANG GGGGANCCTTTTNCCTTTTTACCCCAAAANTCCCNAANGGNA 
AAANCCTTN 1 1 1 I I I 1 1 1 1 AMNTTTTTTTCCCCCNNNTNCCCAAAAAMNNGNnTTTTT 
1 I M 1 1 11 1 I 1 1 1NTTGTG NGGGGGGNGGGGNGNGGGNANAAAMNANNCNNNTTTTTTN 
NNNNNNTN I I 1 1 1 1 I I I I ANAAAAAAAAAAAAAAAAAAAAAAAAAAANNNGGGNGNNNTT 
TTTNNNNNANAAAATNNATNNNNN 1I 1 M I 1 1 I I I I 11 1 i 1 1 
Sequence 2221 

CACTACTTAG GGCG ATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCTTTGT 

TTTGGCACACTTTTCCTGACAAACAGCCAGTGTTCTCAACACATAAATACTAGTCCACGT 

TAACAACAATAGCATATGAGACCGCTCTCCGTAAAGATGCCAGATTGGATGCAAATGGAC 

TGGAMTACCTTGGAGGGTTTCACAAAAATAAGACAAAGGGCAAAGGAACTTTGCCAAAG 

GAGATGGAGAGCAATTCTTTAAAGTTAGTGGGAGGGAGGAAGCAAAGAGCTCATAAATAC 

AAGCCTNTTAAAATGGGACGCATTTGCCTCGCGCCTACTGGGGTGTCTGCAGCTCAGCTT 

GGTGCCCCACACAGGACACCG 

Sequence 2222 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCGCCGGAAGTCTCTACTGAGGAAAGC 

TATGAGGATACTCTGTTCGTAAGCTCCCGGTGAATTTTGTTCCACAGACTCGGAAGAAAG 

GTTGGATAAGAGTTCACTGGAGATTGACAAGTACCTGGAGGAGCCCCTCACCCAAGTGCG 

CATCTTAACGGCATTGGTGGAAGCTGGGGTCAGAAAAGAGAAATGACCATTTGGAGGGGC 

GGGGCCTCCTAGMGAACCTTCTTAGACAATGGGGGGAGGATGGGACTTTGT7TTTTCCA 

AGAATAAACTTCAACTCCTGT 

Sequence 2223 

CGAGGTACTTCCCACCACAGTGTTGAAAGGGAGAGCAAAGTCTTATGGATAAACCCTCCT 

TTCTTTTGGGGACACATGGCTCTCACTTGAGAAGCTCACCTGTGCTGAATGTCCACATGG 

TCACTAAACATGTTATCCTTAAACCCCCGATGCCTGAGTTGAAAGGGCTCTCTCTTATTA 

GGTTTTCATGGGAACATGAGGCAGCAAATCTATTGCTAAGACTTTACCAGGCTCAAATCA 

TCTGAGGCTGATAGATATTTGACTTGGTAAGACTTAAGTAAGGCTCTGGCTCCCAGGGGC 

ATMGCAACAGTTTCTTGMTGTGCCATCTGAGMGGGAGACCCAGGTTATGAAGTTTTC 

CTTTGMCACATTGGTCTTTTCTCAMGTTCCTGCCTTGCTAGACTGTTAGCTCTTTGAG 

GACAGGGACTATGTCTTATCAATCACTATTATTTTCCTGTTACCTAGCATGGGACAAGTA 

CCACCTGCTGGGGATCACTCTCTAGGATCACCCCTCCATCCACAATTTCATCCACAAGCC 

MGMCMGCCCCTTCATGTTCTCCAGCAGTGCTCGCTTTTCTAC 

Sequence 2224 

TCACT^AGGGCGMTTGGAGCTCC CCGNGGTGGCGGCCCGCCCGGGCAGGNACTTThfTT 
M I 1 1 II I I 1 1 I I I IN CI I I I H I I 1 1 1 I I 1 1 [ I I I n I I GGC C I MM I AATATCTTTA 
TTTGACAAMTAMAGTCAGCAACCTATCTCGATTTCC 

TCCAATTGANACCTAAAGCATAGCTCTGGCCTTGGAGAGATTTCCAGGAGAGTCANAGCC 

CANAAGGGAGCAGGATCCAGGAGGCCCTCATNTCCCAGCACTCCAGCTGAGCCAGCCGGG 

TTATGGAACATCACTGAGCAATTAAMTATTATCAACAGACAAAAAAAGTTTATTGAATA 

CAAAACTCAAAGGCATCAACAGTCCTGGGCCCAAGAGATCCATGGCAGGA 

Sequence 2225 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTGGGCTCTTTCAGGTCCCCCT 

TCCTCTGCAGACTCAGCCTCCAGGCTGTTTAGCTCTCCCAGATAGCTGCCCCTGCCACGC 

AGGCCAGGAGTCTTCACACTCAGGCACCAGGCCTGGTCCAGGAGGAGCTGTGGCACAGTC 

GTGGTTCAAGTGTCCACATGCACCTGTTAGTCCCTGAGAGGTGGTGGGAATGGCTGCTTC 

ATTCCTCGAGGATGCCCGGGCCCCACCTGGGCTTGTCTTTCTGTTTAGAGGGAAGTGTAA 

CATATCTGCCATGAGGAACATAAATTCATGTAAAGCCATTTTCTCTTAAACAAAACAAAA 

CTTTCTAAGTACMTTTGTTACAAATAACGCAGACTTCAAAAACAAAAAAATNACAACCC 
AAACMACCAAMTTTMATGATCAGAATTGGCMGCACAAAGAAAACGCCCTNTCCTGA 
CTT^ATTGTGGGCAGTTCTGAACGCCCCCAGAAMTTTGTGCCAAAGAGTTTTAGAAAA 
NTAAATATTCCAATAAAAGTAAACACTATACNCCACCAAAACAGGC 
Sequence 2226 

GGCGAAT TNGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTA CI I 1 1 1 1 1 1 1 1 I 1 1 1 1 1 
I I I I 1 1 1 I C TMCT TTATATATMTACTTTATTTTATACAATTCAGTATATCATGTTAAC 
ATATTTCACTTTTAATTTTATGATATGTGTGACATATTTTTAATTT^ 
TACTATCATMTTTTTTTAMGTTTG 
GATTTTTTTTCCTGTAGCTTNTGAGMCACATTTTTA^ 

TGAAGCTCCACAGTGTANACATGTTTTGGCCAACCTTGTTTTATCGGNGTATGAAATTTG 
NGCTAATAGGAGGGATCCAATTGTCATTACATTAGAAAACTAAATGGGAAA 
Sequence 2227 

CCGG GCAGGTACGCGGGGCCCTTCTAGAGGCAGGCAGAGGGAAGAGAAAGGGTCTGTTGT 

TTTTCTCTCCTGTTTCTCGCTCCCTCTCTGCTGATCACAAAGCTGCTGACCGGGTCAGAA 

AGTCCTGATGGAAATCCACCAGCGCTGGGCAGGCCCCTCCTCCTCCAGGGAGCTTGTCCT 

TGCCTAATTTTTCTTCGTCCTGATGAGAACAAAAAAGAGAGAGAGAAGAAAAGAAAAACC 

ACAAACTTCCTTTGAAAACCAGCTTGTAGTCAGGGCCCGGAGCGCATGCCATAGACTCGG 

CGACTCAGGAATCCTGAAGACTCTCTGAGCGACCTGGAGCACCTTGGGCTGTGTCCCTGC 

CTGCC7TCACCCTCCTCCAGTGCCCCCAGTACTMGGAATCTTTCTGTTTTGGGGTT 

Sequence 2228 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGACACAAGACATCATCTTGAAGGAAGG 

ATGGCTTTGGCCAGACCAAGACCMGACTTGGAGACCTGATTGAGATTTCTCGCnTTGGC 

TATGCACACTGGGCCATCTACGTGGGAGATGGCTATGTGGTCCATCTGGCTCCGGCAAGT 

GAAATTGCTGGAGCTGGTGCGGCCAGTGTCCTGTCTGCCCTGACCAACAAAGCCATAGTG 

AAGAAGGAACTGCTGTCTGTGGTGGCTGGGGGAGACAACTACAGGGTCAATAACAAGCAC 

GATGACAGATACACACCACTGCCTTCCAACAAAATCGTCAAGCGGGCAGAGGAGTTGGTG 

GGGCAGGGAGTTGCCTTATTC 

Sequence 2229 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGCCAGGTACTANAAGATATT 
GATCCTAGTCAATTAGGCATTGTAGACTGTNATGACCACTTAATAAAAAATTNTGGACCT 
GANGCTCACGAGCATCCAG 
Sequence 2230 

NTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGNGGTACCTGCAGNNCTNNTACACCT 

ACCT CTCTNTGGGCTTNTATTTCGACCGNGATGATGTGGCTCTGGAAGGCGTGAGCCACT 

TTTTCCGCGAATTGNNCGAGGAGAAGCCGCAGAGGGCTACGAGCAGTNTCCTGAAGATGC 

AAAACCAGCAGTGGC 

Sequence 2231 

TTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGTTTCAGAGTTTC 

ATAAGACACAGTCTCMCAGGAGTTTTTTTAATAAATGCTCTATCTCTAGTTCCAGAAAC 

TGAATTCCAAGAAGCTACACTGAGGATAATTCAGCTCTGATATTGTGATTACTGTGATGT 

TCTTTCATTCATACAGTAAGTATCTGCCCAATACGTAACTACCGAGATCTATTGCTTCCT 

ACATAATTAGACAAGCTCCTTACACATATGGGCCCATGCCCGGCACATAGCAGGCACTTA 

ACAAGGGGITGCTTMCTAC^^ 

TTTACCCTTTTTCTATACNTTGNTATATTTTGGAAMCNACATGCTTGCAAAACTAAAAA 
TCTMCATGCATTACTAACTTTATAAAAGATCCTTCAGTATTTTTCAAAAAGGGAAAAAA 
TNATTAAAACCAATCCCCCAAAT 
Sequence 2232 

TACTTAGGGCGAATTGGAGCTCACCGCGGTGGCGGCCGCAGAGCTTAAGAGACCCTTCAG 
CTTCTCGGTGTAGGTCACCAGTCATGTCACGCTCACAACCCTTTCGCCTCCCTCTAGGTC 
AGGCACCGCATCCAAGTTAAGCAACCGACGCCGGCTCCTTGGTGTCCCAACAAGGTCTAG 
TAGAAAAGCAACTTTATAATTCGTTTCACAGATTTGATTAAGAGCCAACTGAGTAGTGGG 



WO 01/070979 



PCT/US01/09126 



TABLE 1 

372/467 

CCTTGTGCTAGGGCTCCGAGGGCCCAAGGACCCCATGGAACCAGATTTAGTCGCTGACX5T 

CTAGGAGCTCACAGGTGAGTGACTCCCXJCCGCGTACCTCGGCCGCTCTAGAACTAGGTGG 

GATCCCCCGGGGCTGCAGGAATTCCGATATCAAGCTTATCGAATACCCGTCGACCTTCGA 

GGGGGGGGCCCCGGTACCCAAGCTTTT 

Sequence 2233 

CCGGGCAGGTACCGNTGTGTCCGGGTGGGTGGTCAGAATGCCGNGCTCCAGGTGTTCACA 

GCTGCTTCGTGGAAGACCATGTGCTCCGATGACTGGAAGGGTCACTACGCAAATGTTGCC 

TGTGCCCAACTGNGTTTCCCAAGCTATGTGAGTTCANATAACCTCANAGTGAGCTCGCTG 

GAGGGGCAGTTCCGGGAGGAGTTTGTGTCCATCGATCACCTCTTGCCAGATGACAAGGTG 
ACT 

Sequence 2234 

ATAGGGCNAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATTGGCACGTCA 

CGATGTCTTGAGTTTCATTCACTAGGTGGCAGCCTGCATCGTTCCACTGCAAATGACTGA 

AATCCCAAAACACACAATG AGGC TGGCTCAGGTTTGACTCTATCTTGGAAAAAAATAGGA 

AAACTTCATTTATGGMTAGTTTTGAATAACCGTGGATATCACAGGTCCATTGACCTGAG 

CATTTCCATTTTTGGAAACGGGTAG 

Sequence 2235 

TACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGTCATCACACGTGACAGGATGTCA 

ATGACMTCCTCCAGAGTTTACTGCCATGACGTTTTATGGTGAAGTTCCTGAGAACAGGG 

TAGACATCATAGTAGCTAATCTAACTGCGACCGATAAGGATCAACCCCATACACCAGCCT 

GGAACGCAGTGTACCT 

Sequence 2236 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAATATGAACCT 

GTGCTTTGCAGCAGGCCAACTT GTAAA GCTGTTCTCAACCCACTTTGTCAGGTTGATTAT 

CGAGCAAAACTTTGGGCCTGTMTTTTCTGTTTTCAMGAMTCAGTTTCCTCCAGCTTA 

TGGAGGCATATCTGAGGTGAATCAACCTGCCGMTTGATGCXXJCAGTTTTCTACAATTGA 

GTACGGGGNGACAGCGGNTTCCTTGATCCTTGCCACCCGCG 

Sequence 2237 

GGCNAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCGGGTACNCNGGGCCCTGTGCTG 
TCTGCACCGAGGAGAGCGGCCTGNCGGAAGNGGGCCACCATATCTGGAAACTACAGTCTA 
TG NTTN GAAGCGCANAAGGGAATAAACANTTAANGACTCCCCCNGGGACCTGNAGGATGG 
NCTTTTCCATGGGGGCCGGAGCNGCAGCTTACAATGNAAAATNACANACTGGNGCTI^TTG 

GAGAAAACTATANTTGGCATAATTCCCATTAACCACNATGACTTCAAAATTTTAA 
Sequence 2238 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCCTGA 
AATGGCTTCTGCAGAGATGGACCTATGCCGGGGACAGCACACCTGGACTCAAAAGGCAGG 
G ATTGG ATAGGAAGAGGAATAAAATATAAAAATCAGAGAACTGCTGAAATTCTGTGACCC 
CI I 1 1 1 AGTTAAAAAAAAAAAAAAAANG AAATAAAAGTACCT 
Sequence 2239 

GGTACATTTTTAMGAGGTTGTTTTTTGGCCGGGCGCAGTGGCTCATGCCTGTAATCCCA 

GCACTTTGGGAGGCCGAGGTGGGCGGATCACGAGGTCTGGAGTTTGAGACCATCCTGGCT 

AACACAGTGAAATCCCGTCTCTACTAAAAATACAAAAAATTAGCCAGGCGTGGTGGCTGG 

CACCTGTAGTCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATGGCGTGAACCTGGAAGGA 

AGAGGTTGCAGTGAGCCAAGATTGCGCCCCTGCACTCCAGCCTGGGCAACAGAGCAAGAC 

TCCATCCCAAAAAAAAAAANTTNAAAAAAAANNNNGGTACCTTGCCCGGGGCGGCCGCTC 

TANAMCTMGGNGGGGATCC(XCCGGGCTTGNMGGGMTTTCNATATTCAAAGCTTTA 

TTCGAATNCCGTTCGACCTTCGAGGGGGGGGGG 

Sequence 2240 

CGGTGGCGGCCGCCCGGGCAGGTACNTGGGGAAGGCGCNNCGTGCGTCAGCTTGACAGCA 

GAAGCAGGAGGAGAGCTGGCGGGAAGACATGCACCCCTTGAAGACCCAGAGAGAGGCCGT 

CTGTCTACCGCGTAGCAGTTACATCAGACTGAGACACTTCCTGTTTACAGGAGACTATAA 
MTTCCTGCCCXGTGCTCATTTGGGGCTGACGCCATTTTAGACCTCAGCCCATCTGCACC 

CAGGCGCTCACTGAMCAGTGTGTTGCTCCACACCGCCTTGTTTTGCTTGNTGGCGCGCT 

CTCAGG GTTCC GACCAATCCAAGAGCCTTGCAGAAAGCATTAACGTGCTTTTNTCTTTGG 

CAAGAGTTTTTCTTTGCTCTGATCTTGGAGACCATCCCTCTGCCTAGGGGGAAAACATAN 

GGGAATACAGA 

Sequence 2241 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACACCATGTTAAGA 

GATAGCAATGCAACCGCAGCCAAGATGTCTTTAGATGTAATGATTGAACTCTACAGAAGG 

AACATCTGGAATGATGCAAAAACTGTCAATGTTATCACAACTGCATGTTTCTCTAAGGTC 

ACCMGATATTAGTTGCCGCTTTGACATTCTTTCTTGGGAAAGATGAAGATGAAAAACAG 

GACAGTGACTCCGAATCTGAGGATGATGGACCAACAGCAAGAGACCTGCTAGTACCCTCT 

CTTCCAGCACCCAGGCCAGTATTGAGATCGATTCTCTCTATGAAGGAATCGACTTCTATA 

CCTCCATTACCCGTGCCCGATTTGAAGAACTGAATGCTGACCTGTTTCCGTGGCACCCTG 

GACCCAGTAGA 

Sequence 2242 

TAGGGCNAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAGTGGCGTGGGCC 

TGCTTTCCCGCCAGTGCCCACTCAAAGTTTCTTACGGCATAGGCCGATGAGGAGCATGAT 

CAGGAAGGCCGGGTGATTGTGGCTGAATTTGACTCGTTTGTGCTGGTAACAGCATATGTA 

CCCTTGTCX)TCTTCTTCAGTGACTTAAAACAATTCCAGGATCAGAAGAGAAGCCAACGTG 

ACATCCTCGATAAAACTGGGGATAAGCTGAAGTTCTGTCTGTTCACCGAAGTGGTTGAAA 

MCAATTTCGAGATCCAGAAGTCCCTTGATGGGTTCACCATCAGGTGTTCACAAAAA 

Sequence 2243 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGNCAGGTACCTTCACCTG 

CTTT GGAG AAATCCTGAATGTGTCCCCAGAAATGAGACGAGAAGTCTTCTCCTGATCAAA 

GGTCTTTTCTTGATTGAAGGGTTCGTGGTTTCACAGGCTTCAAGGAAAGAAGCCATGGAC 

CTCAGTGGTGAGTGTTACAGNTCCATTAGAGAAACATGCAGACCCCCCCGCGTACCTN 
Sequence 2244 

GCCGCCGGGCAGGTACTTTCCCTTCCCTAGAAAAGCNNGACTNGNCGCTAACGGTGAAGG 
ACCANGGCAGGCGTCCCTGAGTGGTCTGACACCTTTGAAACGTGAGTGAATAATCAGAGA 
GGTGT CCCT GNMTGATAAACACCAAGGAAAGGCTGCCTTCCCAGTCTGNGACCAAGCGC 
CAGAGTTTTGGGTCCACGGATAAAACGTGTCTCGTTTGTCTCTACCAGAAAATGAANGGA 

ATTTGAAATTAAGAGAAGGGAGAGATTAAAAAAAAAAAAAGAAAAAAAAAAAGTACCTCG 
GCCG 

Sequence 2245 

AGGTACGCTTCGACCCACGCGTCCGAGAAAAGACTATTMTTATTTGTGCCTTTAAAGGT 

TMTTATTTCAACCTTAAAAGAATCTTATAAGAACCCATTAATAAAAAAGAATGCAAATA 

TATTGCCCAAAAAMTACACMATTATTGTCTTTCATATAGGGGAGGAAATAAATATGTT 

TCATCTGAATTGTCTGAAGTAAGTCATGAATGGAGTCAGGGTAAAAAAAAAAAAAAAAAA 

AGTGCGGCCACCTGCCCG 

Sequence 2246 

CCGGGCAGGTCAAGCTTCGACCCACGCGTCCGCGAATACTGCATACTGTTTAAGGTAGTA 

TATAAGTTTATGAGAGAAGTGGAGAGCTTTCTTCCTTGAAMGTCGGTATTTGTTGAGAT 

ACCATTTGCCTCACAGAGAGGTGTTCCCCACTCCCATCCCCATTGCCAGATAATAAATAT 

TTTGAGAAAAGTGACCTAAAACAGCTGAAATCTTAGGTGCATCTGTCTGCAGACCTCCTT 

MGCAGGC TGTAT CTTACMTTCCCTTACTGCACTGGGTMGTGTTMCTTAGTTTTTGT 

TGTTTGCTCTTTTGCTTTAAATATTCTCCAAATTACACCT 

Sequence 2247 

AGGTGTACAAGCTTCNACCCACGCGTCCGAAAGATTCTTGGNTGAGCATGGTGGCTCATG 
TCTGTCATCTCAfSICAATTTGGGAGGCAAGGGCAGGAGGATAGTTTGAGCCCATGAGTTTG 
AGACCAGCCTGGACAACATAGTGAGACCCCATCTCAGCAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAGTGCGGCCGGCCGCAC'I ITTTTTTTTTTTTTTTI I GCTCAATANAAGTNTGG 




MTAATTCCAGGTAATTTAAAGCATNTTTTTNAATTGGTGTAAGCTGCTCCATGAAGTCA 

NCTNGCTCCTCTAATTGGGATGGTT^fTTNATNACACGGCATGTTNTAAGAGTTTGATGAC 

AGGANCATNAGTG TNNG GNCCCAGNACCCCCTTCTGNGCGGACTTNTNGGCATCCTCCCT 

TCAMNCTCMTNCTTTTNANGGGGCCCTGAGGAMGTNmTTATTAAMGGGNTNTTTG 

CAAAI 1 1 I IG 

Sequence 2248 

AGGTCACI 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 AAAATAA7TCTTTTAATTATTGATGCTTTGAATAAN 

AAGTCCATTTTACTAAATTTAGTATAAATTATCGGACGCGTGGGTCGAAGCTTGACCTGC 
CCG 

Sequence 2249 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCACTGACACGTTTCTCTTGACA 

CTGAGATGGTCGCAAACAAAACACCGTTCTTGCCTGGATGAAGCAAGAGTTCACATAAAA 

GAGCTTTATAAAATGTCTATGAAGGAGAATTGATAATATCAGAAGAGCTCCAGCACTTCA 

ATTGAATATMTCCTCTATTATTCTTTTCTTGATTTAATTTCTGTAGCTCCCGAAAACTT 

ACTT CAATCTTGTTGAGCTCAGAATAAACAGATATCTAAAGAAAAGAGTTTCAGAGTAAC 

TTTTACCATAATGTCTACATAGAAACAAATCTTCCAAGAGTAAGCTATTTAAGGACAAAT 

TTTMGAATAATTGTAATTCAGGTGATACTTGGGTTTTTGTACAAGCTTCGACCCACGCG 

TCCGACCTGCCCG 

Sequence 2250 

CCGGGCAGGTCAAGCTTCGACCCACGCGTCCGATAATTTATACTAAATTTAGTAAAATGG 
ACTTCTTATTCAAAGCATCAATAATTAAAAGAATTATTTTAAAAAAAAAAAAAAAAAAAA 
AAAGTGACCT 
Sequence 2251 

TGAGCT AACTCACATTAATTGCCGTTGCGCTCACNTGCCCGCTTTCCAGACGGGGNAAAC 

CTGTCGTGCCAGCCTGCATTAAATGGAATCGGCTNAACGCGCGGGGGTAGGAGGCGAGTT 

ANGCCGTATTGGGCCGCTCTTCCGNTTTCCTCCGCTCACTGGACTTCGCTNGCCGCTTCG 

GGTCCGTNTCGGGCTGCGGCCGAGGCGGTTATCAGCTTNACTCAAAAGGCCGGGTAATAC 

GGGTTATCCACCAGAAAATCAAGGGGGATAAACGGCAGGGAAAAGAAACATGTGGAGGCA 

AAAAGGCCAGCAAAAAGGCCCAGGGAACCCGTAAAAAAGGCCCCGTTGCCTGGGCGGTTT 

TTTNCATTAGGCTTCCCGCCCCCCCTGGACGAAGCATTCACAAAAAATCCGACNGCTTCA 

AAGTTNAGAAGGTGGGCGAAAACCCCGAACAAGGGACTAATTAAAGGANNCCCAGGGCGT 

TTTCCCC 

Sequence 2252 

AGGTCTTCGACCCACGCGTCCGATTTTACTCTGTGTATTCGTGGTGGTGGGTTATAGTNT 
ACCAAGTTGTCCAAAATGGATAATTCATGTGAGATAGGCAAGANCAAGNGTTGTATCCCA 
ATACCATTTGGTGAAAAAGTTGAACTTCANAAAGGATAAGTANTTTGCCAAGATCACCTG 
NCAACCTGCATAGTCAAATTTGAGATGGAACTCAGCTACTCCAATATCTGTATTTGCAG 
Sequence 2253 

CCGGGCAGG I 1 1 1 1 1 1 I I I 1 1 1 I 1 1 I GCAACATTGCCCTTTGATGTCCCCATGAGGGCCA 

GGCCCAGGCAGAACCCATCCCATTTTATCCTTAAACTCANAAGGAAATTTGTCTAAATAT 

TAAAGGATTAATATGGGAATAAAAAATGAACCTTAAACCCTGCCACTGATACACAAGCTG 

TCTCTCTTAGAGTTCAATGAACACTTCAGGAGAGTATTTCCAACAATATTTAGATATTGG 

AATATCTAAATATTGTT GATT TAGATAACCACCCTAGATTTCTCACCACCCTAGAACATT 

TAGTGGGGAGACATTCTTrrCTCCTTTTTCTGATAACTTGGTCAGAAGTGATTGACTGNG 

CAAATGGTATTTNTCCGGACGCGTGGGTCGAAGCTTGACCTnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnGAATnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnATTGCGCGCTTTGGCCG 

TAAATCATGGTCATAGCnnnnnnnnnnnnnnnnnnnnnnnnTCCGCTCACAAAATTTNCA 

NCACAACATACCGAGCCCGGGAGCCATAAAAGTGTAAAAG 

Sequence 2254 

AGGTACATGTGCCANNTTTGTTATATCTTTANCTATATNCNNNNCTACTTGCCCCTGATC 
ATCTATGNGTAGACMTMTTAGTMGGTCAMTTTTTCACAAACTCCTCCTTCAGCTGC 

TAGCMGTMTCMGAGCTAGTCTATTTTGATAGATAGCATTTCTCATCAGAGTCTCTTG 

CCAGGCAAGAACAGTCAAGGCTTGAGTGGTNTNNTTANTGNCTGCATTCTAAAACTGCTT 

GGCNCCATATGACANNGCCGANCATGTAGATGGGGGTTCGATATCCCCATGAGACATCTT 

GTGCCCAAGTGGCAGGTCCATAGTATTGTMGATTTTTTCAGGGGGCCATTCATCGTCTT 

TCCMTCAC CTATG GCTATGCTTCATTTTTCACAGGAAGCACAGACTGGGAAGCCCAGAA 

GTTTACCTGTTTTTATGGNCCCCTAAAAAAAAMNANGGTTCNATGGGTGCCAMTTTAC 

ACAGNTTACCTGGCCXACTTGATTNAGGGMNNTTTTNANNCATTAAGNNCCGAANCCCN 

NNGGGNATCAAAAAGNTTTGGCCCCTGGCCCGGNNNGGGCCCGGCCCNCC 

Sequence 2255 

ACCTGCCCGGGTGGCCGCTCGAGGCCGCCCGGGCAGGTTTTATTTAACATTCAAACTTCA 

TTMGACATGTGCMTATGGCAATTTTACTGGGGATTAAACCCTACCTAGGATTGCTTGC 

TGGGGCTTAGCMCAGGGTCCAGTTCACACTTAGCACTAATTAAATACTTTATTGAATAA 

ATACAATACCAAACAAAATGCATTCAAATGCCGGACGCGTGGGTCGACTCAAGCTAGGTC 

GGACGCGTACCT 

Sequence 2256 

ACI I I I 1 1 1 1 1 1 II I I I 1 1 I I I I I AAAAATTCAAAAAATTAGTTTATTAGCTTAATATAA 

TTAGGTCAATGG MTCC TGTTTTGATCTCAATACTTCCCATATTGCAATATATAAATGTG 

ACAAATTCAGCTGTTTTGCGGCATAGATAAGTGTCTAAGCTGGGCAGTTAGTCTACCCGT 

TTATAGTTCATGTTCTTCATGGCTTTGCAGCATTTGTCACTTTCTATGATGTGTTCAAAG 

ACCA GAAAAGGCCACACTTGACCTGTCAGCTGGTCCTTGAACAGCTGTAGG I I 1 1 1 1 U I 

TTTTGAGACAGAGTCTCCCTNTGTTGNCCAGGCTGGAGTGCANTANCGCAACCTCGACTN 

ACTGNAACCTCTGCCTCCAGGTTCAAGNGATTCTNCTCCTCANACTCCTGAGTAGCTGGG 

ATTACAGGCCCATGCCACCATGCCCGGCTGATT^^TATATTTTTAGTAGAGATGGGG^fIT 

CACCATGTTGGCCAGGCTGGTNTTGAAMCCCTCGGCCGISITTCTANAAACTAGTGGGATC 

CCCCCGGGCTTGCMGGGAAATTTCCNATTNTTCAAAAGCNTTAATCGGATACCCCGTCC 

AACCNTNNGANGGGGGGGGGGCCCCGGGANCCCCCAGCTTTTTGGTT 

Sequence 2257 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGCAGGACCTCAGATTAAAAATAT 

TGCTGAGGTCAGACGCCACAATTTTCATGACTTTCTTCAGAAGTAGCACATTTTCGTGAC 

TTCCGCTGTCCTCTGAA AAAC AAAGTTATTTGGAACATGTTCATGCAAAAGTGATTCTGA 

CCMGTCTA MTCG AGCTTTTCTACTGACATGAAACTGNTGGAAMCTGATCTTATTTTA 

TMG MAGTGTTTTCCCCTNAGGGGGGGNNGTGTGNTTTCNGAACANCCCNCNNATTATT 

TTTTCCCCCG GNNCCNT ATAAAATATTGGGGTCCCCCCCCNNNCGGGGGGGNNGGGGNAT 

TTTAAAMTTNTTTTTTTNNTCCCCCCCCCNCTGGGGGG 

Sequence 2258 

GGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTGTACAAAGCTTCGACCCACGCGTCC 

GAGCAGAACTTGGCAGCAACAGAGGAAGGGCCCCTGGAGCCGGCTGTCGTGGATGCCTTT 

AATCAAGCCTGGCATTTGGTTGCTCACGAATGTCCCAACTACTTCCGCTAGGCCCATCAT 

GGCTCAGGCTGCCCMGGCTTTTCTGTCACCTNTTTTGTTCTCTCACACTGACCAAGTCT 

TACCTGCCCG 

Sequence 2259 

CGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTCTTCGACCCACGCGTCCGTA 
GTAATAGGAATTAACTGACCCCTTTNGGATGGGGGAGAGCATCAGGCTGGGGTCAGGTAA 
GTGTAAATGGCCTTCTGAGCATGCTCTTCTAGGCTGACTCC 
Sequence 2260 

CCCCGCGGTGGCGGCCGCCCGGGCAGGTACAATTAGTTATCAATTCATGGGCTATGGCCA 

CTGGTTTGCTGGATGGTCAGGGACTTGGAAGGAACATGACTGGAAAATTGGTGACAAAAC 

GGTCTGTGGAAGAGGTATATACACAGATCTTTCTGAATGGGTGAAAAATGTGAGGATATT 

TGTGTCTCATATGAATGCCCCCAAAGAATGACTTCAGCAGAAGAGGATTTTAATAACCAA 

GTAGATAGGATGACCTGTTCAGCCCCTTTCTCCAACCATGCAGGTAACTGCCCAATGAGC 
TCATGAATAAAGTGGCCATGGTGGCAGGGGTTGTTCGGACGCCGTGGTT 
Sequence 2261 

TGGCGAATTGGACTCCCCGCTGTGGCGGGGNGNCGGNCGNCNGAGTAAGACTTCGTCTCA 

AAAAAAACAAAAACAAAAACAAAAACAAAGAAGTTAGCCTGGTTGGGCATGGTGGCTCAT 

ATCTGTAATCCTAGCACTTTGGGAGGCCCACTGGGGATCATTTGAGGCCMGAMTTTTG 

AGACCGAAGCCTGGGCAACGTTAGTTGAACCCTCATCTNTTACCCAAAAAAAAAAAAAAA 

AAAAAAAAAGTTGCCGNGCCCGCTTCTTANTAAACTAAGTTGGATTCCCCCCCGGGGCCT 

GGCAGGGGAAAATTTCGMTATTCAAAGGCTTTATTCTGAATACCCCGTTCNGACCCTTC 

GNAGGGGGGGGG 

Sequence 2262 

AGGTGGACTTGATTCATAACCAACCAACCX)AGGCTGTCATGCTAAAAAAAAGGGCAGAAT 
CTTTGCTGCTGMCTACTNGCATAGGGTTGACTACCATTGNTCCT^TAGT^f^TAGTTTGG 
GCTAGCAAAAAGGTGNGTCCTTTGCCATGTAAATAAAAGCCNTNTCNNGGTAATNAAAAN 
GGTNTNTN I I I I IAACTT 
Sequence 2263 

CGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTCCCAAAGTGCTGGAATCACAGGAAT 

GAGCCACCACACCCAGCCAAATTGGGCACAAATTTAAAATTTGACTTTTATTAATGATAT 

GGTAAAGAGATCTAGCTTGGTCATGACACCCTTGTGTTATACGGTGACAGGCAAATCATT 

TAAAMTATCTAMCTATMTTTNCTGTAGTTCACATGAATTGGATATTCTTGAAGCGGA 

CGCGTG 

Sequence 2264 

GGCGAA TTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATTTTTTGGGGGGAGAGACACAG 
Ann lACACTAATATATGGACCTAGCTTGAGGCAATTTTAATCCCCTGCACTAGGCAGG 
TMTMTAMGGTTGAGTTTTCCAAAAAAAAAAAAAAAAAAAAAAGTGCGGCCACCTGCC 
CGGGCGGCCGGCCGCCCGGGCAGGTNTCGGGCCCGAGMGACCTCCTTTTTC 
Sequence 2265 

AATTGGiAGCTCCCCGCGGTGGCGGCCGAGGTACCGCGTCCGAGCTAACGAATGCTNGACT 

AACTAGATNCCAAAGCTTGCTCTGTGAAAATTCCCGNATAACCNNTGAAGTGGGCGACAC 

CNTAACCCTGCACACCTTACTCCTGGTNTCAGAGAGCCCAGTNTGAACATAAACTGNGTA 

GAGGTGTTAGACTCANCCTACCCTAGTAANGCCCAACCTCCGAGACCAACCTTAAACATC 

AGTAGACTGCGAGCTGTATGTGGATAGGAGCAGTTTNGNCAACCCCTGCNAAGTGACTCT 

GAAAAAGAC 

Sequence 2266 

CCGGGCAGGTCTTGAGTCGACCCACGCGTCCGCCTAGCAAAGCTGTTTCCACTGAATGCA 
TCTAAGCANNGATGGANCTATGCCAAAACCACCACAGGNGTTTCACTTNAATGATACCNC 
GAAACAAGG 
Sequence 2267 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAATATGCTACAGGGAAGA 

AAACGTTCCAAAAACAAGAAAAAGTTGGAAAAGGCAATGAAAGTGCTCAAGAAACAGAAA 

AAAAAAAAAAAAAAAAAAAGTGCGGCCGGCCGCAC I I I I I I I I I I I I I I 1 1 1 1 I I CAATN 

^ATTTATCAAAATAAATTTATTAAAAGTTTTCAAAGACCNCTTTAAAGTGTANCTGGCT 

TNMNACAGATTTTTGGNACTCNTAACGGACACTGCAGTTTTNAACNCCATAGCACTCAT 

TNTATTTCACACATCATTTTTAAC 

Sequence 2268 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAAGGTATGTAATCCAGG 
AAGTGACCAGCCTGATGCGTGTTATGACTCACTGNAAGCCTCCCATGATTAAGGAC 
Sequence 2269 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCTTGAGTCGACCCAC 
GCGTCCGCCGGAGAnnnnnnnnnnnnnnnnnnnnnnnnTGATCTAGATCACGAGCAGCCG 
CAC I II I I | | | I II I H i I I I I i I I I I M I GGACGAATTCAGGAGTCCTTTATTAGCCGG 
CAGCCGANAGACAGCTAGCGCTCAAMTTNT^GGCCCCGAAGAAGGAGCTAGATTTTC 







TmATACTTTGGTTTAMCAGGGGAGAGGGGGAGW 

TAAAGTAGGCAAAAAGTTAAAAGGATAAACGGTTACAGGAAAGTAAACAGTTCCAGGNGC 

AGAGGCTTTAAGTNTATCCTAAGGNGATGGACCCCGGGCTTTGGGC 

Sequence 2270 

MTTG GAGCTCCXCGCGGTGGCGGNCGAGGTCMGCTTCGACCCACGCGTCCGGTTTGTT 

TTTTTCTTACGGCAACTCAAAGCAAAGAGCTGGAGGAGCCAGNCATTATAATTGCTTACT 

CTCATCGCTTAGCGCCC CAGG TGGGATGTGTTTCCAAMCACATTlTrGTNTTTATAAGG 

AMTGTAGTTAGGATTMTTTTATTGTCCTMTTAGMCTCACATTTTGGTTAAATCCTC 

AATTTCATTAAAAAAAAAAAAAAAAAAAAANGGN 

Sequence 2271 

CNCTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCTTGAGTC 

GACCCACGCGTCCGGTGATATGTCACAATGCCGTGTAGCCAGAGCCTAGACAAAAGTTAC 

AGCACCTGGGAGATCAGTGCAGAGATATGTCACAATGTCCCCAGTAGGCAGAGACCAGGC 

MCAGTTGGATCACCTCGGGATCAGTGCAGAGACATGTCTCAATCCCCCTGTGGGCACAG 

CCTAGACAAGAGTTAAATCACCTCGGTTAACAGTGCAGAGATATGTCAATATTCCCCTGT 

AGGCCGAGCCTACACAAGTGTTACATCACTAAGGTGATCAACGCATAGATATGTCAAAAT 

ATTGCCGTGAAAGCAGAGTCTAGACAAGAGTTACATCACCTGGGCGATCAGTGCAGAGGT 

ATGTGACAAGGCCCCTTTAAGCAGAGCCTAGACAATAGTTACATCACCTGAGTGATCAGT 

GCAGAGGTCTGTCACAATGCCCCTTTAGGCAAGAGCTTAAACACCTCGGC 

Sequence 2272 

AGGTACTTGACCCCACAGCCATCTGGGATGAGCCGCTTTTCAGCCACCATGTCTTCAAAT 

TCATCAGCATNGAACNNGGTGAAGCCCCACTTNTTTGAGATGNTGGATCTTCTGGCCGGC 

CAAGGA AACT TGAACTTGGCCCTGCNGCAGGGCCTCAATCACATTGCTCCTTTGTTTCTT 

GCAAGCTTTTGGGTTGCCGGGATTGGGGACATNGATAAACTTGGGCCAAATTGTGAAACC 

CTGGGCCACAAGTGCCCTGGGGGNGCCTTTCCAAAAAGGGCCACCCTCCGCCATGCNCTG 

TTNTGGGAAGNCCTTGTTCAAGCCCCCCAANCNACAGGGGAACCAAACCAATTCNTTNGT 

TTGGAATGGCCGGGAATTGAACCGTGGNAAAGGGGGGGTTGGGGAAGCCCCGCCACCCCC 

NGGNATAATGGGGAAAGCCMTCTTTTGCCMCAMCTTTTTTTMCCMTGTTACCCTT 

GCCCGGGGCCGGGTCCGCTTCTAAGAAACCTAGGTGGGGGATCCCCCCCGGGGGCCTGGC 

GAGGGGAAATTTTCGMTTATTCNAMGCTTTAATTCTGAATACCCGTCCGGACCCTANG 

AGGGGG 

Sequence 2273 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTAGCTTGAGTCGACCCACGC 

GTCCGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAGGGGNGGCCGGCCGCCCGGGCGGGTCCCNTCANANANAAGG 

GGGNGGGNGCTAATCCAGTACCAAACNTTC 

Sequence 2274 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCATTTCTACTGT 

AGAGGAAAATATATGACAATTATCACTGTCACTGCCTGACATACAAAAATGGAACAGAAC 

AGTGGGGTAATGAAGGGAGGGGAGGGAAAGGGAAGAGCAGGAGAGAGAGGAGTTGGAGGA 

GAGGGGAAACAAAGGGAAMGGGTCTATTAMCAGAGGCCTAGAGAAGCTAAAATTTGGA 

AATGGCAAATCTGAGAAGAGCCTGAATAAAAAGTGGGGOTGAGGCCATGCACAGTGGCTC 

ACGCCTATAATCTCAGCACTTTGGGAGGCCGAGGCAGGTGGATCACCTGAGATCAGGAGT 

TCGAGACCAGCCTGGCCAACATGGCAAAACACCT 

Sequence 2275 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCG 

TCCGAAAMTGGGAGACMmCACATGGACTTTGGAAMTATTTTT^ 

ATCTCTCAAACTTAGTTTTTATCTTTGACCAACCGAACATGACCAAAAACCAAAAGTGCA 

TTCAACCTTACCAAAAAAAAAAAAAAAAAGACCTGCCCG 

Sequence 2276 

CCGCGGTGGCGGCCGCCCGGGCAGGTCTCCTGCCTCAGCCTCCTGAGTAGCTGGGACTAG 
AGGGATGMCCACCATGCCCAGGTMTTTTTTMTTTTTAGTAAAGGTCGGGTCTCACTA 

TCTTTCCCTGCTCAGCTGGTCTGGAACTCCTGGGTTCAAGTGATCTTCCCACCTCAGCCT 

CCCAAAGTGCTGGGATTAAAGAAGTAAACTACCACACTCAGCCACACATAGGTAATTTAA 

MTATTTCCATAGTCACMTTAAAMCACATATAMTAGGTAAAATTAATMCATTTTAT 

TTMCCCAATATATTAAAATATTTCCACTTTAAAAAGAGACCTCGGCCGCTCTAGAACTA 

GTGGGATCCCCCGGGCTGCAGGGAATTCGATATCAAAGCT 

Sequence 2277 

ATAGG GCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTATTCGACCCACGCGTCCGTAG 
TTTTTATCTTTGACCAACCGAACATGACCAAAAACCAAAAGTGCATTCAACCTTACCAAA 
AAAAAAAAAAAAAAAAAANGT 
Sequence 2278 

GNACC CAGTMTCACATAAAGGTNTGCAGGTCATGNTGTTTATTTAGCTTMCTGTTTTT 

MM l ATTTGTTGMGGGGGTNGGTGTTATTTCAGNCTTTTTCTTATTGGGTTGACCAGA 

CTTGGTAAAATCTGTAAGAAAGTTCCATAAATTATTGGGGGGGMGGNATTTCCTCTGAA 

ATTGGGCTAAAATTCCTTGTAGCTGAAAAAAAAAANAACAAAAAAACAATAAAATANGGN 

GGCCGGGCCCGCTTCTAAGAAAACTAGGNGGGGGATCCCCCGGGGGCCTGCAGGGGAATT 

CCGNATATCAAAG CCTTAT CGGATTACCCGNCGNACCTCGGAAGGGGGGGGGGGGGCCCC 

GGGGTACCCAAGC I 1 I r I GGTTTCCCTTTTAAGTGGAAGGGGGTTAAATTTGCCGCCGCT 

TGGGGCCGTAAAATCAATGGGTCATAAGGCTGGTNTTCCCTGGTGGNNGAAAAATTTGGT 

TMTCCCGCTTCAACAAATTTCCCACCACCAACCANTACCGAAGCCCGGGGAAGCCATAA 

AAAGGTGGTNAAAAGCCCTGGG 

Sequence 2279 

NGGNGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAAAAAAAAAAAAAA 

ACAAM AAAAACCACTTCAACTAATTCATCTGACAATGCTGTTCATATTCATGACGCCAT 

M M 1 iGTTTGTTGTTGTTGTTGTTTTCCTAATAATAAAGAAGGAGACTTAGGGCTGTTG 

GGCTGATATATGTTTGGGGGTCCCACCTCCCCGCCTCATCCGTACCTCGGCCGACCAGTG 

CAAATATCTACCCAGTTAGAAGAGTAMTACCATCTTAGTGTTATTATCAAAATATTIVJTG 

AACTCATGAACCTCCTCAGACTGTTGCTGGGGACTCCCAGATATCAATACTCTGAGAACC 

ACTGATCTMTGT7TCTTTAGTCAGTTTCTATTTGTTCTCTAGTATAACCAAGCATAAAA 

GTAAT 

Sequence 2280 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGCTCAG 

GGCACAAAGTGATCATTTGGGATCCTAAGTTAAAAAGGAAATGCAAGAGTAGGATACTCC 

AATTCCAGAGTCTTTGCAGGAGGCTAATCCCACAAGAAGGGTAGCATCAGAGAAAGTGGC 

ATTG GTCT TAGTGGTGGATCATCAGGTAGACAAGTGATAGTGTGTGTAACCCATCTGAAA 

TTCATT1TACCGTCACCACTCTTACAMGGACAGTTTATTCCCAAGGACAGTGCTGACGG 

GGAGGGGGACAGGCAGGGAGTTAGGAGGGTTTTCGAGGATTTCAAACAGGTGGAACCCAT 

CCAT CCCTA TTCCCAAGGGCCACTTACAACTCTAAGGGGTGGTTACAGGATTAACTACCA 

GTTCATTTTCAAMTGCTGCTTTGAACTCAGAGGGTTGATACTmAATTTGT^ 

TGTAAAACTTTTTACAAAATAGT 

Sequence 2281 

CCGGGCAGGTTACAAGCTTCGACCCACGCGTCCGGTCATTATTACCCTCACTGTCAACCC 

AACACAGGCATGCTCATAAGGAAAGGTTAAAAAAAAAAAAAAAAAAAAAGTGCGGCCTCG 

AGCGGCCGCCCGGGCAGGTACAAAAAAAACCTTACATAAATTAAGAATGAATACATTTAC 

AGGCGTAAATGCAAACCGCTTCCAACTCAAAGCAAGTAACAGCCCACGGTGTTCTGGCCA 

AAGACATCAGCTAAGAAAGGAAACTGGGTCCTACGGCTTGGACTTTCCAACCCTGACAGA 

CCCGCAAGACAAAACAACTGGTTCTTGCCAGCCTCTAGAGAAATCCCAGAACACTCAGCC 

CTGACACGTTAATAC 

Sequence 2282 

GCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTCAAGCTTCTACCCACAGCGTCCGTTT 
ATGTTCAGCAATAAAGGTTCTATCCGTAAAAAAAAAAAAAAAAAAAGNGCGGCCGGCCGC 
CCGGGCAGGTACTATGACTAGACACATGATGCATGGCTAAAAAGCTCTTCTGGATAACTC 
CTTAGNGAAGGNCTAACNNCCCCACCACCATCAACCTACAGCCCTGCCI 1 1 1 1 1 1 I 1 1 1 I 
TTMNAAGTCTGTCACCACMTNTTGTCTGGNGCTNGATTTCAAATAATACATTTNTAGA 
ACCTGCCCGGGCACACGCCCNATMCGANTGGTTTTNNNTTATATCAATTAACGTTAAA 
Sequence 2283 

CCGCGGTGGCGGCCGAGGTACTCCAGCCTGGGCGACAGAGCAAGGCTCAGTCTCAAAAAA 

AAAAAAAAAAAAAAGGAGAGGAATAGTAAATTTATAGTGGAGAAATCTGCAGTCACTAAC 

TTAACCAAATAACCACATGGATGTATCCCTTATTAAAGGTAATCGAAAGGGCACAGCGTT 

ACTTCTGTGGAATTCTTGCCCAAAATGCATAATCTCAATCAAATCATAAGAAAACATCAA 

AATTGAGAGGCATTCTACAAAACCAATAATTAACCAATATTCATCAAAAGTGTCAAGGTC 

ATAAAAGACMGATGTTTATAGAMTACATTMTTTTC 

TTAAMTAATTGTTTTAGAGATGTGGCCTTGTTAGTTGGCCA 

Sequence 2284 

AT AGGGCGAATT GG AG CTCCCCG CG GTGGCGG CCG AG GTGG G ATTTTTTTTTTTGTTTTT 

TTCI I I I I I I I I I GGTTTGTTTTAMTCAGTGCATAAATTTTTCTTTTCTCATTTCAGCA 

GATGGAGAAACAGATGGACTCTACAGCTAAGTGGAATATCAAAGGTAGAGGGGTGATTCT 

GTGAGACTGATAGGCCTGACTATTCTCAATTCTCCCCACTGCAGTGTTCACGCGGACGCG 

TGGGTCGAANACCTGCCCG 

Sequence 2285 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCGCGTCGTAACAGTG 
TAACATGTATTATGGTAACTTCTAATCTTGTGGCCTTAGACAGTCTAGTCCAAAGGCATA 
AAGAAAGTNTGCTTTAAAAAAAAAAAAAGGAATGGTTATCTTCAAAAAAAAAAAAAAAAA 
AAAAAAGT 

Sequence 2286 

TAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACGACTCATATAGGGATCTAGA 

TCACGAGCGGNCGGCCGCCCGGGCAGGTCTCCCATCTTGCGCAAGTTGGTCACGTGGTCA 

CCCMTTCTTTGATGGCTTTCACCTGCTCATTCAGGTAATGTGTCTCAATGAAGTCACCG 

GACGCGTGGGTCGAAGACCT 

Sequence 2287 

GCGAATTGGAGCTCCCCGCGGTGGCGGCCGTTAAAGGAATAATCTGCAGAACATCTTGAT 

TTACAAGGGACAAAATGATGCAAATTATATGCTGTCCAACCTACTGGTGAACTGGATCAG 

MTGGTCCMGGACTGTTAMCAGAGGMGTATTTACATTTTGAAAACTTGCGGACGCGT 

GGGTCGAAGCTTGTACACCT 

Sequence 2288 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGNCTGGTACATAAAANAATTTN 

TTNGTCTTTAAATNGATACNMTGTCTATCANCTTTAATCAAGTTGTAACTTATATTGAA 

GACANTTNGATACATAATAAAAAATTATGACAATGTCCTGGAAAAAAAAAAAAAAANAGT 

GCGGCCGACCT 

Sequence 2289 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCATACTTGTCATTG 

CGTAGACTTCTTATCAAAATTTCACATTNATCTGTAGGAAAATGTAAAGTTGGTAAAAAT 

TGTTTACACAAATCACACATTTTCCATCCTTGACAATTGCAGNG I I 1 1 I I U I A AATATT 

GCTGTATTAAGACAATTTTAACTGAAGTAGGTTGTAGAGGCTANAAACCTGATTAATAGA 

GCAGTATTAGACAATTCTAACTGAAGTAGGTTGCAGAGGCTAGAAGAAACCTGATTAATA 

CGGACGCGTGACCT 

Sequence 2290 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAAAGCTTCGACCCACGCGTGCG 

AGTCAGGGMGGTTTGTTTATGTTACATTTATTTCACCAGAACTATTTTAATATATCAAA 

GGGGTTTACTATGCCAAACAAAATTCTAGGGAAAAATACTGCTAAAAATGGATGCCTCAT 

CAGAACATGCTGTTGAGTCCAATGTGCCATMGACATTTTAGCATGTTAAATAGCACTTT 

TAATAGCAAAAAAAGGCACATCAACTGCGMGTTATCCTTAGTTTGCAAATGCTTTTTCT 
AGATTAATGATTTTTCAATCATTAGGGTACCTGCCCG
Sequence 2291 

CGCTACCTGCCCGGGCGGCCGGCCGCACI 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 CAAGTTTTATGA 

TTTATTTAACTTGTGGAACAAAAATCGGACGCGTGGGTCGGACGCGTGGGTCGAAGCTAC 

CT 

Sequence 2292 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCTACCTCCAAGCAGGATGATGGG 
CTAGACATGGAGCATACATAAACGGGCAAGATTCAGTCCCTGACCGCAAGGCACTTACAG 
TCTAGTTGGGAAGGGAGACACAAATGTACCT 
Sequence 2293 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCAGAGACTCCAGGAAAAATCAAAAATTTGTTT 

TTGCAATTAGCCGAGCACGTAGCCCAGTCTCTAAATGTCACTTCATATTATGTTTGTAAA 

ACAACTGTAATAAGAGATCAATGGCCATAAGAAGCCTGAGAATTAGTGCCTACAGACCCA 

GTTCCTGATGAATTCTCAGCCCAAAAGAATCACCCTGATCATTTCTAGGTTCTAAAAGTT 

TCAATTATTGGACAATATTGCATAGCTAGAAAAAAAAAAAAAAAAAAAGTGCGGCCTCGA 

GCGGCCGCCCGGGCAGGTACTTTGAGCAGGATAATAACATAAATTTCATTTAAAAAGTTG 

TATTTATAGCCCCAGTAACCGGAAAGAATTATAAGTAATTATGGAAGTATTATATTCTGA 

CCATACCAAGAGTTAAAAACAAAGAGTTCCTACTAAAGAGGAATATTTTCAAGATGATCT 

GGTCCATCATGTGCATAGTTAAAGAATGGTTGGTTTAATAAAGATTCTTTTGCAAATAAA 

GAAT 

Sequence 2294 

AGGTACAAGCTTCGACCCACGCGTCCGATGAACAACTGGCTGCTCTGTCCCAGGGTCCAA 

TATGCAAGCCCAAGAGGAAAAGAGAGAAAAAAAAAAAAAAAAAAAAAAAAGTGCGGCCGG 

CCGCCCGGGCAGGTACAGATAAATATAGGAAACTACTTCAGATTATTGGACAAATAAGAA 

TTTCAGTGTGTCACTACCTATAATTAAGTAGCAGCACACCACAACCACAGCAAGTAAACA 

CTAATCTCCTCTACTGCCTTTTTGGGGTCTTTTCCAGTCACAGGAGCTAATTTACAGGGA 

TACCACTGGGTTTCAACCMTCTCTGAAAGTTCCTTTATTATGAGTTTTGAAATTTAACT 

AGTGCATCACCTACAATTCTGTTGGCTAG 1 1 1 1 1 1 1 CTTACTCTTCACTAATTAAGTTTT 

AATACAATATGTAATGGTTTCAAAATTTTT 

Sequence 2295 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGAAA

CTTTMTTTTTAGGAATAACATATTGACTTACTGAACTGAAGCATTCTGAGTTGAAAGGA 

GCCCCAGAGGAAAGGAGTTCTGTGTTGCTCACATGTTAAAAGCTTGCTCACCTTCAGAGC 

AGAGGGAATACCTATCTTCAGATATCCGCCCATTTTCATCTCTTCATTATAGTCAAACAG 

TGTGACTTGAGAGTGTTGCTCTGGTGTCTGTATTCTGGCTTATGAAGATTATTTGAAAAA 

GAACTCTTACTACATTGAAATGCAGACTTTTAAAAATTTAAATATTGGATTAGGCAGTCA 

AAAAACCAAACAAGCATAAAAGGTCAGTAAGTTGTAATCTTAAAAGTAAAGGTGGAAAAC 

TCATTATAAATGGMGAAMGTTTTGATTTCCTTTTTTGTTTGATGGGCAGTATGCCATA 

TTATATCCAMGTTGGTTTAAAAMTACTTCCATCMCTATTTTTATTTAAAATAAACAT 

TT 

Sequence 2296 

CCGCGGTGGCGGCCGAGGTATACTTTGCACCTTGAAAATATAAAATAAAATAAATTTAAA 

AATAAAGGTAATnTGTTCTTCCATGTCAGCTGAAAATAAGTGAAGACTGGGTCAGTAAT 

AACATTGCTTTGCTGAATTCAGAGAATTCTAATAAAATA I I 1 1 1 AGTTGGGAAGCTATCT 

GTATTAAAAAAAATGATCTAAGGCTGGTAACAGTGGCCATACTTATAATCCCAGTGCTTT 

GGGAGGCCAAGGCAGAAAAATCACTTGAGGCCAGGAGTTTGAGATCAGCCTGGGCAACAT 

AGTAAGACCTTATCTCTACCCGGACGCGTGGGTCGACTCAAGACCTGCCCGGGCGGCCGG 

CCGCACI I 1 1 1 1 1 I I 1 1 1 I 1 1 1 1 1 1 1 AAGCTGCTCCTTGAGGATAAGGGCTAACTCACAG 

GCAGTGCACCAAGAGCCACTATAAAAAGATCCTTAATGAGCAAAATATATCC 

Sequence 2297 

CCGCGGTGGCGGCCGCCCGGGCAGGTCAAGTCGACCCACGCGTCCGCTTGTnTGCTCTA 
TCCCATAGGAGTTGGTATGTTGTGTTTCCAATATCATTTACTAAMGAAATTTTCCTTTT

TATTTCTTCATTGACCAACTGGTCATTTGCATGTTCTTTAATTACTATGTGTTTGTATAG

TTTTCAAMTTTCTCTTATTMTTTCTAGGTTTTCCTGTGGTCAGAGAAGATGCTTGATA 

CTACTTTAA I lilt IGAATGTTTTAATACTTGTTTTGTGACCTAACATATGATCTACCCT 

TGAGAATTATCTATATGCTGAGGAAAATAATGTGTATTACACAGCCATTGGATGAAATAT 

TCTGTAACTATCTCTTAGGTACCT 

Sequence 2298 

GAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACGCTTTATTT AI 1 1 CI 1 1 IA GGAAT 

TGCAGGTTCCTAACAAAGTAGGGGTGAGGGGGGTGTTACAAACCAGTCACTAGGCAGGAA 

CATTAGACTCCAAAAGCAGAGAAATGCTTAATTTTTCTTCTACCTGTTTCACCACATTCA 

TGTAGAACTGTAGTAAAAAAGATGGTGAATCAGGCTGAATCAATCTAAATAACAACTTAA 

GGCTCCCAAATCACATGAACCTAGGACCACTAAATCCAATGTCAGACGTGTTTAAATGGT

GCACTGCTCTACATTTTTCTATTATGCAAAGAGCTAGAAAATAATGGTAGTGTCATTATG 

ACATTCCATGAAAATGAANGAAAATCTTTCAGGAAAAATTAAGAAAATAAAAATGTTTAC 

TAAAGAAAGAATGGTCCGGCTAAGTGCTANAGTTTNTTTCNN 1 1 1 1 1 1 I 1 1 A 

Sequence 2299 

AGGTACTATCAAGCTTTATTTTACCTGCAAAAATATTTTAGCTACACTTGGAAAAAAATA 
AACTTGAGAATATAACTTCACATTTCTAAGGCCAGCGGACGCCGTGGTCGAAGCTCGACC 
TGCCCGGGCG 
Sequence 2300 

AGGTACGATTTTCCCTTCGCTTGAATATTATCCCTGTATATTGCATGAATGAGAGATTTC 

CCATATTTCCATCAGAGTAATAAATATACTTGCTTTAATTCTTAAGCATAAGTAAACATG 

ATATAAAAATATATGCTGAATTACTTGTGAAGAATGCATTTAAAGCTATTTTAAATGTGT

TTTTATTTGTAAGACATTACTTATTMGAAATTGGTTATTATGCTTACTGTTCTAATCTG 

GNGGTAAAGGTATTCTTAAGAATTTGCAGGTACTACAGATTTTCAAAACTGAATGAGAGA 

AAATTTGTATAACCCATCCTGCTGTTCCACCTGCCCCGGGCCGGCCCGCTCTAGAAACTA 

GGTGGGATCCCCCGGGCCTGCAGGGAATTCGATATCAAGCTTAATCGATACCCGTCGACC 

CTCGANGGGGGGGCCCGGGTACCCAGCTTTTT 

Sequence 2301 

GGAGCTCCCCGCGGTGGCGGCCGAGGTGTACAAAGCTTCGACCCACGCGTCCGGTGGCGA 

GTGCCTGTAATCCCAGCTACTGGGAGGCTGCGACAGAAGAATCACTTGAACCTGGGAGGC 

AGAGGTTGCAGTTAGCCAAGATCATGCCACTGCACTCCAGCCTANGCAACAGAATGAGAC 

TCCATCTCAAANNAGAAAAAAAAAAAAAAAAAAAAAGTGCGGCCACCtGCCCGGGCGGCC 

GGCCGCACI I I I 1 1 1 1 1 I I I I 1 1 1 1 IGCCATACAGGTTCATTTTATTGAGTGGAAAGCTT 

ACAAAAGGGCCACTGGCCCCTTCC 

Sequence 2302 

CCGC GGTGGCGGCCGCAC I 1 1 1 1 1 1 1 I 1 1 1 1 I I 1 1 1 1 1 1 1 1 1 I 1 1 1 I I I I 1 1 1 1 1 1 1 1 I I 
TTTTAGGGTTGAGGGGGAATGCTGGANATTGNAATGGGTNTGGANACATATNATATAAGT 
MTGCTAGGGNGAGTGGTAGGMGTTTTTTCATAGGAGGNGTATGAGTTGGTCGTAGCGG 
AATN GGGGGTATNCTGT TCGAANACCTG CCCGGGCGGNCGGCCNCACI 1 1 1 1 1 1 1 1 1 1 I I 
TTTTACTTTGGCNGGGGNTTTTTTCTTTTC 1 1 1 1 1 I I 1 1 I CAGCTACNGG AATTTANCCN 
ATTCANAGGAAATCTTCCCCATMTTANGGAACTTTNTTACANANTTTACCAAGTNTGGG 
CNNCCCNATAAGAAAAAGACTGAAATAACAACAACNACTTTAANAAAT 
Sequence 2303 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCNNTNT NI 1 1 I I I 1 1 1 NTTNGCCAG 

MTANMTCTNAGCTTTTTATTTGTAGGAAAAAATAAACAGATTNCCCTCCNNAACANGGC 

GTNACAANAAANGAGGCAATNAAGGGAAAAANGCANATNCTAAACGGACNCIMTGGGTTNA 

ANCTTGACCT 

Sequence 2304 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGT 
CCGAGAAGAGTTTGCAAATGCAACAAAATATTTAATTACCGGTTGTTAAAACTGGTTTAG 
CACM TTTATATTITCCCTCTCTTGCCTTTCTTATTTGCMTAAAAGGTATTGAGCCATT

TTTTAMTGACATTTTTGATAAATTATGTTTGTACCTGCCCG 

Sequence 2305 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGCCGCA

CI I I I 1 1 I I I I I I I I II I I I I GNTTTCTGGGTGMGTTTATTCTGTTTTCACATCTAGGT 

TGTTGGGGAGAGTGATAGACAAAGTTCTGGATTCTGGGCATCGTCGGCGCATGCTTGTAA 

TCCTACTTGGGAGGTTGAGACAGGAGGATCACTTGAGGCTAGGAGTTGGAGGCTGCAGTG 

TACCTGCCCG 

Sequence 2306 

ACTTAGGGCGAAT TGGAGCTCCCCGCGGTGGCGGCCGAGGTCGGCCGCA C1 rTTTTTTTI 
1 1 1 I I I 1 1 I ITTTTCTTCTTTCTCTTGCCTGGATTCAGTCCCAGAAATGTTAGGACTACC 
TCAGTTTTGCTCCAAACCAAACTCAAACAACAGCAGCCACTGGAAATCAAGGAAACTTCA 
CTMGMTTTMCAGATCATCAAMCACCGCCTCCTTCCCATTTTAGCCGGACGCGTGGG 
TCGAAGCTTGTACACCTGCCCG
Sequence 2307 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCATTAAATA 
TTCTGAGAGGTGAATGTAAAATATAAAAGGTATAGG I I I I I I 1 1 I I I I AAAGAAAACAAT 
NAACTTTCAAAGAGAAAACCAAAAAAAAAAAAAAAAAAAAAAAAAAAAGTGCGGCCGGCC 
GCACTTN I 1 1 M I I I 1 1 1 1 1 1 1 1 I AAAAN ANAN ATGAGGTTTTGNTATGTTGCCCAGGCT 
G 

Sequence 2308 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCAAGCTTCGACC 
CACGCGTCCGTTCTAAATGATCGACAACTCTCAAGCAATAACTTGACTGTTGAATAGAAG 
ATTAAGAAAAGTTGGTTAAAAAAAAAAAAAAAAAAAAGT 
Sequence 2309 

CTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAGCTCG 

ACCCACGCGTCCGAAATMTAAAGCTAGAAGTMTATTTTTCCTTTTGTCTATTTTCCAA 

ATTGACTCGATATTGATGGCTACTTTTGTAAGTTTTTATTTAAGT^ 

ATTGATCACCTCTATGTGCTCAGTACCT 

Sequence 2310 

AGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTAAGCTTCGACCCACGCGTCCGCA 

MGTTCACCAAATTCGCTACAGCCTAAGACTAGCCTCACCAGTCCTTTTTCCCATTAATC 

AAAACTTTGCAGMGAGACMTGATTTTTACCATTCATTCMCCAGTTTGCACAGAGAGA 

GGCTGMGCCTGACTTGTMGAMCTCTTGTCCTTTTGCCAGTGTGCCAGGTTTCTGGGT 

TCCCCTTCTCTGAGTGGCTTTGATGACCCTGCTTGCTGTGCCATAGCTATGGGGGGGCCA 

AGCCATGTTACCCAAAAAAAAAAAACNNTNNNNNNNGGTGCGGNCNGNCCCTCGNGATCT 

AMNCCCCATAGGGGGNGGGATTMCMTTNCCATTCCCNNGGGGAAATTTTTTGGCNCC

TTTTNTTGGGG AANAAAA I 1 1 I I 1 1 I I I GNAAATT AAAAN GGTTTAAAAANCCNCCCCTT 

TTTGNAAAAATTTANTGGNTTTTAACCCCCC 

Sequence 2311

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCAC I I I I I I I I I M I I 1 1 1 I AANGCTG 

CTCCTTGAGGATAAGGGCTAACTCACAGGCAGNGCACCAAGAGCCACTATGAAAAGATCC 

TTMTGAGCAAAATATATNCCCTATTATTTTCCTACMGTTGCTTTTTACTTGAGTAGGA 

ACCCTTGATTG 

Sequence 2312 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACCTGGTATGCCAGA 

TGAGAATGACAGGAGCCATCCGCAAGCAGTTGGCGGCTTTCTTAGAAGGCTTCTATGAGA 

TCATTCCAMGCGCCTCATTTCCATCTTCACTGAGCAGGAGTTAGAGCTGCTTATATCAG 

GACTGCCCACCATTGACATCGATGATCTGAAATCCAACACTGAATACCACAAGTACCTGC 

CCGGGCGGCCGGCCGCACI I I I I I I I I I I I I I I I ACCTGAAAATGCTTATTCTAGCTTCA 

CATTTGATTGTTTGGCTMGAAGAAAATTATTTATTAGACTTAATTTTCCTCACGAGT^ 
AAAGATTGCTTCAGATCTTAAACTTCTAATGAGGAAAGCTGAGA
Sequence 2313 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGCAAGCTTCGACCCACGCGTC 

CGCAAAGTTCACCAMTTCGCTACAGCCTMGACTAGCCTCACCAGTCCTTTTTCCCATT 

MTCAAMCTTTGCAGAAGAGACMTGATTTTTACCATTCATTCAACCAGTTTGCACAGA 

GAGAGGCTGMGCCTGACTTGTMGAAACTCTTGTCCTTTTGCCAGTGTGCXAGGTTTCT 

GGGTTCCCCTTCTCTGAGTGGCTTTGATGACCCTGCTTGCTGTGCCATAGCTATGGGGGG 

GGCCAAGCCATGTTACCCAAAAAAAAAAAAAAAAAAAAAGTGCGGCCGGCCGCCCGGGCA 

GGTGTCACTTTCMCTTGGTTATGCCTAAAACAMGTCTCCCTCATCTCCAAACAATTTC 

TCCCGACTTTCTTTCTTTTTTGAGATGGAGTCTTGCTCTGTCGCCCACGCTGGAGTGCAG 

TGGCATGATCTTGGCTCCCTGC 

Sequence 2314 

CCG CGGTG GCGGCCGCA C I I I 1 1 1 1 1 1 1 1 1 1 I I I M 1 1 1 1 1 I M I NAAATGCTCAAAAGA 

ACMTTTTTATGTMGTTTGTGATAGAGGCCTCAGGTAATTCTACAAAAATTAAAACCCA

TTTTCAATGCAAMTTCCCGMCATAAACAMTGCTTTTAAAAATATGGATGGNGTGGTT

ACTCTTTTAGTMTACTTGGATTATCATCAMGATTTMCTTTATTTTTTTGNGTGTGTG

TGTGI I I 1 1 1 1 I 1 1 1 I GNGNGNG I I I I I 1 1 1 1 I I I I I I I 1 1 ATTATACNCTAAGTTTTAG 

GGTACCTGCC 

Sequence 2315 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGTACAAGCTTCGACC 
CACGCGTCCGACTTTTTGTCTTAGACCCAGTTAGGGTCACCTTACAGTGCAGGTGGAAAG 
AAAGCAGGACTGCTGAGAGGAGCTCAGGA 
Sequence 2316 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGTACAAGCTTCGACCCACGCGT 
CCGGATTAMGTCCTACGTGATCTGAGTTCAGACCGGAGTAATCCAGGNNGGTTTCTATC 
TACTTCAAATTCCTCCCTGTACCTGCCCG 
Sequence 2317 

TAGGGCGMTTGGAGCTCCCCGCGGTGGCGGNCGAGGTGTACAANGCTTCGACCX^ACGCG 
TCCGCCACACGTMCTGAGATGCTCCTTTAAATAAAGCGTTTGTGTTNGANGTTAAAAAA 
AAAAAAAANAAAAAAAAAAGT 
Sequence 2318 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCACTTTTNN i M 1 1 I I M I I I M 
ATTATCTAMTCAGNTTTATTTAAGAATTTCCAACANTGACAACTNTTATAAAGGGGCAT 

CCAAGCACAGGACACANAACTGCNACAAACAGCATTCTTACGGACGCGTGGGTCGAAGAC 

CTCGGCCGCT 

Sequence 2319 

CCGCGGTGGCGGCCGAGGTACAGAATGGTAAAAATTCCAATCAGTCAAAAGAGGTCAATG 
MTTAAAAGGCTTGCAACTTTTTCAAAAAAAAAAAAAAAAAAAAGTGCGGCCGGCCGCCC 
GGGCA GGTCGAGCGGCAGCAC I I I I U I I » I I I 1 1 I I I I I II I ATGGNTTTTATTTTTCA 
ATTTTTATTTTGGTTTTCTTACAMGGTTG 

AAAMAAMTTCAMTTTTTGGGGGAGCGGGGGAAGGAGTTAATGAAACTGTATTGCACA 
ATGCTCTGATCMTCCTTCTTTTTCCGGACGCGTGGGTCGAANACCTCGGCCGCTCTAGA 
ACTAGTGGGATCCCC 
Sequence 2320 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCTCTTGTCTAGT 
ATACTCMGGCAGCCTAGTAAATTATTATTTATCTATACAATACTGGAAAAACTNGNAGA 
CAAAAACATGACTTGAATTGCTAAAAAAAAAAAAAAAAAAAANGAGGGAGAATGAAAACT
TCCGGACGCGTGGGTCGAAGCTTGACCT 
Sequence 2321 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTNCATTTACATACAACTGA 
TCCAAACAGGAAGTAAAAGCNTTATGAAAAAAGAACATGATGCAAATCATTTCCCNNNGA 
CAAAAGGAGGGATCTGCGTGAATTACAGCAAATCAAATGATTTCCACATTATAAAAGCAA
GCCATGGCTTCTACAGATATTTTAATTCCTTGGGAGGAAGTTTCACCTNATCGGACGCGT 
GGGTCGAAGACCTGCCCGGGCGGC 
Sequence 2322 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCAAGCTTCGACC

CACGCGTCCGATTTTTGTCTCCTGAGGTCAMCTGTGCCCATCTTTACTGCCCCTGTGTT 

GGTGTCATTCCTTTCAGCGCACTCCATGTTCTGGCAAAACACCCACAGCTCACGTTCTCG 

TTGTAGCCACATTACGCCTTTGCATTACTCTTTTGCCACTGACCTCACTGTCAGTCTTCT 

TGTCACGCTGTTTCCTCCAGGTGACTCTGACATCATTAGAAGCATGCCAGAACAGACTGG

TGAAMGTAAACCTTTTCACCTACAAAATTTCACCTGCAAACCTTAAACCTGCAAAATTT 

TCCTTTAATAAAATTTGCTTGTTTTAAAAAAAAAAAAAAAAAANAAAGTGCGGCCGCTCT 

AGAACTAGnnnnnnnnnnnnnnnnnnnnnnnnnnCXaATATCAAGCTTATCGATACCCGTC 

CGACCTCGAGGGGGG 

Sequence 2323 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTCCG 

CTGATCTGTGATTAAAGCAGTAATATTTTAAGATGGACTGGGAAAAACATNAACTCCTGA 

AGTTAGAAATAAGAATGGTTTGTAAAATCCACAGCTATATCCTGATGCTGGATGGTATTA 

ATCTTGTGTAGTCTTCAACTGGTTAGTGTGAAATAGTTCTGCCACCTCTGACGCACCACT 

GCCAATGCTGTACGTACCTGCCCG 

Sequence 2324 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGTCGACCCACGCGTCCGCAT 

TTTCAGGCTTCTGCAGCTGTAGATTCTCACTGTGAATCCCTTGCTTGCTCATGCATAAGT 

GTATTTGCAATACCAAATATACAGGTTTAGTATTTTGCCTGTTAGTGATTGTTTCACATG

TGTAACGTTTTGGTTGAGATGTTAAATGGTGGACGAGTACCTGCCCG 

Sequence 2325 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCANGGTGGCCGCACTTTT
I I I I 1 1 I I I I I I I I I I ACTATTCGTTAGGCTTTTA I I I 1 1 CTCTATGTTCTGCAGTAACT 
AAGGAAAATCATGGTAAATGTCAATCTTCACACAACAGCAGACACAAAGGGTTTCAGAAA 
CCGGACGCGTGGGTCGAAGCTTGTACACCT 
Sequence 2326 

GGTGTACMGCTTCGACCCACGCGTCCGGTTTCTGAMCCCTTTGTGTCTGCTGTTGTGT 

GAAGATTGACATTTACCATGATTTTCCTTAGTTACTGCAGAACATANAGAAAAATAAAAG 

CCTAACGAATAGTAAAAAAAAAAAAAAAAAAAAGTGCGGCCACCTGCCCGGGCGGCCGGC

CGCACI I I I I I I I I I I I I I I II I I ICACTGTTTMGGMTTTTTATTAAAGCAAGAATTT 

TATMTCCAAATTACGTTTCCTTGCTCAGTTATCMTTCTGTTACTTAAAACAGAACTGA 

CATTCTGAGCTATTCCACAGTAMGAATTACAAAATTAMGAAAGGAATGCTTTAAATTT 

TTGTACCTGCCCG 

Sequence 2327 

CCGCGGTGGCGGCCGCAC I I I I I I I I I I I I I 1 1 I I I I CTTTCTACTTTTAGGGTTTATTT 
ACTGGTAGTTTCAGGTCTGGAAATACAAAAAGGAACATGACATGGGTTCTGCCCTTGAGG 
TGCTCCCAGTCTGGTGAGGGACACAACACCTTTCAGGGGACAACCGTGGTACCT 
Sequence 2328 

TAGGGCGAATTGGAGCTCC CCGCGGTGGCGGCCGCCCGGGCAGGTGCCGCACI I I I I M I 

I 1 I I 1 1 I I I I II I I I I I I I I I I ANAATTTAGCCACTTNTCAGGCCTTNTTCCNNATAATT 

TGGMCTTTCCTTTGGATTTGATCMGTTGGATAGAGTTGATCAAACCCTGATCAAGTTG 

GATAGAGTTGATCAAACCCAATGGGAAAAAGACCAAAACAACANTAAAAAACAGACAACA 

ACAACAAAAAACAGTTAGGCAAAACAAACAATGGCACAATTTATNTGATAACTGGGGTNG 

CTTTMTNGGMNGGANAAATTAAAACNNGGTTGGGCGGTTATTTTNACTTTAACCAAAA 

AANAMCCCCCAATTTTAAGTTNCTTTNCTTANGGGGCGANGTTNTTGGGGTTTTAAA^ 

NGGG I I I I rTC CCTTTT TTAGNAAGGGNGGGGGGNAAAACCCCANCCCNNCCCTNANAAA 

ANCNCCNCNTTnTrrCCNTTNNNGGGAGGGGGNGGCCAAMCNCCCNCCNNTTTNGAAA 
AAAAAGAAAAI I I I I I I 11 I I I M I I AAGCCCNTTTTAACCNCCAAAAAAAANNAAGTGT
TTTCCNNNCGNNNCTTTTNNAAAANNGNGGNACCCCCCNGGNGGGGGGGANNTTTTTTAA
TTTTTTTNNCCCCCCCCTNGGGGGGGGGGCC 
Sequence 2329 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCACGGTACTGAGGAAATATT 

TTGTAMGTGAGCTTTGGGTATMCTTAGCCCCATCATTATTTAGAGAATAGAGGAGGAA

GAAAGAGGAAGGATTTTAMGGCAGACMTGACAGACCATTCAGGATAGGTAGGGTTTTA 

AAGGGAGCGGACGCGTGGGTCGAAGACCT 

Sequence 2330 

CCCCGCGGTGGCGGCCGAGGTACCCTAAAATTTAAAGTATAATAATAATAA A 1 I I I 1 1 II 

TAAAAAAAGAGTGTTGTCTTTGTCTTGTATTTTCTGCAGTTTGCATGTGATATTCTTAGG

TATAGAI I 1 1 I I I I AGTATTTGTCCTGTATATTGTTATTCGAGCTTCTGGGGATCTGTGT 

TTTGGTGTCTATCATTAACTTTGGMTATTCTCAGTCATTACTGCTTCAMCATTCATTC

TGTTGCTTTTTCTCTTCTGGTATTATCATTACACATATATCACACCTTTTGTMTTCTCC 

CACAGTTCATAGATATTCTGTTGTATrrATTTATTTTTTCTCTTT 

AGATTTCTATTGACATCACTTAAAGATGATTGATGAGTTGATGAGAATTGAGAGAATTGA 
TGAGMTTGTTGATGAGAATTATTCATTTGTGTTAGTGTTTTTCATTTCTGCCATTGNCT
TTTGATTTTTAGAGCTTCCATCTCTCTGCTTACA 
Sequence 2331 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAACCTTGACCGTGACCGTTTG
CTATATTCCTTTTTCTATGAAATAATGTGAATGATMTAAAACAGCTTTGACTTGAAAAA 
AAAAAAAAAAAAAAAAGT 
Sequence 2332 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTATTTTATAGAAAAGAA 

MCAGAGGCTCAGAGAGGTTAATTTTTCCTGGATGTCCTAGATGTTAAATGTTACAACCT 

TAATTGACCGATTCCAGAATCAGAGCTATTAMCACAAMCTATTTTAATTCTCTCTAAA 

TTCTTAAAGACCCAAGAAAAACAAACTTTATTGAG ATAATTAGGAA I 1 1 I I 1 1 IAAAATA 

TCGGACGCGTGGGTCGAAGCTTGACCT 

Sequence 2333 

AGGTCAAGCTTCGACCCACGCGTCCGTGGTGAACACAGAGAAGACAGTCTTGTATATATT 

CCTCTGTATTCTGGGGAGCTTTGACCTTGGAGCTTTGTACCTGCCCG 

Sequence 2334 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGCAACAC 

CTATGTTTATAAAAATTTGAAAACATTACATATTGTATTTAAAACTAATTAGNGAAGAGT 

MGAAAAAAACTAGCCAACAGAATTGTAGGTGATGCATTAGTTAAATTTCAAAACTCATA 

ATAMGGAACTTTCAGAGATTGGTTGAAACCCAGTGGTATCCCTGTAAATTAGCTCCTGT 

GACTGGAAAAGACCCCAMAAGGCAGTAGAGGAGATTAGTGTTTACTTGCTGTGGTTGTG 

GTGTGCTGCTACTTMTTATAGGTAGTGACACACTGAMTTCTTATTTGTCCAATAATCT 

GAAGTA GTTTCCTATATTTATCTGTACCTGCCCGGGCGGCCGGCCGCACI I I I I I I I I I I 

TTTTTTGCNGGGGN I Mil I CTTTT Cl I I I I I M ICA 

Sequence 2335 

GGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGCACTNNNTTTTTTN l I I 1 1 I I I I I ICTT 
TAMCACCANTTAGTITATTTCAGGACMGAATTTACCATNTMCANTCTTTTNACATAA 
ATTCTGNCTCCCCCCA CI iTTTTTTTn l GAAN ATAACCATTC C 1 1 1 1 1 II 1 1 
Sequence 2336 

CNMNNGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATCTNTATCAGAGCTTTT 

GGGTGACCAGGCACACTGTCAATGAGCAGTMTACGGGGAMGGMTCTTTGGGGTTTTT 

TTTGGTTTGGTTTTTGTTTTGTCGTTA I I 1 1 I I I GTTTGTTTGTTTTTTGTTTTTGTTTT 

TTTAGCAGTAGGCCTCAACAGTGGACTTAAAATACTCAGTAAACCATGCTGTTAACAGAT 

AAGCTGTCATCCAGACTTTGTTGTTCCATTTCTAGAGCACAGAGCAGATTTAGCAGAATT

CTTMGGCTTTAGGATTTTCAGAATGGTAAATGAGCACTGGCTTCAACTTAGTCACCAGC 





TGCATTGGCCCCTAACAAGACAGTCAGCCTGGCCTTTGAAGCGTTGCAGCC 
Sequence 2337 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAAACTGCACATATTTAA 

MCATATMTTTGATACATTTTGACTCACAAAACAATCACCACAATCAAGANGATGAGNN 

TATAGATCACTCCCAMAGTTTCCCTGTAGTCTTTTGCAGTCCTTTCTTCATGGCCTTCT 

TCATCCATCCACCCCATCTCGGTAACCAATGATCTGCTTTCTGTCACCACAAATTAGTGA 

GCACTGTCTAGAATTTTATGTAMCTGMTMTAMGATTTTACTCTTTCG 

Sequence 2338 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTC 
CGCACI M I I 11 I I 1 1 I 1 I 1 I I I I 1 1 I I 1 I 1 11 1 I I 1 I I NGGAATCATCATTAACTTTAT 
TTGNCACTNTTGATAGACATTGGTCCACTCCAACATAAAMGNAGAATTCACCCACTTCC 
ACTTAATATTCTATAGAATGAAGTTGTACCTGCCCGGGCGGCCGGCCGCCCGGGCAGGT 
Sequence 2339 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAAG AI 1 1 1 I I 1 A CACAA 

AGAMCTTMTGCTGTATTMTAAATTCAGTGTGTAGCTTCAATTGGGATAGTTCCAAAA 

GTGAAGATTTTGTGAGGAATAAGTGCAAA I I I I I I I i I I ATTTTAAAAAATTCTTTG AAA 

CTCTTAAGTCTTTGTGTCTGCAATAAAATTGTACCTGCCCGGGCGGCCGGCCGCACTTTT

I 1 I 1 1 1 I I I I I M 1 1 II 11 1 1 TTTNGAMNGTTTGMGTTMCTCATTTTATTTNTAGGA 

TTNGGATTTCAACATTTTMTTTNTTTGGMTATAAGTC 

AGAATCAAA 

Sequence 2340 

TTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGGACTTGATTCATAACCAACCA 
ACCCAGGCTGTCATGCTAAAAAAAAGGGCAGAATCTTTGCTGCTGAACTACTGGATAGGG 
TGACTACCATTGCTCCTTTAGTTTAGTTTGGCTAGCAGAAGGTGGTCTTGCCATGTAAAT 
A AAGCCTCTC AGGTAATCAAAATGTTTC 1 1 1 M I I ACTTTTGCTGGTG I I I 1 1 1 I CTTTT 
CM I 1 1 I I I ICAGC TACA GGAATTTAGCCAATTCAGAGGAAATCTTCCCCATAATTATGG 
MCTTTCTTACAGATTTTACCAAGTCTGGTCAACCCAATAAGAAAAAGACTGAAATAACA 
ACAACAACTTCAACAAATAAAAAAAAACAGTTAAGCTAAATAAACAGATG 
Sequence 2341 

AGGTACTTCTTACATAGTGATTGATGTCTCATGTCTCCCTAAAATGTATAAAACCAAGCT 

GTGGCCGGACCACCTTGGGCACATGTCATCAGGACTTCTTGAGGCTATGTCACTGGGCAT 

GTCTTCAACCTTGGCAAMTAAACTTTCTAAATTAATTGAGACCTGTCTCAMTTTTGGG 

GGTTCACAGGTGAGTGGGCTCAGGCATGTGCACTAGTATGACTAAAGGTCATAGACTATT 

AGACTATTAGTCTATGACCTTCCTCTAGAAACACTCGACTGGTAAGGGAAGAATGCCTCA 

ACTGAGCATGTGCACAACTCCCATAAACACACTTGTGCTTGCGGAGCCTNTCAAGTGCTG 

GCAGGCCACTGCTCAGGTGGATTCTTCCCTCCTACCCGGAGGGAAGAATCAGGGGAGAAG 

GGACACAAGCCCCTGAATGCATGCCAACACGTAAA 

Sequence 2342 

CCGCGGTGGCGGCCGAGGTACCTTCAGGTATTGCCTATTTAATGATCATATATACTTGCA 
TMTATCATCCCTTCCCTTGATTTCTTTCAATCTAAAAATAAATATGAGAAAAACAAAAA 
AAAAAAAAAAAAAGT 
Sequence 2343 

ACTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGAGGACTCAAGACC 

AGATGGCCAACTAGAAGCAGCCAGGAAGAACATCTCTCATGGAGAGACCAGGAATTTGGG 

AAGACTGGCACACTCTGAGCAGATCTTTTGAAGGAAAACATTGAGGGTGGATGAAGGGAG 

GATGCAGAGCATGGGCTGAGGGGGCAGGAATCTCAGAAGCCTGCACAGGGCTTCCAAGCA 

CTAGCATTCCTTACTAGCCCCCATTAACTCCTGGGGAAGGGGTGAGTTGAATAGGTGGNG 

GAGTGGCCCGCTCTTACCATGAAACTCCAGAATCCTAGCAGCAAGAGACCCCATGACCCC 

TGTGGACACGAGCTGTCCGGACGCGTGGGTCGAAGACCTGCCCG 

Sequence 2344 

CCGCGGTGGCGGCCGCTNNCTGCCCGGGCGGCCGGCCGCACI I I I I I I 1 1 I I CI I I H GG 
AAAMT^NNNGACTCTGGGGATAAMNTNCNMTTAAATNNATNCNANriTTTTAAAGGC

TATTMNNANAANAATATTNGCTAAATTNNCCTTNTGCATAACAAACTGTGGNTNTACCA 

TGTAANGTTTAAAAAAATGTNTAACCNCAATTTTACGCTCCTCTGTNACANGACAAGGAC 

TCCATTCANTGNCATTTAAGAACTNAATGGGTTGAN 

Sequence 2345 

AGGTCAAGCTTCGACCCACGCGTCCGATAAGCCAAAAAATGGGAGACAATTTCACATGGA

CTTTGGAAAATATTTTTTTCCTTTGCATTCATCTCTCAMCTTAGTTTTTATCTTTGACC 

AACCGAACATGACCAAAAACCAAAAGTGCATTCAACCTTACCAAAAAAAAAAAAAAAGTG 

CGGCCGGCCGCACTTTGTTTCTACTGGGTTTAGACCGTCGTGAGACAGGTTAGTTTTACC 

CTACTGATGATGTGTTGTTGCCATGGTAATCCTGCTCAGTACCT 

Sequence 2346 

CCGGGCAGGTACCTGCTCCATTTCTCCTGCAACATGTGGATACAGTAATATGATCATACC 

CTCCCTTGTTCCCCTCTAGGCCACTTTCCCCTTTAAATATTAACACCATCATAATCATCT 

TTGGAGAAAGACACCTGGATCTGTCCTGAACCTTGGCAAAAAATAAATAAATAAATAAAA 

ATAAAAMCTCTTCTCAATTGATTMTACCTGTCACAGATACATTTTGGTTTACAAATCA 

ATGAACAATGGAGGGAACTCTGTCCTTAATCTTGGTACCT 

Sequence 2347 

CCGCGGTGGCGGCCGAGGTCTCTTGTTTGNTATTACACTTCTACGTAGATTATATAATAT 

TGCTTGTGGACATAATTTGATCAATAATATATAATGTCAGTCACTACAGTGATCCAGAAT 

CTTATTCTGGCTATGGAGGAAGCTTMTTATTAAGCMCATCTTCCTAAAAAGCTTNTGA 

ATTTCTGATTCATAAGAAAAACAAAACAAATGAAAAGAGTATCTNTAACTGAAATAACAC 

TGAAGTTCGAGCTTGGGCCCCCTCCTTTGTGTTCAACATAATTAACTNTTCAAGATGAAA 

CCGGACGCGTGGGTCGAAGCTTGACCTGCCXJGGGCGGCCGGCCGCCCGGGCAGGTGTTTN 

TTCTGGGATATCTTTTTCTTCTGGGCAACCTCCTCTTCTGGTTTAGGAACAATCTGTTCC

TTTTCCGNAAGGATCATCTCAATGTGGCAGGGAGAGCTCAT 

Sequence 2348 

ACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCTTCGACCCACGC 

GTCCGGGCAATTATCAAAAACACTTGGAAAAAGATTTTTATTCCTACTTTTAAACATACA 

TCAAAATCTAAAATAAAACTAGGCACCTTCAGCTGGGCCTGGTGGCTCATGCCTGTAATC 

CCAGCACTTTGGGAGGCTGAAGTGGGCAGATCACTGGATGTCAGAAGTTCGAGACCAGCC 

TGGCCTACATGGCGAAACCCCCTNTCTACTAAAAATACAAAAATTAGCCGGGCTTGGTGG 

TGGGGACCTGTAATCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATCACTTGAACCTGGG 

AGGNGGAGGTTGCAGTTAGCTGAGATCACACCACTGCACTNCAGCCTGGGCCACAGAGCA 

GGACTCCATCTTAGAACAAAAAACAAACAAACAAACAAACCTCATGCACCTTCAAGAAAA 

TCAMCMGTTTTTATCTAATTAAAGAAAGAATTTT 

Sequence 2349 

AACACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGC 

GTNCGCTTTAACACACACTAGGCTCTTTGTGTATTATGATNCAGTGCTATTTGTAACTGT 

GTCCCAGNGACCAAATTGCACTCGACTCGATCAGCTGTTCATCCATTTCGTG I 1 1 1 1 I CC 

TGTCAMCATTMTCCAGCAAATATATGAGGTATTTACCMTTTATTTTCTTAGTATTAC 

AAAATAATTCATTAGCATAAAGTACCTGCCCG 

Sequence 2350 

CTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCGGGCAGGTAAATCATAGAGC 

TGCCCCAACATCTAGACAGTCTCTCCTACTGATTATAAATGAGTGAAAACTATCAGTTAG 

AAAAATCTAATTTMGTTGTTAATACATGTTTCTTTGGTGAGCACCTGGATATATTTATC 

ACAMTTCTTTTATACAAATGTCGAAAATGCTTTCAACAAACCTAAGTGTCCTAATTACA 

TGCCACTTTTAAGCATCACTTTAAGGTAAACAAAAATGAAAACCATAATTTTAAATTAAA 

ATTTGCGGACGCGGGGGGTCGACTCAAGACCTCGGCCGCTCTAGAACTAG 

Sequence 2351 

CCGCGGTGGCGGCCGCCCGGGCAGGTGCTTCGACCCACGCGTCCGGATGGCTTGGGTCAT 
CAGGACGTCCATTACATCCAAAGGAAGACAGCCTGTGACGTTTCAAAAGCAAAAGTCCCC 
TACCAGCCAGTGAAGCTACCTGATTTCTCAGTATCTTACGCCCAGNGACACGATCTACCC
TCAAAACTTAAAAAAAAAAAAAAGGGAAACATAAACACATAACAGCAGACCTN 
Sequence 2352 

GGGCCGNGGNCACAACATTCCCCCTTCCCCAAACAGTAATATGGACACTGATTTAACANG 

ACTTATAAAMAATAAGGCNCATTTATTTTGATNTGGTMTTTTAAAATAGAMCCCCTT 

C 

Sequence 2353 

GCGNGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTNC 

GAGCTTGAGTCGAC(XACGCGTCCGTGAAAATGTTGTCTCCrTTCTAMTTCTCTGCCG^ 

TTTGGGAAAMGCAMCTTGACTTTACCCCGAGGMTTGGTGAAAMTTACTTTTACGCC 

TTGCAGCTGTGGMCTTGGTCTTACAGCCTCTGCTCTTCTGCCCAAACGGGCCATGCAGT 

TTGGATCAAGAATTGCAAAAATGGAAAAAATTAATGAAAAGGCATCTGATAAATGTGGAC 

GGCTCCAMTCATGTCCTTAGAAMTCTTTCTATTGAAAAGGAGACTAAATTGTAATGTG 

ATTCACMTGTAACAATATAAAAATAAGTTTTTATATAATTATTAAAAGGNAGATACTCT 

GGTGCTTTACTATTGGATAAAATAAGTAMCCTGCCCGGGCGGGCCGGCCGACTTTTTTT 

TTTTT 

Sequence 2354 

TCCCCGCGGTGGCGGCCGAGGTTTCAAAGACCNGCCTGGCCAACATGGTGAAACCCCATC 

TCTACTAAAAATATAAAAATCAGCCGGGCATGGTGGCATGTGCCTGTAATCCCAGCTACT 

CAGGAGTCTGAGGAGGAGAATCACTTGAACCTGGAGGCAGAGGTTGCAGTGAGTCGAGGT 

TGCGCTACTGCACTCCAGCCTGGACAACAGAGGGAGACTCTGTCTCAAAAAAAAAAAACC 

TACAGCTGTTCAAGGACCAGCTGCAGGGNCAAGNGGGGGCCTTTTTTGGTCTTTGAACAC 

ATCATAGAAAGNGGNCAAATGCTGCAAAGCCATGAAGAACATGAACCTTTAACCGGGTAG

ACTAACTGCC CCACTTANACNC I MINI I GCCCCCAAAAACAAGNTTGATTTTGCCCCT 

i i ii i i i ii TTOTCCATTGGGGGGAAGTTTTGGAAAAANAAAANGGGGCCNCCCCCCCCT 

i i I I 1 I I I I NNNNCNAAAAANCC I I 1 1 I I I I GGTTTTTTTAAAAAAAAAAA 

Sequence 2355 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCGGGCAGGGCCAGAGACTTC 
AAGTCTATCTGAAAAGTCTCCAGAGGTCTAACCCCAGATAAATAGCCAACAGGGTGTAGA

GTACATTTTACACCCCAAAGAGTGTGCCCCATGGTGATGAAAATAAAGTGAACATGTTGC 

AAAATGAAAAAAAAAACCT 

Sequence 2356 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCCAACACAAACTATTCAATAAAGTAATCTGCT 

TTAAAAATAAAACACACTGAAAGGCCGAGGCAGGTGGATCACCTGACATCATTAGTTCAA 

GACCAGTGTGGCCAAACTGGTGAAAATTAGTCTCGACTAAAAATACAAACATTAGCTGGG 

CGTGGTGGCAGGCGCCTCTAATTCCAGCTACTCAGGAGGATGAGGCAGGAGAATCACTTG 

AAGCAAGGAGGTGGAAGTTGCAGTGAGCTGAGATCGTGCCATTGCACTGCAGCCTGGGCA 

ACAGAGTGAGACTCCGTCTCAAAAACACCACCACCAACAAAATAAACACAACAGAATTAT 

TCTGCAMTACAGATATTGGAGTAGCTGAGTTNCATCTCAMTTTGACTATGCAGGTTGC 

AGGGTGATCTTGGCCMCTACTTATTCTTTTNTGAAGTTCMCTTTTTT 

Sequence 2357 

CCGCGGTGGCGGCCCGAGCAAGTGGGCCTGTAGCCCGACTCTTAATCCAGGTTGGTGCTA 

TTCAMGAGATCATCTTTCACCCGAGGGATTTCTGGGCATCTATTTTGCGGATCAGAAAG 

TAGAGAMGAAGGTAACTTTGCTGAAAGCTAGTCTGGGGAGTTAGTAGCTGATACAGATC 

AGCATTTCCTAACTATGAGATTTCATAATATTCTCTCTTGTCTCGATTCTGAGTCACTGG 

TGCCTGCTGTGGTGGCATTGTTCATGAACATGTACCTGCCCG 

Sequence 2358 

AGGTCATGTGCACATTGTGCAGGTTAGTTACATATGTATACATGTGCCATGCTGGTGCGC 
TGCACCCACTAACTCGTCATCTAGCATTAGGTATATCTCCCAATGCTATCCCTCCCGCCT 
CCTCCCACCCCACAACAGTCCCCAGAGGTGATATTCCCCTTCCTGTGTCCATGTGTTCTC 
ATTGTTCAATTCCCACCTATGAGTGAGAATATGCAGTGTTTGGTTTTTTGTTCTTGCG 
AGTTTACTGAGAATGATGATTTCCMTTTCATCCACGTCCCTACAAAGGACATGAACTTG

AGAATTCTTAATGCAGTGCTTTAATACAGTAAAAATTTTTAGTCTTTGTTTTCT 

TGCATTTGAAAAGTGCACTCTTGATCTTGNATTTTCTTTCCTTTCCTTTAGMTCATAT^ 

AGCAGTTGGACGTATATATAAAATATTAAGTGTACCTGCCCGGGCCGGCCGGTCTAGAAC 

TAGGTGGGATC 

Sequence 2359 

AGGTGTACAAGCTTCGACCCACGCGTCCGAGCAGAACTTGGCAGCAACAGAGGAAGGGCC 

CCTGGAGCCGGCTGTCGTGGATGCCTTTMTCMGCCTGGCATTTGGTTGCTCACGAATG 

TCXX)MCTCTTCCGCTAGGCCCATCATGCTCAGGCTGCCX)MGGCTTTTCTGTCACCTCT 

TTTGTTCTCTCACACTGACCAGTCTTACCTGCCCGGGCGGCCGGCCGCCCGGGCAGGTAC 

CMGTGMT1TAAATA ATTGGT GTGGATTGGCCAGTAGCTAAGTGGGCTTTTAAAGAGTA 

TTGAAGATTGAAAGGG I I 1 1 I CTTTCTTTTTTAAAAAAGAAAAACAAACTATTGATTGTA 

GATMTGA AAAGCT AGGGTFfGCCCTCTTCATGTCTACTCTCCTTCCAAATAGTTATATC 

CAAMCTGTTTTTTCCTNTCCCCTACCTTGTCC 

Sequence 2360 

CGAGGTACACAGCTATGCACTTTCCGTTTCTGACTTTTGCCACCCTGTCAGCCATGGGGA 

GCCCACTGTGGGACTGAAACCCTGAGCTGAATGCGGCCTCATGTCTCAGAGAAACACTGG 

CAAGTTGGTCAGAGCCGCGTCTGCATCGAGGCGTAGCTGANCGGCAGGATGGGGGGCTGC 

CTGCCCAGGGTCTCTCACCGTGGTGTAAGCAGAGCCATGGCTNGCCTAGGACCCTATAGA 

TACCATCACTCTTTCTCAGCTCGACTGGAGTTTGCACCTTTGCAGGGGCAAAGTAACTCC 

CTGCACCCTGAACCACCCCCCATTCCTGTTCATTTCAGCAGATAATGATGGAGGGGGGGG 

Sequence 2361 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTCAAGCTTCGACCCACGCG 

TCCGCTATCATATTTCAAATCCTAATGGGAGACAAAACAAAAGTGATGGTCAGTATTTCT 

TTGAAATTCTACAAGGAACTTCAGGCACATGGTGCTGATGAGTTATTAAAGAGGGTGTAC

GGGAGTTTTTGTAMTCCAGAATCAGGATACAATGTCTCTTTGCTATATGACCTTGAAAA 

TCTTCCGGCATCCMGGATTCCATTGTGCATCMGCTGGCATGTTGMGCGAMTTGTTT 

TGCCTCTGTCTTTGAAAAATACTTCCAATTCCAAGAAGAGGGCAAGGAAGGAGAGAACAG 

GGCAGTTATCCATTATAGGGATGATGAGACCATGTATGTTGAGTCTAAAAAGGACAGAGT 

CACAGTAGTCTTCAGCACAGTGTTTAAGGATGACGACGATGTGGTCATTGGAAAGGTGTT 

CATGCAGGAGTTCAAAGAAGGACGCAGAGCCAGCCACACAGCCCCACANGTCCTNTTTAG 

CCACACCTGCCCCGGGGGGCCGNTTGAGGGCCNCCCGGCAGGGNNNGTNNTAACANNGTA 

ACATGTATTATGGNMCCTTTMTCTTGGGGGNCTTANACAGGCTNNTNCCAANGGNTTA 

ANNAAAGNTTGNTTTNAAAAAAAAAAAMGGGATGGGTNTTTTTNAAAAAAAAAAAAAAA 

Sequence 2362 

CCGCGGTGGCGGCCGCCGGGCAGGTACAGCCTCACATACACAGATGCAGGTGAAGTCACC 

AAAGCTGATCTCTCATTCGTTCTGGGGACAGTTAGCAGCGTAGTGGTCCCACTGCAGCAA

AAGTTTGAMTTCATTTTCTTCAGGAAAATACCCAGCCAGTCTCTCAGTGGAAACCCTGG 

TTATGTCGTGGGGCTCCCATTAGCTGCTGGATTCCAGCCTCATAAGGGTGGAGCTCTCCC 

GTGTCAGCTCGTAGCACAGAAGGTGAAGAGCCTGCTGTGGGGCCAGTGCTTCCAGATTAC 

GTGGCCCCTTTTGGAAATTCCCAGGCCCAGGACATGCTGGACTGGGTGCCCATCCACTTC 

ATCACCCAGTCATTCMCAGGMGGATTCCTGCCAGCTCCCAGGGGCTTTGGTTATAGAA 

GTGAAGTGGACTAAATACGGATCCCTGCTGAATCCACAGGCCAAAATAGTCAATGTAACT 

GCAMTCTMTTTCATCCTTCTTTNCTGAGGCCAACTCAGGAAATGAAAGGGCGATTCTT 

ATTTNCACTGGNGGTACNTTTGNGGATGTGTCTGCACCTGCAGANGCAGGCTTTAAAGCT 

TCANCAGNCATTAATGGCANGCTGGCCNTTAACTTT 

Sequence 2363 

TAGGGCGAATTGGA GCTCCCCGCGGTGGCGGCCCGCACTTT N I M » I I I I I I I I I I I I I I 
MMMMIM I I M I N I H l llllHl l l l lllUMlll l lMM I un i ni I N 
1 1 1 I I I I I I I I I M I I I 1 1 1 1 1 I I I I 1 I I I I I I I I I I NNAAAAAAAANTTTNAANGGGGG

AAAAAMNNGGGNAAAMNTTNNGGGGNNGGGNTTNAAAAAAAAAAAANCCCCNNNNCNN 

AAAAAAAAANNGGNCCCCCCGGGGGGGGNNAANNTNNAAAAAANTTTNTNCCCCCCCCCC 

CCCCGGGGGGGGGNCCCCCCCCCNNNTTTTTTNNNNNTTNANNGGGGGGAAAANNNNCCC 

NGGGGGAAAAAAAAANNAAAAAATTTTTTCCGGGGGAAAAMNNTTTCCCCCCCAAAAAA 

TCCCCAAAAAAAANNGGNNGGGGGNAAAAAAAAANAAAACNGGGGGGNCCCAAAGGGGGG 

CCANCCNAMNTTTGGGGGGGGNNNNNACGCCCCTTTTTAAANNNANAAMCCNTNNGCC

CCNT7TTAAAMAAMCCCCCCCCCCCCGGGGGNGGGGGGTTTCCNAAAGGGCCCCCTTC 

CCCTTTCAAAAAAAAAAMAAAGGGGGAGNGNNNNGGGGGGGGGGGGGGGG7TTTTTTCC 

MNANMGGGGGGGNTNTANTNTTACCAAAAAAMNGGGGGMCCCNCAAAAAAAATTTT 

TNNAAAAAGGCCCNAANGGCCCAANGCNAAAAAAGGGCNGGGGGNGGGGGTTTAANAAAA 

GNCCCCCCCCCCGNGANAAAAAAAA 

Sequence 2364 

CCGCGGTGGCGGCCCGAGGTCTTCGACCCACGCGTCCGCACTGAGTGTTCACCATAATTT

GAGATTCTTGGCATGTTMCTTTTCATTATGGAATATTGAATAATTTCAATATTATTCAT 

ACATTTCTTTATGTTCAAACATACACAAAAATAGAATAATGAACTCTACCCATCACCCAG 

CTGCMCAAATATCMTACTTTACCGTTCTTAATACATCTMCCCCTTACTTTTTGTTTG

TTTCTTTTGGTGAAGTATTTAATTGTAA I 1 1 I I I 1 1 1 AAGAGACAGGATCTCACTCTGTC 

ACCCAGGCCAGAGTGCAGTGGTACCTGCCCG 

Sequence 2365 

CGCACI I I I I I I I I I I I I! I 1 1 1 I AATAACAAACACTTATCCAACACTTAGTATGTGGCA 

GGCACTGTTTCAAGCACTTTACACATACAAACTCATCCCGGACGCNTGGGTNNAAGCTNG 
TNCACCNA 

Sequence 2366 

CCGCGGTGGCGGCCCGAGGTCAAGCTTCGACCCACGCGTCCGGATTATTTCAGCTCTTGA 

CCTGTCCCCTCTGGCTGCCTCTGAGTCTGAATCTCCCAAAGAGAGAAACCAATTTCTAAG 

AGGACTGGATTGCAGMGACTCGGGGACMCATTTGTCCAAGATCTTAAATGTTATATTG 

ATAACCATGCTCAGCMTGAGCTATTAGATTCATTTTGGGAMTCTCCATAATTTCAATT 

TGTAMCTTTGTTAAGACCTGTCTACATTGTTATATGTGTGTGACTTGAGTAATGTTATC

MCGTTTTTGTAMTATTTACTATGTTTTTCTATTAGCTAMTTC 

TGCCCGGGCGGCCGGCCGCCCGGGCAGGTACCCTAATAAAGGCAGCAAAATGCATTAATC 

CACTATGMTGGAGTTTTACATTTTAATTTATGCCTAATATTTATAMAGMTTTCATTC 

ATAGGCTACTCACAGTTGTTATCTGACGCTTACAGAAGTGGTAAACAACCAATTGCTAGT 

TCAAGTAGTTTCCTCATGACATCTAATGGTAAGCAAAAATTAGTATGCATATTTCAACAT 

CCCAGTNACCAATCTTTTTAAAATGGA 

Sequence 2367 

CGAGGTGTCMGCTTCGACCCACGCGTCCCGACTTTTTGTCTTAGACCCAGTTAGGGTCA 

CCTTACAGTGCAGGTGGAAAGAAAGCAGGACTGCTGAGAGGAGCTCAGGACCCATnTCC 

AGGACTATTGCTTCTCAAAACTTTGGAGAGCAGGAAAATAGATTCCCAAGTGAAAGAGGT 

GGCAGAANTAAAAAAAAAAAAAAAAAAAAAGT 

Sequence 2368 

CGCCCGGCAGGTACACAGTTCTGACTGCAATACCTTTTTCAGACTGCAAAGGGAGCTCAG 

GATCCAGAAGTCATTAAAAGAACGGACGCGTGGGTCGAAGCTTGTACCT 

Sequence 2369 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTCAAGCTTCGACCCACGCGT 

CCGCAGAAATACTGAMGACTTTTGCCTAAAGTGGCATTATTGACTGCTGGTGTGATGCT 

ACTGTAATGTGATAAATTATTAAATTGTTGCAAAGTGCAAAAAAAAAAAAAAAAAAAAGA 

CCTGCCGCGGC 

Sequence 2370 

CGCACI I 1 1 I | | 11 [ I 1 I 1 1 1 I I M 1 1 | | n 11 II I I I I A AAAAAAGCCTCATTATCCTG 
TAGTCCATTTTGGAAAGTAAAGCCCAAGAAAGCAAAAGATGAAGGTTCTAAAGCTAGTTT 
GACTGACCTCAGAGTCCTCGGCCGCTCTAG
Sequence 2371 

TTCGTGATAACTTCTCCTAAGTGCCAGGCATTGTATTACATGCTGNGAGCACANAGATGA 

ATAATANCAATAGGTTCACANAAAAGATGAATTGATTGAGAGAAAAAGA 

Sequence 2372 

CGAGGTACCCAAAAGATATCACTGTGAAGGTTTGGATACACTGACTGAGGAAAAAAGAAG 

GTCCTGAAAGCGTCTAGACAAAAAAAGACTACTTGTAAGTTGCAAGAATCAGAATGGCAT 

TGGACTTCTCAGCTTTCCTCATTAGAAGTTAGATCTGAAGCAATCTTTAAACTCGTGAGG 

AAAATTAAGTCTAATAAATAATTTTCTTCTTAGCCAAACAATCAAATGTGAAGCTAGAAT

AAGCATTTTCAGGTAAAAAAAAAAAAAAAAAAAGTGCGGCCGGCCGCCCGGGCAGGTACA 

TAATATACAGAGGTATAATCTGTAACATCAATAATGTAAAGTGGGGAAGGGCAAGGTGGA 

AAAGGAGTAGAATGCTTGTATGTGACTAAAATTATGTTGGTATCAGTTTAAAATATATTA 

TTATAACTTTAGAATGCTATACCCATTCCCACAGTAATTCCCATAGTAACCAAAAAGAAA 

ATATCTGTAGGATCACACAAAAGAAAATCAGAAGTAGATGCAAACTTGTCACTACAGGAA 
AAAAAAA 

Sequence 2373 

CGAGGTGTACAAGCTTCGACCCACGCGTCCGAAGAAGGCTCTCCTCTGTTCCAGGAGAAG 
GAAGGGACAGATGAGAAGTCACTTCAAGTTCCCAGAATACTCAGGAGCTGAACTTGTCAA 
GGTTTAGATGTGGCAAAGCAGGCCAGGCATGGTGACTCATGCATGTAATCCCAGCATTTT 
GGGAGGCCAAGGCAGGAGGATCACTTGAGCCCAGGAGTTTGAAAGCAGCCCGGGCAACAT 

AGTGAAACCTCATCTCTAGAAAAAATACAAAAAAATTAGCCAGGCGTGTTGGTGTACCTG 
CCCG 

Sequence 2374

CGCGGTGGCGGCCCGAGGTACATCTGCAAGCTTTAAAGCAGTAGGTTCAGACTTCCTGGA 

AGAACTGACACTTGAAGCTGACTAGGGNCTACTTGANCNCATACTCACTTTGGCTAAGCC 

ACAGTATGAGGGAAGGGTGTCAGGAATAACNCTTCCATTTTTATNTTGTTCATTCCCGAA 

AATCCMCAGGAGATTCTTTTCACTCCCTAAAATNAACTGNTCTGTGTATAAAGCATATC 

TGGATATCTTGAT CTTAAA TGGAAATGGTATNTGAAAANNGCNCCNACTTTTCTAAAACT 

TTAAMTTGG(XCTTTTTTATTTTTAGCCCTGGGGGAGGAGGGAGGGAATGATTCCCAAA 

AAACTGACTGGTTTCCTTGNTGCACTGTATTAATTACTGGTGATTATTTTTTGGGGGGGG 

NAATTTAGCNGGAAAAAA I I 1 1 1 1 1 1 I CAGGGTTAAAACGCGCCTNNTAAAAANTTGGGA 

MTGGNCGNAAMGGGGGCCCTTTTTTTCCCCCGCCGNTTTGGGGGNAACCCCCGCCCGG 

TTTGGMNGGTNMCNCCCTTTNA AAAAA NAAAAAACTTGNTTTCAAGG 1 1 1 1 1 1 1 1 IC C 

AAAACCGAAAAATTTGGGAAGGTCTTTTTATATAAANCTTGGAAACCCGCCCCCGGGGTC 

AAMCCC^CGNNNGGCCGTTTTAAAAAMTGGTACCCCCGCCNTCCGGN^CANAATT 

AAACTTTTTTNCCCCCNNCNCTGGGGGGGGCC 

Sequence 2375 

CTACTATAGGGCGAATTGGGAGCTCCCCGCGGTGGGCGGCCCGAGGTCTTCGACCCACGC 

GTCCCGAGAATAGCTACTGAAGTCCTAAAGAGCAAGCCTAACTCAAGCCATTGGCACACA 

GGCATTAGACAGAAAGCTGGAAGTTGAAATGGTGGAGTCCAACTTGCCTGGCCAGCTTAA 

TGGTTCTGTCTGGTAACGTTTTTATCCATGGATGACTTGCTTGGGTAAGGACATGAAGAC

AGTTCCTGTCATACCTTTTAAAGGTATGGAGAGTCGGCTTGACTACACTGTGTGGAGCAA

GTTTTAAAGAAGCAAAGGACTCAGAATTCATGATTGAAGAAATGCAGGCAGACCTGTTAT

CCTAMCTAGGGTTTTTACCTGCCCGGGCGGCCGGCCGCCCGGGCAGGTACCCAACACAA 

ACTATTCAATAAAGTAATCTGCTTTAAAAATAAAACACACTGAAAGGCCCAGGCAGGTGG 

ATCACCTGACATCATTAGTTCAAGACCAGTGTGGCCAAACTGGTGAAAATTAGTCTCGAC 

TAAAAATACAAACATTAGCTGGGCGTGGTGGCAGGCGCCTNTAATTCACTACTCAGGAGG 

ATGAGGCAGGAGAATCACTTGAAGCCAGGAGGTGGAAATTGCAGTGAGCTNAAATCGTCC 

ATTGCACTTGAACCTGGCAACAAAATGGGGACTCCGTTNAAAAACCCCCCCCCA 

Sequence 2376 

CCGCGGTGGCGGCCGAGGTATCACGAGCGGCCGGCCGCCCGGGCAGGTCAATCATAGAGC 
TGCCCCAACATCTAGACAGTCTCTCCTACTGATTATAAATGAGTGAAAACTATCAGTTAG

AAAMTC TMTT TMGTTGTTMTACATGTTTCTTTGGTGAGCACCTGGATATATTTATC 

ACAMTTCTTTTATACAMTGTCGAAMTGGCTTTCAACAAACCTAAGTGTCCTAATTAC 

ATGCCACTTTTMGCATCNNTTTMGGGTMCCAAAAATGG^MCCCTTATTTTNMTTA 

AAATTTNGGGNCCCGGGGTTTTANTTAMNNACT^^ 

CCCCGNGNGGGGGNAMTTTTTTTATATAAA I I I I I I I I i NCCCCCCCCCCCCCTGGGGG 
GGGNGCX^CCCCCCCCCCNI I I I i I I I I 1 1 I I ! I I I I I NNGGGGGGANNAAACTCNNCNCG 
GGGGATAMNANANGGATMTNTTTTTTTNNTTNGGAAATN I 1 1 I 1 1 1 I I I N 
Sequence 2377 

GCG AATTGGAGCTCNCCGCGGTGGCGGCCGCA C I I I I I I I I > I 1 1 1 I I I I 1 » I 1 1 I A AAG 
CACTGGTGCTCTTCTTAAGTGAAATGTTAACACAGGGCTCAATACATAAAAGAGAAAGTG 
MGTTGTTCXATTTGGGGGGTC^CATAAGGGGCCTCATGTTTCCCTAGGTGTTACCCCTT 
CAGACACAGCACATGCCTACAAAGNGGACNCGTGGGGTCGAAGCACCTGCCCG 
Sequence 2378 

CTATAGGGCGAATTGGAGCTCCCCGCGGCGGCGGCCGAGGTCAAGCTTCGACCCACGCGT

CCGGTCTAT7TTGATTNTGGGGGGTNATCAGCATTATTCTTCAGMGGGGACCTGTTTTC 

TTCAAGGGAAGAAACACTCTTATTCCCAAACTACAGAATAATGTGTNAAACATGCTAAAT 

AGTTCTATCAGGAAMCAAANCACTGTNTTTATCTCCGNAGGCTATTTGNTCAGAGAGGC 

C"nTTGNTTAMTA TAAAT GTTTAAATATAAATGTTTGTCTGGATTGGCTATAACATGTC 

TTrCAGCATTAGGCTTTTMGAAACACAGGGTNTTTGTATTCTITTACT 

GCTNTTAATGTTGNTTANATGAGGGNGANTGTNAAGTACCTGCCCGGGCGGCCGGCCGCC 

CGGNCAGG TCTT NGACCCACGCGTNCGGGCNATTATCAAAMCACTTGGAAAAAGATTTT 

TATTCCTACTTTTAAACATACATCAAAATCTAAAATAAAACTA 

Sequence 2379 

CCGGGCAGGTCTTCGACCCACGCGTCCGATTGAAGCCTCTCTGAAGTTAAACCCAACTAT 

GTTTATTAAAATGTGTGAAACTGAAAGTGGGCTAGGTTCTACCAAGGCTGTGGAACTCTC 

CTACGAGTTCTGCTGATCAGGAMTTTMGAATTTATCTTAAAAATGCAAGGAAAAAAGA 

CTGCCTTGGCAATTGTGAATGCTGCTTTCAATCTCCTAGCACCGAGCCTGGCACTTAGGC 

AGCTTTCAGTAAGTGGGTGAATGAATGACTGAATGAATGAATGAATGGCTCAGCTGAGGA 

ATGTAACTTTGGTCAAGACCT 

Sequence 2380 

CCCCGGCGGGTTGGGCCGGGCCCCGCCCCCGGGGCCCGGGTTACCATTAATTATTACCAG 
GAGGGTTNTTAATTCTGGTTAACCATTCAATTAATTGNTNAAAAGGTGGGGGGNAANGGG 
CCAAAGGGTGGGGAA 
Sequence 2381 

TGGGA GCTCCA CCGCGGGTGGGGNCCCGGGGGGNCCCCNCCCGGGNNNNAAANAAANGGG 
GGGNNI I I I I I NNNNNAAAAACCCCCTTTTTTNNNNNNCCCNN I I I I I I I I NGGGAGTTG 
GGGCCC CCTTG GAA MGTTT GGGAACCCCCCCCCMNTTTTTCCCTTTAATTGGGAAAAT 
TT ITGGG m i t I AAM TT TTTAA AAAGGGGGTTNGGGCCCCCCAAAAAAAAAAAAAAAAA 
AATTTTTAAAATTTTAAAATTTTAAAAAAAGGGGCCTTTTTGGGGGN I ITTTTTTTTTT1 
TTTGGGGAAAAAAAA 
Sequence 2382 

A AACT TTTTATTAAMTGCTTANGANACAGATTGAC 

CTTTTCCTGAAGNAGGACATATATGCACTCTGATAAAACTGCATTACAGCCTGCAGGACA

CCTTGGGCCAGCTTGGTTTTACTCTAGATTTCACTGGCGTCCCACCCCACTTCTTCCACC

CCACTTTTTCCTTCACCMCATGCAAGTTCTTTCCTTCCCTGCCAGCCAGATAGATAGAC 

ACGGACGCGTGGGTCNAAGCTTGTACCTGCCCGGGCGGNCGCTCTAGAACTAGAGGATCC 
C 

Sequence 2383 

AGGTGCGGCCGGCCGCCCGGGCAGGTACAAGCTTCGACCCACGCGTCCGCACAAACATTT 
TTTCAATGTAGCAAAATCAAACTTAAAAAAAAAAAAAGAAGAAAAGAAGAAGATGCCGAC 
AGTGCGGAGTCTAGCCTTTTGTAAACTTCATATTGCACACTAGGACTATAAGCCATTGCT
AGCTCATTTTGMTTTTM CGTGTM TTTTTGTTTTATTTTCTTTCTGTGGGGAAACAAT 
GCTTGATCCACCMTGCTCTTTTTAATGTTTTATAACTATGTATGTGTATATATATAAAT 
ATCAAATAATATGTATGCACATATGTGTGTGTATATATCTATATGTATATACATA 
Sequence 2384 

TGGAGCTCCACCGCGGTGGCCCCGGGCAGGTCGAGCGACCGCA C I 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 

TTTACCTGAAMTGCTTATTCTAGCTTCACATTTGATTGTTTGGCTAAGAAGAAAATTAT 

TTATTAGACTTAATTTTCCTCACGAGTTTAAAGATTGCTTCAGATCTTAAACTTCTAATG 

AGGAAAGCTGAGAAGTCCAATGCCATTCTGATTCTTGCAACTTACAAGTAGTCI 1 1 1 1 1 1 

GTCTAGACGCTTTCAGGACCTTCTTTTTTCCTCAGTCAGTGTATCCAAACCTTCACAGTG

ATATCTTTTGGGTACCTCGGCCGCTCTAGAACTAGTGGGATCCCCCGGGCTGCAGG 

Sequence 2385 

AGGTACTCATAGCATTTTTCCCCACATAGTCTrTCAAMTCTGCATTTATTTCAAAATCT 

GACCTTCATAACTCAACTATACATGAATTGCTGGTATTGTCTTTTAACTTGGCCAMGAA 

CAGTTTTCTGAGTTAGCTATTATTTCCACCATAAAATTGGGGGTAAGATTTGGCAAAAAA 

AAAAAAAAAAAAAGTGCGGCCGCTCTAGAACTAGTG 

Sequence 2386 

AGGTACTACCCATTTTAATTACACAGTAAACAGAAGCACGGGTAAGTGACATACTCATAC 
TTTAAGCAATAAGAATTAGAAGAAACCATAGAAGCTTGGGGCCTTCTCTCTAGCTCTAAC 
CCAAAG AAA ATG AATTTTA I 1 1 1 1 1 1 1 1 II I AAAAGAAAACAGCATCAATCACTTAAGAT 
TTTCTTCCTCI I I I 1 1 1 1 1 I I 1 1 1 1 1 1 ACACTTGCTTATTAGTATAGNATCTCGTTCCAA 
AGCCCGGACGCGTGGGTCGAAGCTTGTACACCTGCCCG 
Sequence 2387 

AATTGGAGCCTCCACCCGCGGTGGCGGCCGCCCGGGCAGGTCAAGCTTCGACCCACGCGT

CCGTCTTATnTTTACTCTTAGGCAATGCGGAATATCAATCCCAGCACAGCAAATTCTCC 

AAAATGTCAGGTAGGCTCTTATCTGATGTTTTAGCACTGGAAAAAAAAAAAAAAAAAAAA 

AAGTGCGGGCCGACCT 

Sequence 2388 

CGGCCGAGGTCTTCGACCCACGCGTCCGAAATGACTGGTTATTTAGAAAAGAAGGATGTT 

TAGAATAAAACAGGAAGTCCAAACATGTCATAAGTGGTTTGTGTATGTCATAATAAGGGA 

TTATAAAAAGAGGATTTATGTGAAAAAAATTTTATGTGATCAAGTTGTCTACAATTACAA 

GGAMTTATTTATAATAGACGTTCTAGAGATCTATATTAAAAAAAAAAAAAAAAAAAAAA 

CCTGCCCGGG 

Sequence 2389 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCAAGCTTCGACCC 

ACGCGTCCGTCTGAACCTACACCCCGTCTCTTCACGGTTTAGACTTACTAAAATAAATAC

MGGTGATTTTCATCTTCAGGTAGAGTGAAGCCTTTTAATTAAGGCGTCACAGGTGCAGC 

TATTCTACCTTAATGAMTGGGTAGTGATTTTCCCACCATTATTTATTTCGGTGATAATA 

TGCTGCATATTCAAGTCTCTTGTAGTTATTTTCACCCAAAGTAGTTGACAATTTGATGCT 

TCTGGTGATGTTTATGGCTTCATTTTATGTMTTTTTTAAGTAAGTTCCACTAGAAACAG 

TTCATCTTATACCTTCAAAA 

Sequence 2390 

AGGTACTATMGMCACATTAATTCMTGGAAATACACTTTGCTAATATTTTAATGGTAT 

AGATCTGCTMTGAATTCTCTTAAAMCATACTGTATrCTGTTGCTGTGTGTTTCATTTT 

MATTGAGCATTAA GGGAA TGCAGCATTTAAATCAGAACTCTGCCAATGCTTTTATCTAG 

AGGCGTGTTTGCCATTTTTGTCTTATATGAAATTTCTGNCCCAAGAAAGGCAGGATTACA

TCI 1 1 1 1 I I I 1 1 I I IAGCAGTTTGAGTTGGNGTAGGGGTATTCTTGGGTTATCAGAATAC 

TCATATAGCTTTGGGATTTTTGA 

Sequence 2391 

GTTCGTATTTCTTATTCTACAACAAGGGTCAGCCTACAGGCAAAACACATCCCATTGTCA 
I GTAMTAMGTTGTATTGGMCATGGCCACTCTCATTTGNTTTCTATTATTTA 
TGGCTGCTTTCACTTACAACCTGAGTGGNTGCCACAGAAACTGTATGGGCCTGCAAAGTC
TAAMTATTTACTATGTAGCTTTTTCTTTTCTTTTTGGAGACAGTNTGCCACTCT 
CCAGGCTGGGAGTGCGGTGGTGTGATCATGGGCTCATTGCAGCCTCAAAACTCCTGGGCT 
NAAGCAATCCTCCCGCCTCGGTCTCCCAAGTAGTTGGGACTACAGGCATGA 
Sequence 2392 

CCGGGCAG GTACCCAGTAATCACATAAATTCTGCMTCATCTGTTTATTTAGCTTAACTG 
I I I 1 1 1 1 I I ATTTGNTGAAGNTGTTCGTTGTTATTTCAGNCTTTTTCTTATTGGG 
Sequence 2393 

ACCGCGGTGGCGGNCGCCCGGNCAGGTCTAGCTTAGTCGACCCACGCGTCCGGGCTTAAC 
TMTATTTGN^GNGTGCTACTAACAGGATTATAATAAATTTGTCATCAGTGAAAAAAAA 
AAAAAAAAAAAAAAGTGCGGNCCGNTCTANNAACTAGTGGGAT 
Sequence 2394 

CCGGGCAGGTCAAGCTTCGACCCACGCGTCCGATTTCAGGTTGACTTTTCTCACCTTTAA 
CCTCTTTATATAGCACAGTGCAATCTGGCCCTACTGCCACTTCATCTGGGTTATCTGTAG 
CTTGAGTTGTAAAAAAAAAAAAAAAAAGT 
Sequence 2395 

AGGTACTGTTGTCTCATGTMTGCTAAAACTGAMTGGTCCGTGTTTGCATTGTTAAAAA 

TGATGTGTGAAATAGAATGAGTGCTATGGTGTTGAAAACTGCAGTGTCCGTTATGAGTGC 

CA AAMT CTGTCTTGMGGCAGCTACACTTTGMGTGGTCTTTGMTACTTTTAATAAAT 

TTATTTTGATAAATAATATTGAAAAAAAAAAAAAAAAAAGTGCGGCCTCGAGCGGCCGCC 

CGGGCAGGT ACAGGC ACCTATAGMTTTAAAGGGGAGATTTCTTTATTTTGTATTCAATG 

TATTMTAAGATTTTTAAAACATATTTTGGAGAAATTGCTAAT 

Sequence 2396 

AGGTCCTAGCTTGAGTCGACCCACGCGTCCGATTTTTGCCTCCAGACTACAGATCAGAAA 
ACTGAGACTCAGMTGTTTCMTTCCTTGTTTAAGATCACAAAACTAGTTTGAGGTATAA 
TGGAAACTGAAAAAAAAAANAAAAAAANGT 
Sequence 2397 

AGGTCAAGCTTCGACCCACGCGTNCGGAAGTNTTCATTCTCCCT C1 iTTTTTTTTTTTTi 

TTAGCAATTCMGNCATGTTTTTGNCTACAAGTTTTTCCAGTATTGTATAGATAAATAAT 

AATTTACNAGGCTGCCTTTGAGTATACTTAGACAAGAGACCTGCCCGGGCGGCCGNTCTA 

GNACTNGGTGGANCCCCCGGGCTGCCAGGNATTTCAATATNAAGNCTTATTNGTTACCGN 

GCGACCTACGAGGGGGGGG 

Sequence 2398 

CGGCCGCCCGGGCAGGTATCAAGTGCTTGGATTCTGAACTGNCAAAAGAAAACTGCACTT 
GCCCTCTGAAGTAAAAACCGAAATGAGNTTTCTTAGGCAAATGTATTCATCAGCCCAGAT 
AAAAAAAAAAACCANNTAATGNGAGCCNTTAGTCACTGCT 
Sequence 2399 

AGGTACAAGCTTCGACCCACGCGTCCGATACGACTCACTATAGGGATCTACCTGCTTGAG 

TCGACCCACGCGTCCGAACACATACAAAAGAATTAAACCCACAAGCTGCCTCTGACAGCA 

GCCTGTGAGGGAGTGCAGAACACCTGGCCGGGTCACCCTGTGACCCTCTCACTTTGGTTG 

GAACTTTAGGGGGTGGGAGGGGGCGTTGGATTTAAAAATGCCAAAACTTACCTATAAATT 

AAGAAGAG'I 1111 ATTACA 

Sequence 2400 

AGG TGGCCG CAC I II 11 I I I I I I I I II I 11 I I I AAAGTTTGGGGTCTGTCAGGAGACAGA 
GGCi i i i i iGAATTCACTGTGAAGAGAAGAACCCGAACCTTAAGACGGCAGATCCCTGAG 
AGTCTTTCTGGCTGGTTTGAGCGGACGCGTGGGTCGACACCTGCCCG 
Sequence 2401 

AGGTACTTCAA AGTTAT TTGCACATACACTTGTTTACTTTGNATGTTTTGCAGGATTAM 
CTTTGTATAATCTTTTTGCAAAAI I I I I I I I I CAGTATGCAANGCTTGCAAGATGAAAAT 
TAAAACC 
Sequence 2402 
ACCGCGGTGGCGGCCGAGGTACAAGCTTCNNNCNACTCGTCCGAGCTTGAGTCGACCCAC
GCGTCCGCGTT1S1ATGATTTTTAAMCACACTTGAMTAAAMTGATTGMCTAMTTTTG 
GTCCGGNGACATCATTNTGCACTGCATAGCCCA 
Sequence 2403 

CCGGGCAGGTACATCTCTATCAGAGCTTTTGGGTGACCAGGCACACTGTCAATGAGCAGT 
AATA CTTTGAAAG GAATCTTTGGG G I I 1 1 1 1 1 1 GGTTTGGTTTTTGTTTTGTTGTTATTT 
TTTTGTTNGTTTGATTTTTGTTnTGNTTTTTTA^ 

MTACTCAGTAMCCATGCTGTTAACAGATAAGCTGNCATCCAGACTTTGNTGTTCCATT 
TCTAGAGCACAGAGCAGATTTAGCACMTTCTTAANGCTTTATGATtTTTCANAATGGTA 
AANTGAGCACTGGCTTCAA 
Sequence 2404 

Nm i 1 1 iGGGGGGGGGAAMCCCCNNACCCCCNCCANANAGNCANGNMNMGGGNTTTT 

TNACANNNNAGGGGGGGGCANCCCCCCAAAAAAACNNNCNANGCANGAAGNANANNNAAA 

CAAAACANGNANGNAAGNNNNCACGNGCTTTTTTTAAAANATTTTTTTNNNNGGGGGGGG 

GGCCCCCCCAAAAAAAANGAGGACGNNGGAGCCCCCCC 

Sequence 2405 

CTNCTATAG GGCCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGG ) 1 1 I 1 11 1 1 
I II I I I 1 1 I CTTTCTGTTCCTTGGACTAG ATAATCTG AAATCAACTGTCTTCAGTTTTGC 
AGACTC TTGTG CCAGCTAAAATGTTCTGTTGAGCCCCAGMGCTAATTTTCTTTTCAGTT 
ATTATGATTTTCAGCTTTAGMTTTATTTTTTTM 

TCTCCATTGGGNGAGACATTCACATACTTTCTTCCAG I I I I I I I I AGACGTAGTTTCCTT 
GAGTNCTTNGAGCATATTTAAAATAGTAGATTTAMGGATTTGGCTAGTTACTCCACCNT 
CTGAGNTTCCTCAGGGAAA 
Sequence 2406 

CCGGGCAGGTCAAGTCGACCCACGCGTCCGCTTGTTTTGCTCTATCCCATAGGAGTTGGT 
ATAGTTGTGTTTCCAATATCATTTACTAAMGMNANTNTCCTTTTTATTTCTTCATTGA 
CCAACT 
Sequence 2407 

AC I I I I I I I I I I I I I I I I I I I I AGTCTTTCTACTTTN AGGGTTTATTN ACTGGCAGTTTT 

C AGGT CTGNAAATACAAAAAGGAACATAGAGATNGGGTNTCTGCCNTTGAGGNGCTCCNA 

GTTTTGGNGAGGGACACMCACNTTTCAGGGGACAACCGTGGTACCTNGGCCGGTNTAAA 

ACTAGTnnnnnnnnnnnnnnnnnnnnnnnnTCNATATCAAGCTTATCCGATACCGTTCTA 

CCTNTGAGGGGGGGCCCGGCACCCA 

Sequence 2408 

AGGTCAAGCTTCGACCCACGCGTCCGGCCTGGGACAGATCCTTAGTCTTTCCTTGACTTT 

TATAGACCCAGAGGTGAMGGCCAAATGTTTTGTAGMTGTCCTTAMCTTGGGTTTATC 

TGAGGTTTCCTTGTGATTGMTTCAGGTTAGACATCTTTGATGGGACTGTCATAGAACTG 

ATGCTGTGTTCTMTTGCATCTTATCAGGTGACTTATGATTTCTGTTTGNCCCATTATTG 

ATGCTGNTACTTTAGATCACTTGATTAAGGNGGTGTCTCGCCTGGCTTCTCCAGTGTGAA 

ATTTCCI I I I ICTTCT 

Sequence 2409 

AGGTAATTTTTCTGCTTGCCCAAGAAACAAAGCTTCTGTGGAACCATGGAAGAAGATGAA 

AATGAGACTGGCAAAGAACAAATGCTGAATCTGAAGAAGATTTGGGCAAATAATCTGCAT 

ACTTTTMTTGGGAATMGATGGMMTATGMTGCTAMTCAAATTTTTTAAAAAAATA 

CACCCGGACGCGTGACCTGCCCGGGCGGCCGGCCGCA CI 1 I 1 1 1 1 1 1 I I 1 1 1 I 1 1 ICAAT 

ATTATTTATCAAMTAMTTTATTAAMGTATTCAAAGACCACTTTCAAAGTGTAGCCTG 

CCTTCAAGACAGATTTTTGGCACTC 

Sequence 2410 

ACI I I 1 1 I I I | | | | | | | | I I I I I I I I I I I I I I NGGNGGGGAGGAAGGCAAGGCACTGGGA 

AAGACTNACAACANATATTGACCCTAGTCGTAAAGAAATCCATAATTGCCAGTAACACGA 

CNTATTTMGMCAGGAAAAGACNGACMGGAAAAGAGGGACTTTTTTTAAAAMCATTA 
CTAAAGAATTNGGACATAANAAGAGTGAAATTGACAAGGAAAGGAGGAGGGGGA
Sequence 2411

GAGCTCCACCGCGGTGGCTGGCCGCCCGGGCAGGTACATAATATACAGAGGTATAATCTG 

TAACATCAATAATGTAAAGTGGGGAAGGGCAAGGTGGAAAAGGAGTAGAATGCTTGTATG 

TGACTAAAATTATGTTGGTATCAGTTTAAMTATATTATTATMCTTTAGAATGCTATAC 

CCATTCCCACAGTAATTCCCATAGNAACCAAAAAGAAAATATCTTGTNGNATACACACAA 

AAGAAAATCAGAAGTAGATGCAAACTTGTCACTACAGGAAAAAAAAAGCTATCAAAATAG 

AAAACAATNATGGNGAAAAATAAAGACA 

Sequence 2412 

CCGGGC AGGTG CCGCAC I I I I I I I I I I 1 1 I I I I I < I I I I 1 1 1 I I 1 1 ATGGAAAAAATATT 
GTGATTATTTTTMTAGA TTTAC GGTATAAMGAAAMCTTTTATGATNCMTTTCGACA 
GACTACTTTCATAAAAMTTTTACTNTACATACMTGTATTGCAAATTTTTNGGCAAC 
Sequence 2413

NGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGATTTT 

GTGAGMTGATTGTTCTTTCACTTGGGCTGTTTGAGAGCATAATTATGGTAATCATGAGA 

TTMTGT TTCA TGATTTCTACCTCCAAAGTGTGMGACAAGTNAMCAATGNTTCTAAAT 

TGTCTTATTTTGTTGGCGGAGMGATTACMTGGGCTATTAGTGCTACATTTGGTCAAAT 

GTAATCACTTAAATAGCTTCTTGTCACCTTAACTAAAGCAGAATAAAAAAACCCTGCCCC 

GGGGCGGCCGGNCCCGCCCCGGGCAGGTACCATTCCCGACGTTTGCAATGGTGGGAGTTG 

GCAGGTGTG 

Sequence 2414 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAGCTTCGACCCACGCG

TCCGCTTCTCTTAGAAT TTTGG GGAMTTGATAGTCCAGTGACTCCTACCCACTTTTGGG 

TGAAGGACGATTTGGAATTTTGAAGTGTGGGGAGACAGGCCTGTGAAGTCCGAANGACTC 

ACTTGGGGT 

Sequence 2415 

CCGGGCAGGTCMGCTTCGACCCACGCGTCCGAAAAAAAAAAAAAAAAGTTTGTTGGAGA 

CCGACTGGGGGTGAGGGGCTGGCAGCAGGAGACAGATAACANGTTCNCTCAGAATGCAGA 

GT 

Sequence 2416 

ANGI I I I I I I I I I I I I I I ICANTGCTTCCCAAAGCTGCGGACATAAGGGTAGCTGANCTG 

GACTCTGNCCTTGCTGAAGACTTGGAGATGTCTGAAGTCATAACTGGGGNGACCTCCTTG 

GTCCAAAATGTGCTGCCCCCTATGAATTCCACATCAGGAGTTGTAGGAGATGAGGTTAGA 

GGGATGCTTGGGCTCATCTGGCTTCCTTCAGCATAGCCCGGACGCGTGGGTCGAAGCTTG 

ACCTGCCCGGGCGGCCGGCCGCAC I U I I I I I I I I I I I I I I I 1 1 I I 

Sequence 2417 

CCGGGCAGGTTTTCTTATGAGTGGGAGGTGACTGATCGTGGAGGTGGATTTCTTATGAGT 

GGGAGGTGACTGATCGTGGAGGTGGATTTCTTATGAGTGGGAGGTGACTGATCGTGGAGG 

TGGATTTCTTATGATTGGCTTATCACCATCCCTCCTTGGTGCTGTTTTTGCAACAGTGAG 

TGATTTCTTGTGAGATCCGGTTGTTTAAATCCAGAGGCACCTNCCCCTACCCTCTAGCTC 

CCATTCCTGCCATGTMGACACCTGCTCCCCCTTTTTCTTACCCCATGATTGGAAGCTTT 

CTGAGGCCTCCCCAGAAGCTGATGCCAGCCCTATGCTTNCT 

Sequence 2418 

CCGGGCAGGTCTTCGACCCACGCGTCCGCACATTTTGATGGTCAGTCAATAACTTAAGCA 

GNTACCAAAATACTAGGTATCCAAGGAGCGAGAGGTGGGCCGAGCATAAGAAACACATTT 

CTNATGGCACAGCTCTGCCAAAGCCCTGCAGMTCATTTACACATAGGTCTTTGGTTAGT 

AGCCCCTGGCACAGAANTCTGATCTTAAACAAATATTGTCTATAATCAAGTAGAGCAATG 

CAAT TAAAAAMAMAMGCACAGGNTTTTGGGGCCATNGCTGAAATCCCAGCCTTGCTA 

TTTTGCTTGGCTGNGTGACCGGGGTTTCCT 

Sequence 2419 

AGGTGTACAAGCTTCGACCCACGCGTCCGGGATGAGTTTGTATGTGTAAAGTGCTTGAAA 






CAGTGCCTGCCACATACTAAGTGTTGGATAAGTGTTTGTTATTAAAAAAAAAAAAAAAAA 

AAAGTTGCGGCNCGGCCGCCCGGNCAGGTACTGGCGTGGATTCTGCATANTGGNGATCAC 

ACGTTCCACCTCATCCTCAGTGAGTTCTCCCGCCCTCTTGGAGAGGTCAATGTATGCTTT 

CNCTCAACACCACATGAGCATATCTTCGGCCCACACCCTTTAATGGGCAGTGATGGCAAA 

GGCTATTTTCCGCCG 

Sequence 2420 

AGGTATTCAACAAGGGCCCTGAGAGAGGGACAGGCAGCCCCTGTGAATCTTGCTGTTCAG 

CANAGACAGGANTCAGCACGTGTGAGGGCAGCAGGGAAGTCTTCCTGGAGGAGTGAGACC 

TGGCGATGAGGAGGCACGGCAGGGAGGTGGAACAGGCAGGAGAGACTCTTCANGAATTGA 

GGAGATAGAATAGAGGACACTAAAGCCTTAGAGAGGCCAGGGGTGGTGGCTTGGCAGGAN 

CATCGCTTGAGGCTAGGAGTTTAAAAGCAAGCCTGGGCAACATATCCGAGACCCCCATCT 

CTAAACACAAAAAATAAAAAA 

Sequence 2421 

CCGGGCAGGTACCCAGTMTCACATAMTTCTGCMTCATCTGTTTATTTAGCTTAACTG 
I I I 1 1 1 1 I ATTNGTTGAAGNTGTTGTTGTNATTTCAGTCTTTTTTCTTATTGGGGTTGAC 
CAGACTTGGGTAAMTCTGTAAGAAAAGTNCCATMTTATGGGGGGAAGATTTCCTCTTG 
AATTGGCTTAAATNCCTGTTAGCTGAAAAAAAAAAAAAAAAAA 
Sequence 2422 

CCGGGCAGGTCTTNGACCCACGCGTCCGAGCAAAATTCAACTAAAAATACAATCTGGATT

CCATAGCCMGGGT7TTATTTACMTNTCCTAGTAGGMGTCTTTATTTTAGCTTTCAAT 

GTGTTGAACTTATAAGGAMTTTAACGTATACATGAGTATTATATTTATGGAATGTGAAG 

ATATACAGAATGGAAATGGAAAATAATGTTAATTCGTATTGACTTTGAGGAATCTTANAA 

TCATGTAGCCCTGTTGCAACAAGAAATAGGGAACTTCTGAA 

Sequence 2423 

AGGTCAAGCTTCGACCCACGCGTCCGGTTTG I I ITTTTCl I ACGGCAACTCAAAGCAAAG 

AGCTGGAGGAGCCAGCCATTATAATTGCTTACTCTCATCGCTTAGCGCCCCAGGTGGGAT 

GTGTTTCCAAAACACATTTTTGTAmATMGGAMTGTAGTTAGGATTM 

CCTMTTAGMCTCACATTTTGGTTAMTCCTCAATTTCATTAAAAAAAAAAAAAAAAAA 

AAGTGCGGCCGGCCGCTCGACCTCGGC 

Sequence 2424 

AGGTGCTTCGACCCACGCGTCCGACTTAATTGAGAAGGTGGAATCCTCCTATCCCTGAAC 

TCGGGGGAATGGAATCTCGCTGATCTTCCAGGACTAGCTCCCTGATCATTCCAGCCCCTC 

TGAACAACAGGGCCCCTGGAGATAGMGTAGTCCTATTTCACCCCAACTACAACATTAAT 

GGGAAAAMGAAGCAGGMTTCCTGMTTTTATGACTATGACGTTGCCCTGATCAAGCTC 

AAGAATAAGCTGAAATATGGCCAGACTATCAGGCCCATTTGTCTCCCCTGCACCGAGGGA 

ACAACTCGAGCTTTGAGGCTTCCTCCAACTACCACTTGCCAGCA 

Sequence 2425 

AGGTACAACTTCTCTTTTTTGGAGTTTTACTTGCTTCTATCMGAMGACMTTTTCCTG 

TTT CCATGACGTT GGAGTTTGGCTCACTTCCAACAGGGAAAAGGAGTG I I H I » I I I GTT 

TTGTTTTGTTTTTCCTGCTTCTTGAATGGTAGAGAGCAGTCTATAGCCAGAGACTCGTCC 

CTAGGTAAGTAACTGMTTGGGGTTTGTCTTGGTTAAAGTTAAGATTAACGACCAACTGG 

TCTTAATTTCTCCTTACCATTAGAGCACTCAGTTATCATATAAATTGCGCCATTGTTTGT 

TTTGCCTAA 

Sequence 2426 

AGGTCAACCGCCAGGGTCAAACGGAACACAACCCACTCTCAGGAAGACATCCCTAACACA 

AATCCAGGGACTTTGTTTCTTMCCTTAAMTTGGAMCACTTCTTGCTACCAGGGATGG 

GGGGTGGGGCTCAGCAGTTTGGGGAAACGGAGTGGGAGTCTTTTGCTGAACCGGACGCGT 

GGGTCGAAGCTGGACCTGCCCGGGCGGCCGGCCGCAC i I I H I I I I I I I I I I I I I I I GAG 

AGTGGCTATTTCATT AANA TTTAATAG I MINI I GGACTAAGTAGTGGAAAAACTTTTA 

TACTTAACTGAGACATTTTGTCAAGGCTAAAAAAAAGT 

Sequence 2427 
AGGTACATCCCACTATTTCTTTTCTTTTAGCTAGAAAGGTATAACGTTAAAACCCTTTTA
CCAAATAAAATGATTTTATTTAGAAAATGCCGGGCACTAAAAAAAAAAAAAAAAAAAAAA 
AAGTAC 
Sequence 2428 

CCGCGGTGGCGK3CCGAGGTACCTATTTTGTATATGTGAGATGTTTAAATAAATTGTGAAA 
AAAATGAAATAAAGCATGTTTGGTTTTCCAAAAAAAAAAAAAAAAAAAAGTGCGGCCGGC 
CGCTACCTGCCCGGGCT 
Sequence 2429 

AGGACAATGCTGTAGATAATGCAGCCCATGCAATACACCCAAGAACACTAGAGTCCTACA 

CCCAAGTACAATATGATAAAGCAGCCCTCTGCAAGTGGTGCTGGATACCACTAAGAAGTC 

TACTGCAGCCATGTTGGTTATGATTTTCCATGCAGAAGGGTACAGTTAGTTCATATTTAT 

GTATTGCACATAATCATGCTATTCAGCATTGATGCTATATTGTATTATGTAAATAATAAA 

AGCCATGTACAGAGGGAAAAAAAAAAAAAAAAAAAAA 

Sequence 2430 

CCGCGGTGGCGGCCGAGGTATTCGACCCACGCGTCCGTAGTTTTTATCTTTGACCAACCG 
AACATGACCAAAAACCAAAAGTGCATTCAACCTTACCAAAAAAAAAAAAAAAAAAAAAAA 
AGTGCGGCCGhJTCTAGAACTAGTTGGATCCCCCGGGCTGGAGGAATTC 
Sequence 2431 

CTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTGAGTCGACCCACGCG 
TCCGAGTCAAGAGAACAGCACATTAGTTCCAGAAGAAAGATGGAAATTCTGAAAACTGAA 
TGTCAAGAAAAGGAGTCAAGAACAATTCACAGTATGAGAAGAAAAATGGAAAAAAAAACT 
TTATTTAAAAAAGAAAAMGTCCAGATTGTAGTTATACTTTTGCTTGTTTTTCAGTTTCC 

Sequence 2432 

CTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCTGTTCGTATTTCTTATTCT 

ACMCAAGGGTCAGCCTACAGGCAAMCACATCXJCATTGTCATTTTTTTGTAAATAAAGT 

TGTATTGGAACATGGCCACTCTCATTTGTTTTCTATTATTTATGGCTGCTTTCACTTACA 

ACCTGAGTGGTTGCCGCAGAAACTGTATGGCCTGCAAAGTCTAAAATATTTACTATGTAG

CTTTTTCTTTTCTTTTTGGAGACAGTGTGCCACTCTATTGCCCAGGCTGGAGTGCGGTGG 

TGTGATCATGGCTCATTGCAGCCTCAAACTCCTGGGCTCAAGCAATNCTCCCGCCTTGGT 

CTNCCAAGTAGTTNGGGACTACAGGGCATGAGCCCNCCATACCCCGGNTAATTTT 

Sequence 2433 

CCGCGGTGGCGGCCGAGGTCMGCTTCGACCCACGCGTCCGGAAAATGTmCTTTTTTT 
AATTTAAGGTTT AAAATTCCTTTGCCAAATCAAAAAAAAAAAAAAAAAAAAAAAAACG NT 
NGCNTGCCNGGCCGGCCGGCCGNNCTTA C 1 1 I 1 1 1 1 I I 1 1 1 1 1 
Sequence 2434 

CTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCTTGAGTCGACCCA 

CGCGTCCGATAAAACACCGCCTTCAGTGTAAAAATTTTACATGGTATCTGAACAACATTT 

ATCCAGAGGTGTATGTGCCAGACCTTAATCCTGTTATATCTGGATACGTGAGTATTTTCC

TGTTCCI I I I 1 1 AATTAATATA CTCCT TGCCACCAACCTTTATGTGGTTCTAAGAAAATT 

GCTGAA ATACT TTCTTTATTGCTTTTGAGATTTTACATTATMTCATTAACATTACTTTC 

TAAATTATTTTAAAAATATATAAATCACATGGATTAAAAAACTTTTCATAACTTGAAAAT 

TTTCCTTTAGATTAAAAGCGTTGGTCAGCCTCTATGTCTGGGATGTTGGAGAAAACAATC 

AAGGAGGCAAACCATTAATTATGTATACATGTCATGGACTTGGGGGAAACCA 

Sequence 2435 

GGCGAATTGGAGCTCCC CGCG GTGGCGGCCGAGGTCI 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I AATG 

GTTMGCTACTCAMTTGTTTTCAGAGCCAGAGTAAAAACCAAAAAAAATACTCATCAAT 

ATCAATTGCCAAATTCAGTCTGAAAACATTTCACACACAGCTTACCCAAGTATAAAGCTG 

CTGGGG GGACTTC TGAMANTTGGCAACATTCATTNGGGGCTTNGAAATGCTTTACAAGG 

GGAAGGNTTTTTTAANGCAGGGCTTACIvrrGGGTTTTCCCCCMGGCCCCTGGNNANGTTT 

NCCNAAAATNAAAGGGGGGGGCCCCC.CCTNGNNGNGGGTTTTTTCCCCNNNCCNCCNNAN 
CATNANGGGGNGNGCCCCNCNCNNTT^
TNTGAAAAMGGGTTTTTTTAAMCC 

TTmAAAAAGGG GGGGGG GNNGNTGGGNNTTmNTGGNGGGAAAANNTNTNACACCCC 
NNTAAAAANGGGGTTTTTTNCCNNA 1 1 I I II I I I I NCNNCCCCAAAANANGGGGGGGGGG 
GGGGGGG 
Sequence 2436 

CTNCTATAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTGTCATTTTATTTTTAC 
GTTGTTACGATATGGGGAGTAGTGTGATTGAGGTGGAGTAGA7TAGGCGTAGGTAGAAGT 
AGAGGTTAAGGAGGGT 
Sequence 2437 

TNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTtCGACCCACGC 
GTCCGGCCTGGGACAGATCCTTAGTCTTTCCTTGACTTTTATGACCCAGAGGTGAAAGGC
CAMTGTTTTGTAGMTGTCCTTAAACTTGGGTTTATCTGAGGTTTCCTTGTGATTGAAT 
TCAGGTTAGACATCTTTGATGGGACTGTCATAGAACTGATGCTGTGTTCTAATTGCATCT 
TATCAGGTGACnTATGATTTCTGTTTGTCCCATTATTGATGCTGTTACrr^^ 
ATTAAGGTGGTGTCTGCCTGGCTTCTCCAGTGTGAAATTTCTTTTCTTCTTTGTAATATT 
TTGTGGGGGAAGTAACTTTAAGACTATGTAACTGTTTC A I I id 1 1 I CTTTTCTTTTGA 
Sequence 2438 

TACTATAGGGCNATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGTACAAGCTTCGACCCAC 

GCGTCX)GAMGATTCTTGGCTGAGCATGGTGGCTCATGTCTGTCATCTCAGCAATTTGGG 

AGGCAAGGGCAGGAGGATAGTTTGAGCCCATGAGTTTGAGACCAGCCTGGACAACATAGT 

GAGACCCCATCTCAGCAAAAAAAAAAAAAAAAAAAAAAAAAAAAANGTGCGGCCGGCCGC 

CCGGGCAGGTACATGACTATATCAGGATTTCAAATTGAGGAAACCATTGACCGCGAGACT 

TNTGGCAATTTANAGCAACT 

Sequence 2439 

CCGCGGTGGCGGCCGAGGTACCTATTAAGCTCATGAACCATAGAGGTATCTCGGTGGCCC 

CTCATTACCATCTGCTGTTCT1TCAGCTGTTTAGCTACATCTTTGGCTGAGGAACCAGAC 

ACTTCAATCCATGTCTTAGAGAAGAATGCACATGACCCCAACATGAAGATGATATAAACA 

ACGACATGGACAG 

Sequence 2440 

TNCTATAGG GCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGGCCGCA C 1 I 1 1 I I 1 1 I I 
MINIMI GAAMTAGTTTATTATTTTATTGTTTTTTGTTTATACATGTTMGTTTCM 
CTTTCMTAATAAMTTCMTAAATTTGATTCCTTMTCATAAAAACTTGCTTTACA 
Sequence 2441 

CTATAGGGCGNTTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCAAGCTTCGACCC 
ACGCGTCCGATAATTTATACTAAATTTAGTAAMTGGACTTCTTATTCAAAGCATCAATA 
ATTAAAAGAATTATTTTAAAAAAAAAAAAAAAAAGTGACCT 
Sequence 2442 

GGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGACAAGCTTCGACCCACGCGTCCGCTT 

AATAGCAAAGGATAATTGAAATCCCAAACTTACAAGGTTTTCAACAAAAGTGAAGTTTGC

TTAAAGTTMCAGTGTMCATGTATTATGGTAACTTCTAATCTTGTGGCCTTAGACAGTC 

TAGTCCAAAGGCATAMGMAGTTTGCTTTAAAAAAAAAAAAAAGGAATGGTTATCTTCA 

AAAAAAAAAAAMGTGGGGGGAGACAGMTTTATGTAAAAAGAGTGTTATATGGTAAATT 

CTTGTCCTGAAATAAACTAACTGGTGTTTAAAGAAAAAAAAAAAAAAAAAANGT 

Sequence 2443 

CTNCTA TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCAC I M I I I I II I II I M I I 
mmi I ACCTGAAM TGCTT ATTCTAGCTTCACATTTGATTGTTTGGCTAAGAAGAAAAT 
TATTTATTAGACTTMTTTTCCTCACGAGTT^ 

ATGA GGGAMGCT GAGM GTCCAA TGCCA TTCTGATTTTGGCMCTTACMGTAGTCTTr 
TTTTNNNTAGACCCTTT TCAGG ACC I II M I I II I I CCTAAGTCAGGGGGTTTNCAAAAC 
CTTTCNAAAGGGGNTNTTTTTTNGGGNNNCCTGGGCCGGTTTTTAAAAANTAGGGGGGAN 
CCCCCCCGGGGNTGGGGGGGAATTTGAT^NAAAGTTTTNTNGATCCCCGCCCCCCNTGG
GGGGG 

Sequence 2444 

TACTTAGG GCGAATTGGAGCTCCCCGCGGTGGCGGCCGCA C 1 I 1 I I 1 I I 1 1 1 I I 1 1 I I 1 1 

I I t I I 1 1 I GAAATGTTTGMGTTMCTCATTTTATTTCTAGGATTTGGATTTCMCATTT 

TMTTTCTTTGGMTATMGTCACTTTTTGCAAGCTAAAAAATAGAATCAAACTAAGGTG 

ATCTAAGTCCTCTAGGCATCCAGGCTGATCCTTGGAATCATGAGCAGAATGATGACATAC 

TACANGGGGCTAGCMTACCGGNTIsrTAMCTNTTAAATAATANCCCTNCATGGTTTTATT 

AGGGAACCAGCCAAA AGTCCC GNCCCTTTTAAAACTNGGGGGANCCCCCGGGCTTNNNGG 

GANTTC GATATT NAACTTTTTTNGAAACCCGCCCNCCNCNNGGGGGGGGGGCCCCGGGCC 

CCNACNTTTTTTGTTNCTTTTTAANGGGGGGGNNNNAAATTCGCCCCNCTGGGGGGGAA 

Sequence 2445 

ACTTAGGGCG AATTGGAGCTC CCCGC GGTGGCGGCCGCA CI M I I I I I I 1 1 1 1 1 I I I I II 

I I M I I I I I i i ATTGGTATTTAGTTTTTATTTCATAATCATAAACTTAACTCTGCAATCC 

AGCTAGGCATGGGAGGGAACAAGGAAAACATGGAACCCAAAGGGAACTGCAGCGAGAGCA 

CAAAAGATTCTAGGTTNNTGCGAGCAAATGGGGTNGGNGGGGNGCT1TNNTTGGNTAAAN 

AAGGGATGNNNTTGGGGGTTTAAAAAAAACCCNNGGCANNCNTTTTTNGGGTNNNCCCCN 

ANCCCCAANNNGGGGGGNNTNTTTTGTGGGGGGGGTNGNIsrTNTTTTGGGGGGACNCAAAA 

ANGNNNNTNTTGGNCCCCCNAAAAAAAANTTTAANGGCN I II I I I I II I I I 1 1 I NTAAAA 

AAAANNNAAAGTTNNNTTGCCCNANNNCNCCCCTN I 1 1 II I I I GNGGGGGGNGGNGAAAA 

AAAAAMNNNGNNGNCCCCNNNTTmNNNGGGGGGGGCCCNAANTTTTT^ 

AAAAAAATCTCCCCCC 

Sequence 2446 

CCGCGGTGGCGGCCGCCCGGGCAGGACACAGGAGGCCTTATTACTTTTAAATTATACAAC
A I I I I I I GCTTAAAATTTTTTTAATAAMTTTTTT 

TTTAMCATGTTTTAACTTTTTGACTTATTACAAACATTCTm 

AATTT ATTTC AGGACMGMTTTATCATATAACTCTTTTTATATAAATTCTGCCTCTCCC 

CTTTAi I l l l GAAGATMCCATTGTTTTTTTAAAGCAAACTTTCTTTATGTGTTT^ 

AGACTGTCTAA GGCCA CMGATTAGMGTTACCATMTACATGTTACACTGTTAACTTTT 

AGCAMCTTCACTTTTGTTGAAAACCTTATMGGTTTGGGGATTTCAATTATCCTTTGCT 

ATTAATAAGACCTTGTTCAGTCTAAATTAACTTAAA 

Sequence 2447 

NCACTACTTAGGGCNATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCAC 

GCGTCCGGGAGAATTACCCGAAAAACAACCACACTGCTTCAATCCTGGACAGGATGCAGG 

CAGATTTTAAGTGCTGTGGGGCTGCTAACTACACAGATTGGGAGAAAATCCCTTCCATGT 

CGAAGAACCGAGTCCCCGACTCCTGCTGCATTAATGTTACTGTGGGCTGTGGGATTAATT 

TCAACGAGAAGGCGATCCATMGGAGGGCTGTGTGGAGAANATTGGGGGCTTGGCTGAGG 

AAMATGTGCTGGTTGGTAGCTGCMCANCCCTTGGMTTGCTTTTGTCGAGGTTTTGGG 

AAATTGNCTTTGCCCTGCTTGCCTCGTGAAGAGTTTTAANAAGTNNCTTCCNNAGGNTNA 

NNTMGGGTNTCTTNGGTCTTTNTNANNNCCTTCNTTANTTTGGGGGGNG 

Sequence 2448 

ACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTGCCGCACTTTTTT
m i i m i i i i I I I I I I I II I AGMTTTAGCCACTTCTCAGGCCTTNTTCCCCATAATTTG 
GMCTTTCCTTTGGATTTGATCMGTTGGATAGAGTTGATCAAACCCTGATCAAGTTGGA 
TAGAGTTGATCAAACCCAATGGGAAAAAGACCAAAACAACAATAAAAAACAGACAACAAC 
AACAAAAAACAGTTAGGCAAAACAAACAATGGCACAATTTATATGATAACTGAGTGCTCT 

MTGGTAAGGAGAAATTAAGACCAGTTGGTCGTTMTNTTMCTTTAACCAAGACAAACC 
CCAA 

Sequence 2449 

TACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTTACAAGCTTCGACCCACGC 
GTCCGCGAAAGCGAAGAAGGAAGCTCCTGCCCCTCCTAAAGCTGAAGCCAAAGCGAAGGC 
TTTAAAGGCCAAGAAGGCAGTGTTGAAAGGTGTCCACAGCCACAAAAAGAAGAAGATCCG

CACGTCACCCACCTTCC 

Sequence 2450 

CCGGGCAGGTACTTTTTCTMTATACTTTTCNATTACACATGAAAGCCATGACAGGAACT 

GAGATAAGATTTCmGTTTTTTGAMCATCTTATCTACTMNAAAAT^ 

TTNACTTNAMGCTATTAGTAGTTTTATACTCNCTTMTAAGTATTAATAAATTTACATA 

CTNGACTTAGTMNCTAAGCAATTTGG^MCGThriTTNTTTATTNGAGNGANTT^ 

CANTTGGATATTTTTNCTACCTTACTATTACNTTATAMTATATTTCCCCAAATATATCN 

TTCCTCTTTAAAAANTATGTTTTGNCAACNAACCTTNAAA 

Sequence 2451 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTCAAAGTTATTGCAC 
ATACACTTGTTTACTTTGTATGTTTTG 

I I I 1 1 1 1 I I CAGTATGCAAGCTTGCAAGATGAAAATAAAACCTGTTTGCCTGATAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAATGT 

Sequence 2452 

CTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAGCTGATTACACCAACTT 

GAATGAAACGACTTCTCTTGTGAACTATCAAGGGGCCGCCAGAATCACCTCTGCAAGTAT 

TGGGGTCAGCATAGGGACTCACTCCTCCAGTACAAAGGAACCGAGGGGTGACCACCTCTG 

AGATGTCCTTGACTTTGTCATAGCCTGGGGCATATTGAGCATCTCTCTCACAGCTGCCTT 

TCTTATCCCCATTCTTGATGTAGACCGGCCGCCCGGGCAGGTGCACATACACCAAATGTC 

TGAACCTGCGGTTCCTCTCGTACTGAGCAGGATTACCATGGCAACAACACATCATCAGTA 

GGGTAAAACTAACCTGTCTCACGACGGTCTAAACCCAGCTCACGTT 

Sequence 2453 

CCGGGGTGGCGGCCGCAC I I I I I I 1 1 1 I I 1 1 I I t I 1 1 I AAGAAATAGGGTCTCACTCTGT 

CCCCCAGGCTGGAGCCATTATAGCTCACTACAGCTTCTGACTCCTAGGCTCAAGGGATCC 

TGCCACCTCAGCCTCCCTGGTAGCTGGGACTATAGGCAGGAGATCGCTTGAACCGGGAGG 

CGGAGGTTGCTGTAAGCTGAGATCGCGCCATTGCTTTCCAGCCTGGGTGCCAGAGCAAAA 

CTTTGTCTN AAAAAAAAAATTN I I I I 1 1 1 I AATTAAATAAAAAGGGNCAGGGGATTTTTT 

GGGAAAAGGTCNMAANAAAATTGNNCTTTTTGAAAAAANCCTTGGNAAAAANCCAAAAA 

AAAMTTNGGNGGGAAAAAAATNNTTTNNTTNNGGGANAAAAAANAAAANAAAAAAANNT 

ANGGGGGGGGI I I I I I I I INGGGGNGNGGGGGGGGGGNAAACNCAANCCCCCCCCNNTTT 

ANAAAAAAAAAGGGNGCCCCCCCCCCCNNGNGGNAAANGNGNTNTTNNTA A I I I I i I I I I 

TTTNCCCCCCCCCCCCCCNGGGGGGGGGGGN 

Sequence 2454 

GAGACACAGTCTCACTCTTGCCCAGGTTGGTCTAAAACTCCTGGGCTCAAGCAATCCTCC 

C GCTTT CAGCCTCCCAMGTGCTGGGGTTACAGCCGTGTGCCACTGTGTCTGGCCCTTTT 

CTTTTTCATAGGAGAAGGGTTGTTGACTCCCAGGAAACGTCACCTGGAACCAAGAATGTG 

MCTCAAGGACCCCCGCCTGTTGGCAGCTGCATTTACTTGACTCCTGTTCACTGTTTCTT 

AGCCTTGTCCTTTCTCTCCTGCCAGTTCTAGGGGACACTGCTTCTCCTGGTTGACCTCAT 

CAATGCC 

Sequence 2455 

CCGCGGTGGCGGCCGAGGTTTCAAGGACCAGCCTGGCCAACATGGTGAAACCCCATCTCT 

ACTAAAAATATAAAAATCAGCCGGGCATGGTGGCATGTGCCTGTAATCCCAGCTACTCAG 

GAGTCTGAGGAGGAGAATCACTTGMCCTGGAGGCAGAGGTTGCAGTGAGTCGAGGTTGC 

GCTACTGCACTCCAGCCTGGACAACAGAGGGAGACTCTGTCTCAAAAAAAAAAAAACCTA 

CAGCTGTTCMGGACCAGCTGACAGGTCAAGTGTGGCCTTTTCTGGTCTTTGAACACATC 

ATAGAAAGTGACAAATGCTGCAAAGCCATGAAGAACATGAACTATAAACGGGTAGACTAA 

CTGCCCAGCTTAGACACTTATCTATGCCACAAAACAGCTGAATTTGTCACATTTATATAT 

TGCAATAT 

Sequence 2456 

AGGTCTTCGACCCACGCGTCCGGTGGCTTATGCCTGTAATCCCAGCACTTTGGGAGGCCG 
AGGCAGGCGGATCACAAGGTCAAGAGATGGAGACCATCCTGGCCAACGTGGTGAAACCCC

ATCTCTACTAAAAATACAAAAAGTAGCTGGGCGTGGTGGCACACGCCTGTAGTCCCGGCT 

ACTCGGGAGGCTAAGGCAGGAGAATCGCTTGAACCTGGGAGGCGGAGGTTGCAGTGAGCG 

GAGACCACGTCGCTGCACTCCAGCCTGGTTGACAGACCGAGACTTCTTTTTCAAAAAAAA 

AAAAAAAAAAAGTGCGACCTGCCTNGGCGGNCGGTTTAAAAAATNGTGGATTNCCCCGGG 

CTGNAGGMTTTTCGATNTTCAAAGCTTTATTNNATTACCGTNCGACCTTTNGGGGGGGG 

GCCCCGGTACCCCAACTTTTTT 

Sequence 2457 

CTATACACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGA 

CCCACGCGTGCGCAATnTTAGGCCCACAAGGAGTCAAGCACCTCAAGGAGATCTTCAGT 

TTGAACTTGGTGTAGACACAGGGATACTGATGAATCAATATTCAAATTAGCTGTTACCTA 

CTTAAGAAAGAGAGGAGACCTTGGGGATTTCGAGGAAG 

Sequence 2458 

GCTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTAGAGTCGGGAT 

GCACAACTTCAACCACCGACTTATCAATGCAGCCGCCTGTGTATTGCAATTGGCCGTTAC 

CTTAAGCACTGAGCCACCCGGGTTTAGTTCAGCCATTTCAAGAAGTATATTTAACGTCGG 

TAGTTCTGCTTTATTAAAATGCAGCAGAGGTACCTGCCCGGGCGGCCGGCCGCACI 1 1 1 1 

I 1 1 1 1 1 1 I 1 1 i 1 1 1 1 AATGAAATTGAGGATTTAACCAAAATGTGAGTTCTAATTAGGACA 

ATAAMTTMTCCTMCTACAmCCTTATAAATACAAAAATGTGTTTTGGAAACACATC 

CCACCTGGGGCGCTAAGCGATGAGAGTAAGCAATTATAATGGCTGACTCCTCCAGCTCTT 

TGCTTTGAGTTGCCGTAAGAAAAAA 

Sequence 2459 

CCGCGGTGGCGGCCGCTACCTGCCCGGCGGCCGGCCGCCCGGGCAGGTCTTCGACCCACG 

CGTCCGAACTAATTGGCTTTTTAGAAACACCCCACAAAAGCTCAGAAATTGGCTTTAAAA 

AAACAACCACCAAAAAAAATCAATTGGCTAAAAAAAAAAAAAAAAAAAAAAGTGGGCCGC 

TCACCT 

Sequence 2460 

ACTACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTCTTCGACC 

CACGCGTCCGGACAGCTCGTGTCCACAGGGGTCATGGGGTCTCTTGCTGCTAGGATTCTG 

GAGTTTCATGGTAAGAGCGGGCCACTCCCCACCTATTCAACTCACCCCTTCCCCAGGAGT 

TAATGGGGGCTAGTAAGGAATGCTAGTGCTTGGAAGCCCTGTGCAGGCTTCTGAGATTCC 

TGCCCCCTCAGCCCATGCTCTGCATCCTCCCTTCATCCACCCTCAATGTTTTCCTTCAAA 

AGATCTGCTCAGAGTGTGCCAGTCTTCCCAAATTCCTGGTCTCTCCATGAGAGATGTTCT 

TCCTGGCTGCTTCTAGTTGGCCATCTGGTCTTGAGTCCTNTGTACCTCGGCC 

Sequence 2461 

AGGGC GAATTGGAGCTCCCCGCGGTGGCGGCCGCA CI I 1 1 I H 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I 
TTTTTTGGGTTTCTCTTTGAMGNTTATTGNTTTCTTTAAAAAAAAAAAAACCTATACCT 
TTTATATTTTACATTCACCTNTCANAATATTTAATGGNACCTGCCCG 
Sequence 2462 

ACGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGAGCAGGAT 
TACCATGGCAACAACACATCATCAGTAGGGTAAAACTAACCTGTCTCACGACGGNCTAAA 
CCCAGTAGAAACAAAGT 
Sequence 2463 

ACGACTNCTATAGGGCGAATTGGAGCTCCGCGCGGTGGCGGCCGAGGTACTTTAAAGTTA
TTTTAGTCATGAAATTTTATATGCAGAGAGAAAAAGTTACCGAGACAGAAAACAAATCTA 
AGTCGA 
Sequence 2464 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGNGGCGGCCGAGGTTTCAAGACCAGCCTGGC 
CAACATGGTGAAACCCCATCTCTACTAAAAATATAAAAATCAGCCGGGCATGGTGGCATG 
TGCCTGTAATCCCAGCTACTCAGGAGTCTGAGGAGGAGAATCACTTGAACCTGGAGGCAG 
AGGTTGCAGTGAGTCGAGGTTGCGCTACTGCACTCCAGCCTGGACAACAGAGGGAGACTC 



TGTCTCAAAAAAAAAAAAACCTACAGCTGTTCAAGGACCAGCTGACAGGTCAAGTGTGGC
CTTTTCTGGTCTTTGAACACATCATAGAAAGTGNCAAATGCTGCAAAGCCATNAAGAACA 
TGAACTATAAACGGGT 
Sequence 2465 

CCGCGGTGGCGGCCGAGGTACAAGATAGGTATGGATTCCAGNCTGGAGAAACCCNTAAAC 
CACTACACCCTGCCTCANAGNAGGGNAGAATTNTCAGTATGTATGTGGAGACAGGCTCGG 
ACGCGTGGGTCGAAGACCTGCCCGNN 
Sequence 2466 

CCGCGGTGGCGGCCGAGGTCTGATTACAAAAATTTAAAAAATATGCAACTTCAGGCATAA 
AGGGTTAAGGGGCAGATCTTCACAAACTCCACACAAGCCAGCAAGCACAGCACCTGCCCG 

Sequence 2467 

TACT TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCAC I 1 1 I I I I 1 1 1 1 I 1 1 1 I 1 1 I 

TTTTMCCTTTCCTTATGAGCATGCCTGTGTTGGGTTGACAGNGAGGGTAATAATGGCTT 

GTTGGTTGATTGTAGCGGACGCGTGGGTNGAAGCTTGACCTNGGCCGCTCTAGAACTAGT 

Sequence 2468 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGCTTGTACAGTATGTGAAGAAGTTGCAC 

CTCATGAGCTGATCACTGATCAAGTTAAAAACACCAGACATACATTATACGTCTAATAAA 

TTTCCTTCAGGTCATGACCAGCATGAAAAAATNAGACACAGACACTCAGCAACCATTCTG 

TAAATGTGTACCT 

Sequence 2469 

CTACTTAGGGCGATTGGNNCCTCNCNGNGGGGGCGGCCGAGGGCCTGATCTAACTGCTGC 

CAAGACATGCCAAGGTCACCAGCTTCCCGTCCTGACCAACAGCCACCTCTCAGAAGGCTG 

GCTTAACTACTCANCCCCAGTCTTNTTAGCAGATGGCAAGGAGCTCTGGCCMGATTTTA 

GTCTAAACAGAACCCTAGGCTGCTGATGCAACATCAAGCAC 

Sequence 2470 

GGGATGTGGTTTAAGTGGATTACTGCCTANCTGAGCCAAAATGTTTGCTTGTACCTNnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnACTTIsrrTTCCTGGNTTAAMTGTTATTTCGGTTTAN 

MTTTTTCCCCANNCATACCMNNCCCGGGGGGNTTNAAMTTTNTNAAAANCCGGGGGG 

GG CCTTNATGG GGGNGGNGCNANNCNCCCNNATTAATTANNGNNNGNGGNTCNCNNTNGC 

CCCI I I I I I I I AAAAMNGGGAAAAMCNNTTTTCCNCGNCCNANTTTGGNNTNTATTTN 

AMTT NGGCCCAACCC CCCCGGGGGGNGGGGGGCCNTTTTGGNNTTTTTGGCCCCNNNTT 

CCCCNTTTTNTTTTTTTNTANAAANNANGGGGGGGGGGNGGG 

Sequence 2471 

TACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTGCGGCCGGCCGCCCGGN 

CAGGGTCACGCGTCCGGCCTGCTGCTTTGGACATGATGCTGACTGGTAGAAGCATTCGTG 

CAGACAGGGCAAAGAAAATGGGACTGGTTGACCAACTGGTGGAACCCCTGGGACCAGGAC 

TAAMCCTNCAGAGGMCGGAC AATAG MTACCTAGAAGMGTTGCAATTACTTTTGCCA 

MGGACTAGCTGATMGMGATCTTTTCCAAAGAGAAGACAAGGGTTTGGTGGGAAAAAT 

TGNANAGGGGTTNTTGCNNTNACTTTTTTCMTTTGTTTAGGGCMCMGGTTTT^ 

AAAAAAAAAAAAAA 

Sequence 2472 

CTACTTAGGGCGAATTGGANCTCNCCNCCGGGCGGGCGGCCGAGNACTTACACNCNNGCN 

ATCGNTTTCNTGNCNGCAGNNGGATNCACTAGTTNTAGCTTCGGNCGCCACCGNANCCNN 

CNNGAGCATGCATGG MCA CATATACCAMCATCTTTCTGATAACATTAMCATTTTTAA 

MGATGTTAAATGTTCTTTTCATTGTGGTGCTTCAGATTCCTGATTCTAGMCTTGTGTG 

TGTGGAACCTGTGTGCTAACTATTCTGTTGGAATTTACCAGCAAAGAATTATCTAAGAAT 

TTTCAAACTAAATGATGGGGGGAAGGAACTA 
Sequence 2473

CCGCGGTGGCGGCCGCA C I I I I II I I 1 1 II I I 1 1 1 1 1 1 1 1 1 1 1 1 1 I I GGGGTTTCTCTTT 
GAMGNTTATTGNTTTCTTTAAAAAAAAAAAAMCCTATACCTTTTATATTTTACATTCA 
CCTCTCANAATATTTAATGGTACCTGCCCG 
Sequence 2474 

CCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTCCGATCCTCTTCTGCTGTTCTG 

GAAAGAAACAGACCAGTGGTATGGAGTATAAGAAAACTGATGCACCTCAACCGGATGTGA 

AGGAAGAGGAAGAAGAGAAGGAAGAGGAAAAGGACAAGGGAGATGAGGAGGAGGAAGGAG 

MGAGAAACTTGMGAGAATCATGCTGTTGCATTTAGAACTTTCTGCTTTGCACAGGAAA 

GAGTCACACAATTMTCMCATGTATATTTTCTCTATACATAGAGCTCTATTTCTCTACG 

GTmATAAAAGCCTTGGGTTCCAACCAGGCAGTAGATGTGCTTCTGAACCGCANGGAGC 

AAACACTGAAATAAAATAGTTTAT 

Sequence 2475 

CGAATTGGAGCTCCCCGCGGTGGCGGCCGCA U 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 INNGTATTTTN 

AGNANAGATGAGGTTTCACCATGTTGACCAGGCTGGTCTCGAACTCCTGACCTCAGGTGA 

TCCACCCACCTCAGCCTCCCAAAATGCTGGGATTACAGCGTGAGCCACCANGCCCGGCCA

A 1 1 i 1 1 GTAACTTTT ACAMG ATATTTAATTTAAAATTGATTTTAATAAAAGGTAGACAT 

CCAAAAACACAGGATGATGAATGCACTTCAATGTTAGGGGAATATC 

Sequence 2476 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGTTACTTTAAACAAAAAA 

TGTTTACTTNTAAGGATATACGCACAAAGGGAACATAATATACAGATAAATGAGAAGTTT 

CGATTCTGCATCAAGCATTATTCAATCGGACGCGTGGGTCGAAGCTTGTAAACCTGCCCG

GGCGGCCGCTCGAGGCCGCAC N I 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 GAGACGGAGTCTCGCTCTGT 

CACCCAGGCTGGAGTGCAGNGGCGCAATCTNGGCTCACTGCAACCTCCACCTCCCAGGTT 

CAAGTGATTCTCCTGCCTCAGCCTCCTGAGTAGCTGGGATTACAG 

Sequence 2477 

CTAACTATAGGGCGANTGGANCTCNACCGCGNGGNGGCGGCCGNGGNCAAGGCTTCGACC 

CACGCGTCCGNAAAAATTANCCAGGTGTGGTGGCACACTCCTGTAATCCCAGCTACTCAG 

GAGGCTGAGGCAGGAGAATCGCTTGAATCCAGGAGGNGGAGGTTGCAGTAAGCCGAGAAC 

CTACTGCACTCTGGCCTGGGCGACAGAGCAAGACTGTCTTGGGAAAAAAAAAAAAAAAAA 

AGTGCGGCCGNCCGNCCGGGCAGGTCCTAGCTTGAGTCGACCCACGCGGTCC 

Sequence 2478 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAATTGCTAAAACAGCTCCAGG 

GNAAGNNATCTTATTTAGCATTAGCTCCCTCACMCTNTTTTCATTCATACTTNTATTGG 

CCAGNCTCATACCTGMGTATTTTAAATGAGTTNACAATTATTNCACTTACCNTCAGAAA 

AAAAAAGGAGCAAAAACTCTTAATGACTGGTNACATGCACATTTGGTGTAGGAAATTATT 

ATGNGGTAAAATTTATATATTTCTATTTATTTTTATTAATTTATTNTTNACACATTATTT 

CAC 

Sequence 2479 

ACTATAGGGC G AATNG GAGCTCCCCGCGGTGGCGGCCGCA C I II I I II I 1 1 1 I I I 1 1 I I I 
i i I I I l l l l TTGGTTTTCTCTTTGAAAGTTTATTG ITTTCn I AAAAAAAAAAAAAAACC 
TATACCTTTTATATTTTACATTCACCTNTNAAAATATTTAATGGTACCTGCCCG 
Sequence 2480 

ACTACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACG 
CGTCCGTGGTGAACACAGAGAAGACAGGTCTTGTATATATTCCTCTGTATTCTGGGGAGC 
TTTGACCTTGGAGCTTTGTACCTGCCCG 
Sequence 2481 

CCGGGCAGGTCTACTCAAGTAGTCTTTACCCCCTACTCAAGTAGGGGGTAAAGTGTAGAA 
CAAGGAGTTTGATCTGTGTTCAACTGATTGTGAACCATCAATTGAGATAACTCACTACCT 
TCMGGCCAGCCAG^ACATACTTTTTGNAAAAGCCAAGAGTGGAANCAGGGTTGGTTTT 
TAATCCAATTTTTGGGC 
AGGNCGACNTTNTCACAGGCAGNNAAACCGGCCAGNTNAAAACACTATGCTANCTCGCGG

GGCCANTNTTAGGATGGGTGAGGCAGATGAANCCATTCTCCNANTGGCCAAGGCCGAGGG 

CATCAGCCTCAMGAACNTTTGACNGGAGAGAATCACANACGTGNNNTATTCGTCATAAA 

NAAANAATGAAAAACCNACC 

Sequence 2483 

AGGTATTCGACCCACGCGTCCGTAGTTTTTATCTTTGACCAACCGAACATGACCAAAAAC 

CAAAAGTGCATTCAACCTTACCAAAAAAAAAAAAAAAAAAAAAAAAAAGTGCGGCCGGCC 

GCCCGGGCAGGTACCAGTAGCAACATATGAGTATTTCTCTAGATAA CI I 1 1 1 1 1 IGACAA 

GGTCTCACTCTGTTGCCCAGGCTGGAGTGCAATGGTGCAATCTTGGCTCACTGCAGCCTT 

GACCTTCCCTAGCTCAGCTGAACCTCCCATCTCAGGACACCATTGCCTCCACTGCCCATC 

CTGCATCTGCCTGCCTACCCCAAAAGT 

Sequence 2484 

AGGTACATTTTCACTCCTGCTCTAAAACTTGCCTCAGTCTCTCACTGTGCCTTATGCCCC 

TCAGCTGAATTCTTTCTTCTGAGCAGGCAGGAATTGAGGTTGCTGCAGACGTGTATGCAT 

TTGCCACCAGTAACATACTTGGTGCCACATGACTAGGATATGTTCTCTAGTGCTAACATG 

TTCGTTTACAGTTCTTAGGACTCCCTGATAGAAAAAAACACAAAAAAAAACACAAAAAAA

CCCAACCAACCAAACAAACAAACAAAAAAAAACACAGGAGTTTCTCCCAAAAAGAAGTCT

GCAGTGTCTTTTCTGTTTTCTCTGAAGGTATCCCAGGGTGTTAGAT 

Sequence 2485 

AATGTTGCAGGCTACTCTCTGCCGACCAGGCCGCCGCCCGTCCCTCGTGACTACAGCAGG 
TACCTGCCCGGANGNNNNNNANGNAACGNATGTTTCCACCTNCTTCTCCAACCTCTACCC 
CACCATTAGT NGTA TNTTNACTNTNAAAACAGTGGAACCACAGCCCTAAAGACCTGCTNA 
TN N AMGTN CTTTTGTCTTAATTGTATTTAAAAAAAAAAA 
Sequence 2486 

GGGGAAAACCCAGCTCCACCGCGGNGGCGGCCGCCCGGGCTTTTCAAGCNTCGACCNACC 
CGTCCGATA ATTTA TACTAAATTTAGTAAAANGGACTTNiTTATTCAAAGCATCMTAATT 
AAAAGAATTATTTTAAAAAAAAAAAAAAAAAAAGGGACCT 
Sequence 2487 

AGGT AC CCTCACCTTGGTCATCTATCCTGAM TAAGGCTTAGTTAGTATTGGCCTGAATG 
TTTTGTGI I I I 1 1 t I 1 1 G I I I I I 1 1 I I I I I I I I I I ACTGTTACTTTG AAAAATATGTATG 
TATACCTTATCATATCTGCCTATATCACTTACTTTGGGGAGATACTCAGAGCTTTGTGGT 
TATCAGTATACTAAAAAAAAAAAAAAAAAAAGTGCGGCCACCTGCCCG 
Sequence 2488 

AGGTACGCGGGGGAGCCTGTCXJAGCTGGCCCGGGCCCTGGCCTGGTTCTCAAGTGTTTCC 

TAGACAGAGAGGCACCTGGGTCAGTATTAGTCTATTTATCAGAGGTGTAAATAATCTATG

TATAAG mil CTCCTTTTAGATTATTTTGTATTTGTTTAAAAGMGTTTTGTCAAAATA 

CMA MTATA MGAMTGACTGAMGTTGTTGACAGGGTTTTTMGAMTMNTTATTCT 

MTTGTTTTTGTTTGGTTTGTTTTTGCCTTGTAAACTAGCGCCAAGGAAC 

Sequence 2489 

CGGGCAGGTACGATGGGAGGACAGCTTTGTAGAAAGGACATTATNCAGCTNATAGCAAAC 

TTTGTGGATCCCAATCCGAGATTTNCCTGCTGAAAGACAAGAAGTNTCTNAAATAAAAGN 

GCTGTANCAGNATTTGTATACTCCAGAATAAGNTTCTGTGATTCTTANCTGCCAATGTGT 

TCAAGGCGTGATGACTNGGTNTCTGTTTCTNTGAACATNAATACTAGGGTCTGTATAAAT 

TTCMTGCATGCCACCAGCTNATCMCCCTTTTGGCTTTGATTTTTGNATGNNGNATTNT 

TATTCCCTANGANTTCCGGCCAAGTACCTTTGGNCGCCACCCGTGGTGGGAGCTCCAATT 

TCGCCCTTATMGTGMGTCCGNMATTACGCGCCGCCTCANTTGGNNCGGTNAGTTTTT 

ACAACGCCNGANGACCTGGGGAAAAACCCTTGNCCGTTACCCCAACTT 

Sequence 2490 

GNCGGGCAGGACGCGGGACCAGAATGCAGTTCCAGCTTAGGAAGCCACAAACAAGCCACC 
CAGGAGGAACAAAACACCGNAGCGTGGATTTTTCAAATTTCCCCCGGGAAAGTAAGTCTC 
GCTTCCTGCCAAANAA
Sequence 2491 

CG GGCAG GTNCCTTGCTMGTTCAGACCTTCTCTTCCTTTNCTTT(XTTTCCTCTCCTGC 

CCATTTTCCTGTTCTTCTGTCCTTCAATACTTCTGNAGCTTCCCATTCATGTTCTCTTCT 

CCCACGCAGGCCNCATTGTGTGCAAAAACTGNGGNGGGGGCTGTGCTGNACTCNTNCX3TG 

CCTCCTGCCTCCTGCGGCTGTTGGATTTGGGAATGACCTTTGGTGAGAGNCTCACTGCTC

C AGGGTC TATTTTTTGGTCCAMGGCTAGMCCTATANMGTNGGGMTCACCTNTTTTT 

TCTTTTTCNNGGGTGAAMTAAMTGGGTTTTTTTCAANTTCNAACAAAAANAAAAAA 

Sequence 2492 

AGGTACAGAGAACTGAATTTACACAATAAAGTGTTACCCTATACCAGTGATTCTAAAATT 

TTGG TCTGGGGMCCTTTCATGGGGTTCATGAGGTCAAAACTATTTTCATGATAATGTTA 

TTTTGCCTGTTATATTCTCATTCTCGAGTATACAGTAGAGTTTTCCAGAGGCCACATGAT 

CCATGACATCACAACAAAATGAATAGAGACAGATGAAAATCTAGCTGGCTTCTGCTACAT 

GAATCAGACATTAAGGGGTCCAGAGACCATAAAGTGTGACAACCACCACTCTTCACAACC 

TATATATMTATCTCAAAATAATGTTAATTCTCTATCCTCAAAGTTTATTTCCTTATATC 

TACATTTTCTATGATCAACACACTCACTACAAAA 

Sequence 2493 

CCGGGCAGGTACCACAAAAACAGTTACATGGTAGAGTTCCAATCACACAGAAAGGAATCC 

ATTCAGCAAATTTCCAAAGTTGCAGACTGTTTGGGCCCACCCAGAATTCCACCAGGACTG 

GACGGCCCCCACCAGGCTATACTGGATATGTGGACACAGAGCTTATGCTAAGCTGCCTGA 

TCAGGACAGGTAGCTGTATAATTGGCACCATTAAGCCATCTTTCTTCTTACTGCCCATAA 

AAACAGGTGAACTTCTAGGCTTCCCAGTCTATGCTTCCTGCAAAAAATGAAGCATAGCCA 

TAAGTGATTAGAAAAATAATGAATGGCCCTCTGAAAGAATCATACAATACTATAGACCCG 

CCACTTAGGCACAAGAAGGCTCATGAGGATATC 

Sequence 2494 

AGGTACCTT GGGA TTGCAGGTGCCCACCCTTGCGCCTGGCTMTTTTTGTATTTTTAGTA 

NAGACGGGGTTTTGTCATGTTGTCCAGGCTGGTCTCGAAGTCGAGCTCAGGTGATCCGCC 

CATCTCAGCCTTCCAAAATGCTGGGATTACAGGTGTGTGCCACCATGCCCAGCCAACACA 

CACATTTATTTAATGCAAGTTTTACCTAGCACAGAGAAGCAGTGAGAGTCAGTTACTTAT 

ATATTGAATTGGACCAAGTAACTTGTGAAGAAGCTAGTAAATATATGGAGGCTAAAAAGC 

TGAGTGGTTCTGTTCTAACAAGGTCTGCACAGTAATCTCTTGGCCTCGACTTCTCATCCT 

TAAAAATAAGGAGATCGTCCTATGTTTC 

Sequence 2495 

NNGG GGGGGGGGGGCCCCCCCCCNGGGNNNAAAAAAAAAANA I I 1 1 1 1 1 T T NTTIMTTTTT 
1 1 I I I NAAAAAAAAAAAAAAAANNNGGGGGGGGGGGGGGGNGGGGGNGGGT C I 1 1 1 1 1 I N 
AAAAAAAAAAAAAAAAAAAAAACCCCCCCCCCTNNN I 1 1 1 1 1 1 1 1 1 1 1 II INTNTATNAA 
AANNGGGGGGGGGNNNNNNNNNNNNNNNANATNTNTTTTTNNNNNGGNGGGGGGGGGGGG 

Sequence 2496 

TGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACAGCGGGGGCGGAGGTCANGGG 

ACAAGATGGTGCCACCGGTGCAGGTCTCTCCGCTCATCAAGCT 

Sequence 2497 

GGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACAGCGTGGAAGGGTTTANGGCAG 

CAGTGTTCTGATTCTTTGCGGGACGGCGAGCGCATTTGTGCTTTGCCCGCCGCGGCCTAG

GAGGCCTTTTGAGGCCGCGTAGTCGGTGTTTTTGAACTGACTCTACAGCTTCTGGCAGGC 

CGTGCGGCGCCCTGACCCGGCCTCACCATGTTGGTGCTGTTTGAAACGTCTGTGGGTTAC 

GCCATCTTTAAGGTTCTAAATGAGAAGAAACTTCAAGAGGTTGATAGTTTATGGAAAGAA 

TTTGAAACTCCAGAGAAAGCAAAAAAAAAAAAAAAAAAANANAGTACCTGCCCG 
Sequence 2498 

AGCTCNCCGCGGTGGCGGCCGNGGACNAGGGNCTGANTGTCTGNGTNTCAGAATGGGATN 
AGTGNCCTTATAAATGAGGGAGCTNGNTTGTCCCTNCCACNACATGAGGTTACAGCAAAA 







AGATGGCTGTCTATNAACCAGCAAGTTNGCCTTTGNCANACCCCAGATCTGCANACTACC 

TTGATNTTGGACTTTNCATCCTGCACAMTCTMGANANAMTTACTGNTGTTTATCAAC 

CACTCNGTTNATGGTISnTNTTCGTTATAGCAGCCTGAACTAAGACAACAGGTNGATCTTA 

AGGCATNGCTACNATNAAGTCTTTCCNTGCTCAGAATCTCC 

Sequence 2499 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGGGAANTGGATGACTT 

TTCTTGTCCATATCACCATGGAAATCTGTGTGCTGGGCATGGAGAGTGNGAAGCANGCAG 

ATGCCAATGCTTCAGTGGCTGGGAAGGTGATCGATGCCAGTGCCCTTCAGCAGCNGCCCA 

GNACTGTGTCAATNCAAAGGGCCAAGTGTGCAGNGGAAGAGGCACTTGTGTGTGTGGAAG 

GTGTGAGTGCACCGATNCCAGGAGCAT 

Sequence 2500 

TGGAGCTCNCCGCGGTGGCGGCCGAGGTACTATATGTCTGAATGTCTGTGTCTCAGAATG 

GGATTAGTGACCTTATAMTGAGGGAGCTTGTTTGTCCCTTCCACTACATGAGGTTACAG 

CAAAAAGATGGCTGTCTATGAACCAGCAAGTAGGCCTTTGCCAGACCCCAAATCTGCAGA 

CTACCTTGATCTTGGACTTTCCATCCTGCACAMTCTAAGAMTAMTTACTGTTGTTTA 

TCMCCACTCAGTTTATGGTATITTTGTTATAGCAGCCTGAACTAAGACAACAGGTAGAT 

CTTMGGCATAGCTACMTTAAGTCTTTCCATGCTCAGAATCTCCATCTGCTGGCX)AAGC 

ATAGTGGTTCGCACTTGTAATCCTGGCACTTTGGGAGGCCAAGGGCGGGTGGGTCACCTG 

AGGTCAGGAGTTTGAGACCAGCTTGGCTAACATGGCAAAACTCTGTCTCTACTAAAAATA 

CAAAAAATTAGCCGGGTGTGGCGGNGGGTGCCTGTAATCCCAGC 

Sequence 2501 

NTGGGCG AATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGCCGGGCCATTNTTCTTTCTTT 
I II 1 1 1 II NGCGGATGGGGACTTGTGMTTNTTCTAAAGGCGCTATTTAACATGGGAG 
Sequence 2502 

AGGG GCGAAATTGGGAGCCTCCACC GCGGTG GGCCGGGCCCGAGGTACTTTTCTTTTTTT 
TTTTGNTGGATGGGGGACTTGGNGAATTTnTCTAAAAGGGGGCTATTTNAACATGGGGA 
AGGANAAGCGTTGTGCCGGTTTCCA 
Sequence 2503 

CCGGGCAATCTAAGAAGACATGATCACTAAATGTGATGTGGGATCCCAGATGGGATCCTG 

GACCAGGTAAAAACTAAAGTAATGTTTCAACTTCAGTAAATAATAATGTATCAATATTGG 

TCCATTAATTGTGGCAAATGTGCCACACTAATGCAAAGCGTTAGTAACAGGGAAAACTGG 

GAGCAGGGTATATGAGMCTTTTTGMCTGTTTTCACAATTCTCCTGTAAATCTAAAACT 

CTTCTGAAMTAGAGTTTATTCTTTAAAAGTGTCTGGAGGATGTGCACAAGGGTGTGGCA 

GCAGAGGGGCTACAGGTAMMATCATGACATCTGGMTATTTCCTTCMTTTTTGCTCC 

ACACGGTGACTATCTTACCCTGCTCCC 

Sequence 2504 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGGTACCCGGGGGGTCTGAGG 

ACCCGAGGTCGTAGGTGGATCTTTTTCACGGAGCAAAGAGCAGGAGGACAGGGGATTGAT 

CTCCCAAGGGAGGTCCCCCCGATCCGAGTCACAGCACCAAATTTCATGCGCATCTGTGTA 

MGAGACCAACMGCAGGCTTTGTGTGAGCAGCMGGCTGTTTATTTCACCTGGGTGCAG 

GCGGGCTGAGTCTGAAAAGAGAGTCAGTGAAGGAAGATGGGGTGGGGCCGTTTTGTAAGA 

TTTGGGTAGGTAAAGGAAAATTACAGTCAAAGGGGGGTTGTTCTCTGGCAGGAGTGGGGG 

TCACMGGTGCTCAGTAGGGGAGCTTTTGAGCCAGGATGAGCCAGGAGAAGGAATTTCAC 

MGATAATGTCATCAGTTMGGCMGMCAGGCCATTTTCATTT 

Sequence 2505 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTAACAAATGCAGTAGCCAACAAGATTACCATG 

CMTCATTMGGAGMCCAAAGTMGAGAGCCACTCAMCCAGATTTTGAACGCTACTAA 

AATTAAAGTAGTTCTTTGATGMTATGAATGAGTAGGGAAAGGATTCTTTGTAATAGTGA 

TACCTCTGTGGTMGAGMGGGTGGTATGTGAGTTTTAGTCTACAGATTATGGCAAATTC 

AGTGACAACAATCAAATGGTCTAAGATTGACAGTAGCACAGTTTTACTCTGTGAAGGTAA 

TGTTCAGGACAAATTTCAAGAAAACTAGAAAACCATTCTTTACAGCTGAAATCTTTCCCT 
MCCATTGTTATTTCCACTTTTAAGTCCTCAAGAGATGAGAAAAGGGAGGTAAGGCTTCC

TTATACATTTCCTGCACAATGAAACATTTTTCCTCCTCCAGGCAAAAGATTCAAGCAGAA 

CTGGCAAATATCTTATCTTGCTCTTCTTAATAATATAATGGTGGTAGGATATAAAGGTCT 

ATACAATTAACCTANAT 

Sequence 2506 

ACTACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGNCNGGCCAGGTACACNTTC 

TTAGACCTCAACCTCGAACTNTCAAAATTNAGGATGTCTCANCCCTACTGAGGCCGGGAG 

TCACCTNNACACTGANGGCCCTNGGTGNGAAGATGMCCTTNCACCGTCTNTANTGCATT 

CTGGAGTGCAAAAATAAAATCCACTNAAGAGTCACAAGGCCCGCTGTGCATAATNGGNTT 
CACTTTTACC 1 1 1 1 1 1 I 

Sequence 2507 

CCGCGGTGGCGGCCGGCCGGGCAGGTACATGTAATGCTCCTGAACTGTATGCTTCGCACG 

GCTGACATGCTMGNTTTGTTCTGTGTATTTTATGACTATTTTTTAAAAAGTAAACAAAA 

AAGAATTAGCTGGAAATACCAGCACAGGCAAACCCCTGGAGACAGAAAGCAGGTGAGTGG 

NTGCTGGGGCTTGAGCAGGAGGAAGGGCGAGGGACTGCANAATGGCCATGGGCTTTGCCT 

TCTAGCATGATGAGAATGTTCTGGAATTAGACAGTGGTAACGCTTGTTCAACACTGCCAG 

TGTAGTTAATGTCACTGMTTATACACTTTAAATGGCTAACATGACCAATTTTATGTTAT 

ATATATTTTACTACCACAAAAAAACTANCTGGCACCTAAAAACATTCCATTGAACAGGCC 

CCTTCAGATCTGTGTCTTTCCTGCATGCAAATTACNCCACAGAGCAAGCACCTATGGCAN

CGTGGATCACAGGCTCTGTTTTANGATAGANAAAGGACACAAGGNGTCCCCC 
Sequence 2508 

ACTTAGGG CGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTCTTTTCTTTACTTTT 

"ii GAGACGGAGNCTCACTCTGTCGCCAGGCTGGCGCGATCTTGGCTCACTGCA 

ACCTCTGCCTCACGGGTTCAAGCGATTTTCCTGCTTCAGCCTCCTGAGTAGCTGGGACTA 

CAGGCACACGCCACCATGCCCAGCTCATTTTTGTATTTTTAGTGGAGATGGGGTTTCACC 

ATGTTGGCCAGGATGGCCTCCATCTNTTGACCTTGTGATCCGCCCGACTCGGCCTNCCAA 

AATGCTGGGATTACAGGCGTGAGCCATCAAGTCTGGCGAGAGAGATTGTTTCTAGATGAG 

GGNGGGGGCCGGNTGTCCTTANCCCAAAGCTTGTGCCAGTCTCTATCAGAAATAAATGCC 

CCCAAAACCAAAAAAAAAAAAAAATAAGGTACCT 

Sequence 2509 

GCCGGGCAGGTACTACNTCAGCAATTTCTCCANNGNGCNGNNCGACAGCATATGGCACCA

GCCCATTTTCAATTTGCTGAGCATCGGCCAAAGCCTGTATGCGAAAGCCAAGGAGCTGGA 

CAGAGTGAAGGAAATTCAGGAGCAGCTCTTCCATATCAAGAAGCTGTTGAAGACCTGTAG 

GTTTGCTAACAGNGCATTAAAGGAGTTCGAGCAGGTGCCGGGACACTTGACTGATGAGCT 

CCACCTGTTCTCCCTTGAGGACCTGGTCAGGATCAAGAAAGGGCTGCTGGCACCCTTACT 

CAAGGACATTCTGAAAGCTTCCCTTGCACATGTGGCTGGCTGTGAGCTGTGTCAAGGAAA 
GGGCTTTATTTGTG 

Sequence 2510 

CCGGGCAGGTACMTTGNrTTTAGMGATANTTTGTTTTTCTCT 

ACTAMGACAAATCATGGTAGGATTGGlvrrTGTTTATTATACTTGGCCTAACTATTTGTAT 

ACMTGCAGCAAGAATGATTATTTTTTACTTAGGCTTTTAAGTAGGCTCTGATGGAACTT 

TGTTCCATAGCAGGAATCTCAGATAAGACTTTGTAAAACCCGTAAAACTCANCCGAGCCA 

TGGATTTATGCCATTAMTACCCATGAGTTGGGTGAAATTTCCTNTCCTTTNGAGGGCCC 

AAGATAAACCTGGGGCGTCTGCACTTGNCAAAAAGTGATATTCTTTACTTACACAG 
Sequence 2511

AGGTACACNGNTNNAATCTTACTTCAACTTTNAANGGGCCCACNNAACCCTCTATATCCC 

GTTGTAAATTTAACTGCTAGTCCACAAGAGGAACAGCTCTTTNTACACTAGGAAAAAACC 

TNGTNGAGAGAGTANAAAATTTAACACCCATANTNNGCCTAAAAGCAGCCACCAATTAAG 

AAAGCGTTCANGCTNAACACCCACTACCTAAAANATCCCAAAAAAAANAAAANAAAGNAC

GCGNGGANGTGTNAAATTTTNANAGMGAATTTNTNTTTAGTTCTTTGCACGAAGGNANA 

GATAAAGACACTTTTTCAAAA 
Sequence 2512

CCGGGCAGGTACGCGGGGACTGAAAATNGGACTGTTCAACTCACCTGGCAGCCACTCCCA 

GAGCTCCTGGMCTCTGGCCCMGGTTCTCTGACTGACTCCTTCTTGGCTTACTGGCTGA 

AGACTGACGCTGCCTGATCGCCTCAGAAGCCCCGCAGACCATCATGGACGCCGAGCTTTA 

GGTAACTCACAGTGGAGGCCCGCCTGCACCCAGGTGAAATAAACAGCCTTGTTGCTCACA 

CAAAGCCTGTTTGGTGGTCTCTTCACACAGACGCGCATGAAAGGGAAGACATACAAAAAC 

AAGGCCTCTGAGGTAGGTACCT 

Sequence 2513 

CCGCGGTGGCG GCCGA GGTACTT N I 1 1 I I 1 1 1 1 I I 1 1 I I I GGGTTTCCGGGGTCGTNTTG 
GGGGTCTCAATATmTGGCTCCTCTCCTTTACAGACACCTTGTATTTTCAMG 
TTGGAGT CNAAT TCCTGATCAGMGTTTGMTGGTTGTTACTGCTGTGTTTTTCATGTCA 
ACAATTTCTTTTIsniTGTTNCCTGACMGGTGTCCTC 

GCGCTTTCTTTCCTTTTCTCCTGTATTTTTTGGTATCTTNTTGAACACAANMT 

ACACAAGCTTGMTCAAMTMGCTTGTCAATAGCTTCCTTTCGCATTAGATTTAACTGC 

TC CACGN GATMTACTTMGGMCACTTTTGTTTTTCCAAGAGCCCAGTTATCGAGACCA 

GCTTTTTCCAAAATGGTGGCACAAGTGTCAGGGCTCATGCGGG 

Sequence 2514 

TACTT A GGGCG AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTA CI I I 1 1 1 1 I I I 1 1 I I I I 
TTTTTACTTTTTAAACTTCTTGTTAAAMTGMGACAC 

TTACACAGGGTCAGGATTATCMGATGTCACTTAGGCGATTGAAATTTTTCAGCTCCATT 

ACCATCTTATGGGACCACCATCCTATAAGCAGTCTGTCATTGACCTAAACATCATTATTC 

AGCACA AGCGTATTTCAAATTTAGAGTTTTACTTTGATGTTCTT C I Nil 1 1 CTTTTTCT 

TTTTTTGAGACGGAGTCTCACTNTGTCGCCCAGGCTGCAGNGCAGTGGTGCAATCTCGGC 

TCACTGCAACCTCCAGTTTGGGCCACAGAGCGAGACAGCGAGACTCGGTCTCAAAAAAAA 

AAAGAAAAGAAAAGTACCTGCC 

Sequence 2515

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGAAATAAAC 

TCAAAGGATAGGTTCAACACCATAATAGAGAGGACAGACAAAAAGAATCTGTGAACTTGA 

AAATTGAACAATAGGAATGACCCAATTTGAATAACAGAGATAAATGAACAGAGCCTCAGG 

GAATTGTGGGAGTATAACAAAAGATCTAACACTGATGTCGTCAAAGTCCCAAAGGAGAGA 

GAGGGGATGGGGCTGAAAAAGCACCTGAAGAAGTAATGGCTGGAAGCTCCCCAAATTTGG 

CAAAACACATAAGCCTACAGATTCAAGAAGCTGGGTGAATCCCAAATGAGAGAAATCCCT 

TCAAATTCACACAAAGACACATTATAGTCAAATTTNTGAAAACTNNCANATAAAAAGAAN 

TNGNCAAATNNGGTACCTGCCCG 

Sequence 2516

CCGCGGTGGCGGCCGAGGTAC I I 1 1 I 1 1 CI M M I I I I I I 1 1 I I I I I M I 1 1 GAGACGG A 

GTCTCACTCTTGTCACCCAGGCTGGAGTGCAGCGGTGTGATCTTGGCTCACTGCAAGCTC 

CACCTCCCAGGTTCAAGCCGTTCTCCTGCCTCAGCCTCCCAAGTAGCTGGGACTATAGGC

GCCTGCCACCACGCACAGCTAATTTTTTTGTATTTTTAGTAGANACAGGTTTTCACCA 

TTGGCCAGACTGGTCTCAAAACTTNTAACCTCAGGNGATCCACCCGCCTCAAAGNGCTGG 

GATTACAGGCGTGAGCCACCGCACCCGGCCCAGACTCCTTAAAATGTGANAAGTAGCACT 

GAGGMTGTGATCAGATTATGGCCTTGATTGGCACATGGGGTCGCTTTCCACGGTTGGCC 

TTCTTGTTCTCCACGGCATCTTGTGCATMGCCATTTGCCATTTCAGGAGCTCAGCATGC 

ACATCCCGGGATTTCTGNGCTTGGGT 

Sequence 2517 

ccgcggtggcggccgaggtacctgtgacatcatmttgcaccctccgacatgatatctct 

tccaaatgcttgatgaaaaaggtgggaggatcacttgagccagggagttgaaggctgcag 

tgagccctgtttttgccaccacactccagcttgggattgattcctaaagactcatg™ 

gtgaggaagcagctcagaagaggaaaggaaaggagccaggcatggctcttcctcagggac 

gcctgactttccgggatgtggctatagaattctcattggcagagtggaaatgcctgaacc 

cttcacagagggctttgtacctgcccg 
Sequence 2518

CCGCGGTGGCGGCCGAGGTA C 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 GGGGCTGGAAGGGG 

GGCAGAGTCTCACTCTGTTGTCTAGGCTGGAGNGCAGTGACAGAATCACCGCTCACTGCA 

ACCTCTGCCTTCCAGGTTCAAGTGATTCTNGCGCCTTAGCCTCCGGAGTAGCTGGGATTA 

CMGCTAATCCCANCTAGGCGTGCGCCACCACACTCGGCTMTTTTNGTTATTTTTATTA 

TAGTANAGACGGGGTTTCACCATGTTGGCCAGGCTTGTCTCAAACTCCAGACCTCAAGTG 

ATCCACCCGCCTTGGCCTCCCAAAGTGCTGGGATTACAGGTGCGAGTATATGCTTTTAAA 

GGGTATCCAATCAAGCTAACTATGGTGATGGAATGTCTCCAGTTCCTCTGTAATACACGT 

ATCGTCCAGCCCGCGTACCTGCCCG 

Sequence 2519 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTGAGA 

TTANGGGAGTGGTGATGACTCTTAACAAGCATGCTGCCTTCAAGCATTTGTTTAACAAAG 

CACACCTTGCACAGCCCTTAAGCCATTTAACCCTGAGTTGACACAGCACATGTCTCAGGG 

AGCACAGGGTTGGGGGTAGGGTTACAGATTAACAGCATCTTAAGGCAGAAGAATTTTTCT 

TACAGAACAAAATGGAGTCTCCTATGTCTACTTCTTTCTACACAGACACAGTAACAATCT

GATCTCTCTTGCTTTTCCCCACAACCTCAGCCTCTCAGAGTGCTGGGATTACAGGCATGA

GCCACCGCGCCCAGCCTCXJCTTTTAAAGCACTTTCTGAAGTCAAGCCTGATTCAGGATTG 

CAAGCCTGCAGAGAACTATGGTGGTAAAGCCTAAAAAGATAGAATCCTTCCCACACCTGA

GAAGGCAGGTATTTTTAGAAGGAAACACCAGAATCACACTTAAGTCACTGCAAAGGCATT 

CATGTTTATACATTTCTGAACTGTCTTACTTGGAACTTTATGNGG 

Sequence 2520 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGGCATGGTGGCTCACGCCTGTAAT 

CCCTCCACTTTGGGAGGCCAAGGCAGGTGGATCACCTGAGGTTGGGAGTTCGAGACCAGC 

CTGACCAACATGGAGAAACCCCGTCTCTACTAAAAATACAAAATTAGCTAGGCNTGGCGG 

CACATGCCCGTAATCCCAGCTACTCCGGAGGCTGAGGCAGGAGAATCACTCGAACTCGGG 

AGGCANAGGTTGNGGTGAGCTNANATCACAGCATTGCACTNCAGCCTGGCCAACAAGAGT 

GAAACTCCATCTCAAANAAAAAAAAAAAGGAAACATGAAGCCTTCCTTNAATGATGATAG 

TTTCTAAAGTGAATTATTTGAATCTCTTTGCATGTTTTGGCTCTGTTAATCTAACTCTTG 

TCTCTAAATAGATGCTGAAAGTGTAAATCTAGATGACTATAACATACACGTNATTGCAAG 
TGTATTCAA 

Sequence 2521 

TAGGGC GAATTGGAGCTCCCCGCGGTGGCGGCCGGGCAGGTACTTATT CI 1 1 1 1 1 1 1 1 1 I 

I I I 1 1 IGGTT ATGA AAACTTAGGGACTAAAATTAATATAAAAATTGGCATAATGTTGGAT 

TGAATCTACATTTTGGCAGAAGTTAAACATTCCCACATAATGTCAAAATTATACATCATG

CAGTTCTG 1 1 1 1 I I I GTTTGTTTTATTTTGTTTTGTTTTTGAGTCTGGCTCTGTCACCCA 

GGCTGGAGTGCAGTGGCGTGATCTGCAACCTCTGCCCCCCGGGTTCAAGCGATTCTCCTG 

CCTCAGCCTCCCGAGTAGCTGAGATTACGGGTGCGCGCCACCACACTTGGCTAATTTTTG 

TATTATTAGTAGAGACGGGGTTTCAGCATGTTGGCTAGGCCGGTCTNTCCTGACCTCAGG 

GTGATCAGCCCACCTCGGCCTCACAAAGTGCTGGGATTACAGGCGTGAGCCACCTTGCCC 

AGCCCACATCATACAGTTTGAAATGAAACTTTGCCACAACCAGCCTTTGCTGTAGCACAC 

ACATATATCACTGAACCTGTTTGNAATAAAAG I 1 1 1 II II 

Sequence 2522 

CCGCGGTGGCGGCCGCCC GGGCA GGTAC I 1 1 I 1 1 1 1 1 1 1 1 1 1 1 I I 1 1 1 I I 1 1 1 I I I 1 1 I I 

TTTNGGTCATTTGNCCTTAGTTTTTAAAMCAAAAGTGTTAACTAGAATNTAACAACAGA 

TCAMTCCAAACNCAGCAGTCCAGNGGAGAATCAAAACTTTTCCGGCTTTATTNTNTGGG 

AAAACCCCTGGTCTGTTTTCATTCCTATTGGNCCAGGCCACCATTNTATGATATGAAGGC 

CTAAATTAGGAAAGCTAGGNGAGCTGNGCAAATTNTGGGTGTCTGANCCNCCTGTTGTTT 

GGCGTGTGATGGGGGTGGAGGCCCNCACAGGGGTGTTCTCGCTAGNGTTCAAATCACAAA 

AACAGGGACCGTAACTAGGGGGAGGNGAGCAAAGCNCTCACCTTGGGCACAAAATTTAAG 

GNGTGCCAAAAMCCCAGTAACCAAAGATAAATACTNTTTTMTGCAACATTnTAAAAA 

ATCCMTTAMTGTAAAMGGTTTTTGATGGACMTGTTTCNAAAATTTTNAATAAAAGG 
GTTCCCCCCGTA C1 1 1 1 1 I I 1 1 I I 1 I
Sequence 2523 

ACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACACACATTACACTATCCC 

TTATCAAATATACTGAATGTAAGTAGGCTGGGTGCAGTGGCTCATGCCTGTAATTTCAGC 

AACTTGGGAGGCCAAGGTGGGTGGATCACCTGAGGTGGGGAGTTGGAGACCAGCCTGGCC 

AACATGGTGAAACCCCATCTCTACTAAAAATACAAAAATTAGCCAGGCGTGGTGGCACAC 

GCCTGTAATCCCAGCTACTCGGGAGGCTGAGGCAGGAGAACTGCTTGAACTCGGGAGGTA 

GAGGTTGCAATGAGCTGAGACTGCACCACCGCACTCCACCCTGGGTGACAGAGCAAGACG 

ACGTCTCAAAAAAAAAAMGTAATTAMGCTTGTTTCGATTACTTAGGCTCATCAATAGT 

AAGATCAACATGGTACCTGCCCG 

Sequence 2524 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGNCCGGGGTCCTTTGTCCTNAAAACGT 

TAGATCTGGGCTCTGATTCCTAACTGCCACTCGTGATCACAGAGGTGCAGAGAGTCATTA 

GCCATTGTTGCACTTCCATTGTTGCAAGCCCCTCACCTTCTGGTTGAAGTGAC7TTCAGG 

AGMTTAAMGAGCTGGNGTTTCCCTCCCCACTCCCATTCATTTTTCTCATm 

TTGGGAGCCAGACATAAACACAATTGCNTATATGAGAGAAATCAGAAAGTCATGGCACAG 

CCTAGGGGATGGTGCAGGCTCAGAAAAGACCTCTGAGAAGACCTTAGATTTATACTTCAG 

ACTCATCTTAGGCAAAGAGGGCTTACAACAATCAAAAAAACAAAAATAATAAAAACAGTA 

ACAAAAACAGCAAACCCTGNAGAAGAGGAAGAATCTGATTTCCAGAGTTAACACATTAGT

AAATTCTAATGTCTGGTTTTCAACAAAAATATCACAAAGGGATTACCAGAGAAACAGGGA 

MGTGTAACCCCATTCAAATTGAAACCTAAAGAGACTTAANAAACAANAGGGATTTAAAG 

GTCAAGAAAATTTTTTA 

Sequence 2525 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCGGAAGGTCACGGGCTT 

GAGTCGTAGCCAGGGATAGNATTATGGGTGAGAATAGACAGGATGGAGGACACGAGGCCA 

CAAAGCCCCACAATCCCAGGGCCACAGCGCCAGAGGCCTAGTTTGTCCAGAGGAATGAAG 

AGATCACAGGCATTTCTGACCACGTNTAGCANAAGTGGGGGATGACCTNTAAGGGCTCGA 

GCCAGGAGCNTGACTTGCAGCCCNAGT7TCAGAGCCAGTTGGGGCANACCTCCTCCTGGA 

GTCCTGGTCCCCCAAGTCCCCCAGTTTTNATTTCCTCNTGGGACTCNTCTTCAAAANCNT 

TTMTNCNCCGNTTNNANGNAMNNATTTTGGTTCATTNATANGNCNANTTTNTNAACCT 

TNCNGGTTAAATTNATNANTNNGGAAAACNAATTGNACCNNCCCNNGCNNNAGCNGACCC 
CCNGCCCGTAC N I I 1 I M I I I 1 I 1 I I I I I I 

Sequence 2526 

CCGGGCAGGTACGCGGGGCTTCTGTTGGGCGTTTCTGCTGAGAGGCGGGAGGCGCCGAGA 

GTCTGTGCGAAGGTCCGTGGACAGACTGCTTTGCCTGTTGTTGCTCTTCGGAGGCGGCGA 

TCCCCGAAGGCGAGCTGAAATACGGCTGCAGGCTACAATTTGCAGCCGACCATTATGGAT 

GACAAGGAGCCGAAGAGGTGGCCCACCCTCAGGGACCGCTTGTGCTCGGATGGCTTCTTA 

TTTCCCCAATACCCCATTAAACCGTATCATCTGAAGGGGATCCACAGAGTTGTCTTCTAT 

CGTGATCTGGAGGAACTGAAGTTCGTTCTGCTCACCGCCTTATGACATCAATAAGAGAGA 

CAGGAAGGAAAGGACCGCCCTACATTTGGCCTGTGCCACTGGCCAACCGG 

Sequence 2527 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTCAAGAAGTAAGTGGAA 

CACCTTTCCCTGTCATAGTTATTTTCATCCAGACATCTGGTGGAAGCATCAGATTCCGAC 

AAACAAGGATTTATGTCAGGATCTCTCAGCCTCTGTGTTACCGAGGGCATTCTAACAGTC 

TTCTTACTCCGGCCTCCGCTTCGCCAGCACCCAGGCCGTCTCCAGCTCCAAACACCCCCG 

CGTACCTGCCCG 

Sequence 2528 

ACACTACTTAGG GCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC 1 1 1 1 11 1 1 I I I I 
i n i i i i i n i i I NGG AANGGAGTCTCGCTCTGTCACCCAGGCTGGAGTCCAGNGGCACG 
ATCTCGACTCACTGCAACCTCCACCTCCCGGGTTCAAGCTATTCTCCTGCCTCAGCCTCC 
CGAGTAGCTGGGACTACAAGGGATCTGCCACCACNCCCGGCTAATTTTTGTAATTTTAGT 
AGAGACGGACTTTCACCATATTGGCCAGGCTGGTCTTGAACTCCTGACCTTGNGATCCAC

CCACCTTGGCCTCCCAAAGNGTTGGGATTACAGGCGTGAGCCACCGCGCCCGGCTTACTA 

CMGTCATMGTTTCTTAAAGGCAATGTAAACTCCGAAAACTTAATGCACTCTTATATTG 

NT MTAC ATTAAAATCCACTGGCCTGGCTTACACTTTTGAATCAATCTTTGAGCCATGCA 

TGATTTTGTMCATTCACGTACCTGCCCCGGCGGGCCG^^IT^ITAAAACTAGNGGATCCCC 

CGGCTGNAGGMTTCGAATTTAAAGCTTTATCGATCCCGCNNCCTCGANGGGGGGGNCCC 
G 

Sequence 2529 

CTATAGGGCGAATTG GAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTAC I I 1 1 I 1 1 1 1 T 

i i m i i i i i iGl I I I I ATTTTTTTATTTATGTTGGCAGAAAAGCTGAGGCAGGGCTTCTG 

ACATAAGGTAAAAGTGTCTTGGAACATGTCCTGGGTCCAGGGTCTATAACCCCTTGNGGC 

CTATGGAACACCAAGCTCTGTGCCAAAGGGTGGAAGGCTGCCCTGCCNCACTACAATNTA 

AGCCCAGAGCATAAAACCCCTTGTAGCCTGNGGAATATATNCANACTCGCTGGCCCCTTG 

CTCNTTGCTCTNCCAAGATCACAAATNGATTGCATNTNNAATTAAAAAAACCTGNTCTCC 

CTTATCNNAAGGTAGCAAGANNCATAGCCAAACCCGTNCAGGNTACCGNTTGNNGCACCA 

ATTA CCTTTNTNTCN TNACGTCCTAACCTGGCNACCCTTACNTCANAACATCCTAATTAC 

CTGGT I 1 1 1 I I I I I I GGATTCCCAATAAAAAGGGGGG 

Sequence 2530 

CCGCGGTGGCGGCCGCCCGGGCATGGTACGCGGGGTCCCTACAAATGCAACGTCTGCAAT 

AAAGTCTTCACCCAGCGCTGCTCTCTGGAGTCCCACCTGAAGAAAAT(XATGGGGTGTAG 

CAGCAGTATGCCTATAAGCAGCGGCGGGACAAGCTCTACGTCTGCGAGGATTGCGGCTAC 

ACGGGCCCCACCCAGGAGGACCTGTACCT 

Sequence 2531 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTTGGAAGACCTTAT 

TCGATATCTTGAACCAGAGAGATGGCAGTTGGACTTAGAAGATCTATATAGGCCAACTTG

GCMCTTCTTGGCAAGGCTTTTGTTTTTGGAAGAAAATCCAGAGTGGTGGATCTGAACCT 

TCTAACAGAGGAGGTAAGATTATACAGCTGCACACCTCGTAACTTCTCAGTGTCCATAAG

GGMGMCTAAAGAGAACCGATACCATTTTCTGGCCAGGTTGTCTCCTGGTTAAACGCTG 

TGGTGGGAACTGTGCCTGTTGTCTCCACAATTGCA 

Sequence 2532 

GGAGCTCNCCGCGGTGGCGGCCGNCCGGGCAGGTACCNCGGGGGTGCCCCGNAAGCAGTT 

GTTGTTGGTTGGGGGCCTTTGGGCCGGTGACGGANACTGCCCAGGTGTTGGNCACCATGT

TCCTCTNCGCGGTCTTTTTTGCCMGAGCMGTNANATGNMCAMTAGTGCTTTTCGTG 

GAMAGAAAAAMTNCGCTCCCTTTNMCGGTGGATTGAAMTGACTNTGNTTTATAAAG 

AGAANACTGAGGGCGGGGATACTGATTCANAAATNCTGTANCGTGTAATAAAAG 

Sequence 2533 

CCGGGCAGGTACAGCTGCATCAGCTGCTCGTAGGACATGTCCAGCAGCTGGTCGAGGTCC 

ACGCCGCGGTAGGTGMCTTGCGGMGGTCCGCTTCTTCTTCTGCTCTACTTCTGCCACC 

CGCGTACCACGGCTATCCTTATAGCTTTTTAATTAMTGMGCCMGTGGGATTTGCATA 

AAGTGAATGTTTACCATGAAGATAMCTGTTCCTGACTTTATACTATTTTGMTTCATTC 

ATTTCATTGTGATCAGCTAGCTTATTCTTGTGTACCT 

Sequence 2534 

GACATGG CGCCCGCCGC CGCTTCCCCCCCGCGTAC I I I I I I I II I I I I I 1 1 1 I I 1 1 I I I I 

TTTCTTTC 1 1 I I I I I I I I I I I 1 1 I I I I M I I 1 1 | || I M 1 1 I 1 1 I I I I I I I I II I 1 1 1 I I 

M u i i i I 1 1 I I I I I I I I 1 1 I I I I I 1 1 I I I I 1 1 I I I I NGGGGGAAATTANAAANCNTTNN 

TTCNAAAAAAAAAAAAAAAAAAAAAAANTAAAAAANTNTAAANAAAAAAAAAAAGGGGGC 

CCCNGNNCNNTAAAAAAAAANNGGNNCCCCCGNGNGGGNGNAANANTTANAAAAATNTTN 

NNANCCCNNNGANAANGGGGGGGGGNNCNNANNNAAMTTTTTTTNNTTTNANGNGGGGA 

AAAANGCNCCCTNGGGNAAAAAAA 

Sequence 2535 

CCGCGGTGGCGGCCGAGGTACTTCTC I II I I M I I 1 1| I I I I I I I A CATCCCATATG ACA 
TTGA CTGAT TAAAMGGCCTGATCGGTTTTGTTATCTTCCCACACTTCGTTTAATTTAAC

TTGTGTTTTGTATCTCTTATAMCCATAAGATAGGCCAGGCGCAGTGCCTCATGCCTGTA 

ATTGCAGCACTTTGGGAGGCTGAGGCGGGTGAATCACCCAAGGTCAGGAGTTCAAGACCA 

GCCTGACAACATGGTGAMCACTGTTTTACTAAAAATACAAAAAATTACCCCGGCGGNGG 

GGGNATGCCCCCTGGAAAATCCCACTTCNTCGGGANGGNTGANACAGGGAAAAATCCNTT 

TGMCCCTNGGMGGCAAAAGGTTTCNANNGNCTTANAAANCCNNCCATTTGTTTNCCCN 

AGNCTNGGGNGAAAAAGGGCAAAAATTTCCTTCNCCAAAAAAAAAAAAAAAAA 

Sequence 2536 

GACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCCCGCCCGGGCAGGACAAAACAAA

GACTTTTCATCMCTCTTTTAGATATGCTAGAAGAGCTAAAGGAAACCATGGACAGAGAA 

CAAAAAAATTAGGAAAGCAATGCCTCATCCAATACAGAATATCAATAAAGAGATTGAAAT 

TGTAGAAAAGAACCAAATAGAAATTCTGGAGTTGAAAAGTATTATAACTAAAACTGAAAA 

TTCACTAGAGGTATTCAGCAGCAGACTGGAGAAGTCAGAAGAAAGAATCAACAGGCTTCA 

AGATAGGTCAATTAAGATTATACAGTCTGAGGAGCAGAAAGGAAAAAGAATGAAGAAAAA 

TGAACAGAGCATAAAAGACCTCTGGGACTCTATCAAGCA 

Sequence 2537 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAGAGTGTAAGCGAGCTACACCAAAGAATGGTG 

ATTAGCGCTCCCAGGGTGGGGAAAAATGAGGATTGTTTATATAGGCAAAATGGAGGTGCC 

AMCAGAATTATAACATTTTCGGAACTAAGGCTCATTTAMGATACAAATTTGATTGGCT 

ACTATTGATTACACTTGMGGGGATTTGGTTTMCTTTCTTTTGTNAAAGAAAGNACNAG 

NNNNTMGGNNNTTTAhnTTCCCANC^ 

ANGGGGGGCCTGGGNTTAAMTTTTNACCCTNNTAAACCCCNAAAAGGNCCAGAAAAAAN 

AAATA AAAANGTTTGGTNGNTTTTTCCCACCGACCCCAGGCTTGGCAAGGCNCCCTAACT 

TTTrTCNTCTTTNGGCCNAMTNCNATTTGGGGM 

TTTACCATTCCCCTNTTNTANTGGGNAAAANNTT 

Sequence 2538 

CCGCGGTGGCGGCCGCCCGGGCAGGTACATCACAACAGTAAAATTTTGCTCTTTGCTTCT 
GGAGGAACACCCTACAACAGGTAATTAAAAAATTTAAAAATAATATGAAAGGGGAGCTAA 
ATACCTGAAAATTTAAACAAAATGAAGCAAAAAAAAGGGAAACAAATTCATTGCAAGAGA 
TGGAAATTACATTTTAAAGGCTGAAAATAATAAAGAAATTCACTAAACAACNGGCCCTAT 
TGNCCTGGCTCCTTCAGAMAGATTACTGGAANGGAACCTGGGAAGGGGGTTTGNGGAAA 
TACAAAAI I I I rGGGANTTNGGGGGGAAANTTANCACNGNGNGGAAANCCCCCCNGNACC 
CCCN1 I I I I 1 I I I INGNG GNCCTTTT TTNAAAAAAAAANNGGGTTCCCCNGGGNCCCNAA 
AAAAAAA AANTAAAAA AA I I 1 1 I I I ICCCCCCCCCCCTTTNAAAAANNGGGGGCCCCCCC 
CCNNNNGGNGGGGNATTTTTNNNTNNNN 1 I I I I I 1 1 NCCCCCCCCC 
Sequence 2539 

AGCCTTGGAGCAGGCAATGATCTCCACCCTCCTGGGTGCGGGTGCAGACTCAGGCATCCC

TGCACTT1TGGGGGCCTGCTTTTGCAGGCTCGGMGTGCTTGCCTCTGCAGCTTGGCTTC 

TCCCTGCTATTGGCACCAGCTCTCAGATCAAAGCAGGGGGGTAAGCCTGGAAGCCATGAA 

CAGGGAGGGCCTGAAGGCTGAGGGCCAGGCTGCCAGTCCCACCTGTAGGAGTGGGAACTG 

GTGCCTTTTCCAGGCCAGTAAGTATGCACTTNCTCCCCCGAGGCCCGTAAAATGCCCCAG 

GCTCAGCCAGAACTGAGCAGAGGGTGGGACGACCAGCAACAGG 

Sequence 2540 

CCGCGGTGGCGGCCGAGGTACTAGTCTCAAAAGCTGGGGACTCTGAGCCTTACCTAGAGT 

CTCAGCAGGTGGACCATTMGATTAACATTTCTAGTAGGTGAGTTCAATCACAAAAATAT 

TTCTTGTTCCATAGATTTTATTGTGGCCATGTCAGTGMCACCCACMGTTTTGCTCAGA 

ATATTTTAGGTGTAAGCTAAATCCCTAAATTGTTCAGAGTTCCCACAGCCCTGTAGCAGC

AGAGCGAGMCTTTAACCAGACTTTTTCAATCCX^AAAGCTAATCTGGAGGCCAACAGTGT 

TCAAAACCTTGGTGACTGAGGAACCATTTAGAGTTTTTTCAGGCTCAGGAATCACATGGG 

TCGTTGTTGGGCTTGGGGGTAAGTTTCACAGGCGATGAAGCTTGACGTTGAGTCACTTGA 

CTTCTGGAGCCATAATTTATTTTCTCCCAGCAACCTCCT 
Sequence 2541

CCGCGGNGGCGGCCGAGGTACTCAATGQAGTTCCTTGGGGGGATCCAAAACCNNGGAGAG 

GAACACAGGCGCTGAGTATGGAGCAGNNTNGAGACCCACCCCACCTGCAGGCCTGACNAG 

AGCGCCNGGATNTCTGGTTCNGATGCCAATACTAGACACCCCAAGNAGNCNGCTCACAGN

MCCNTTACCTNNTTTTNAACGCCCCTGGGCCTCGCNACGCGCACAGAAANGGGANCATA 

ANCNNCNGAAAGGNACCCACTGAAGCCCATTTCTCCGNNGAGNCANCNNGNAANNNNCAG 

CTNNCNCCCCGCTAAGGAAGANANCNAGNGNNGGGGANCCNGGCCCCNCAACCNCCNGNG 

GAACCCAGCAGNNCNGNNANANCCCCNGCNACCCNAGAAGAANNAAGACGGNNAAANACN 

GGGNGGAACNNNNAANAGANAACACACNNNNNAANAGAAAACGCNNAAAGGAAGNNAGNA 

CCNGNCCGGGCGGCCGNNCNAGAACAAGGGANCCCCCGGGCNGAGGAANNNGANACAAGC 

NAANNGAACCGNCX5ACCNNGAGGGGGGGNCCGGACCCAGNNNNGGNCCCNNAANGAGGGN 

AAANGNGCGCNNGGNGAANCANGGGCANAGCNGNNCCNGGGNGAAANGNAANCGNNANAA 

NNNNNANAAAAANNANCCGGAGCAAAANGGGAAAG 

Sequence 2542 

ACTATAGGNTTTTATNGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAGGACATTATTTTT 

T CTAT TCATCTAGCACNGAAGCCAAATTTGGGTCTAAAATCTTGTGATGGCAACCAGGAG 

CTTTTAMCTTCCTTCGTAGTGATCTCATTGAAGTTGCAACAAGGTTACAAAAAGGAGGA 

CTTGG ATAT GTGGMGAMCATCAGMTTTGMGCCCGNTTCATTTCATTAGAGAAATTG 

AAGGATTTTGGTGAGTGTGTGATTGCCCTTCAGGCCAGTGTCATAAAGAAATTTCTCCAA 

GGCTTCATGGCTCCCAAACAAAAAAAAAAAAAGAAAAAAAAAAAAAAAAGTACAAACTTA 

GAAGAAAATTGGMGATAGAAACANGATAGAAMTGAAMTATTGNCAAGAGTTTCAGAT 

AGAAAATGAAAAACANGCTAAGACMGTTTGGGAGAAGTTTAGAAAGATAGAAAAATNTA 

AANGCCCAAAATTGGGATAAATAGCNCTGAAGAAAAAAAAAAAAAAAAAAAAAA 

Sequence 2543 

CCGGGCAGGTACTAGANNACCTTCCTCGCCACTCTCTCCACATGAGAGAGTCAGCTGCCC 

TTTCTCCTGTGCCTCTGCAGGAAGAACTCTCTTGCATGGCACATCTCAGCTCCTCATTGA

GGGATAGTTTTCTTTGATMGAMCCTGGAGTCCATTTACTCTGACCTCTCTTTAAATCT 

ATAT CCAG AGCCACTAGCCCAGGAAAAACTTGGGTGAC(X)GTAATTTCTCTTCTCCTGCT 

GTCCTTTTGCTCTTACGCCCCACCCCMCTCCCCTTAMTTTTACAGGCTTATGACAGTT 

TGTATGTGCTCAGCCAATGAGCAGAAAACCTGGAAAGAATTTCTGGACTTTAGCCCACCA 

GTTTGTCTGGTTGACTMCCTGCTGAGAGCTAAAAATTGGCACCCATTGCCCCGTGCCTT 

CAGGCAGTCTCCTGGGGCAGAGTATGCCACCATCCGAATATCAGGCACTGAGTGGGGATG 

TGGGTG ATGCTCAACATGACTGGCTAGAGCTTTGGGGGTGGGNNGGGGGTTAAACTACTA 

Mini I GGCCATGANCTNTTTCCCCTTCC I I I I I I I I 1 1 AATTAAATAAANGGNTCAAN 

TAAATANTTCAAGGCCTGCCTTTNAAAANAA AT1 I I I I I I I A 

Sequence 2544 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTATTTCA 

TATATTGTGTGAGCCCCACAAATGTCTATTTTAAAAAGAGTATAGTCCCTGGCCAGGCGC 

GGTGGCTCACGCCTGTAATCCCAGCAGTTTGGGAGGCCGAGGTGGGCGGATCACCTGAGG 

TCTGGAGTTCGAGACCAGCCTGACCAATATGGTGAAACCCCGTTTCTACTAAAAATACAA 

AATTAGCTGGGCATTGGGGGAGCATGCCTTGTAATCCCAGCTACTCGGGNGGGTTGNGGN 

GNGGAA MNAN CTTTGMCCCCCCNANGGCCAAAGGTTTTTTATTTGGGGCCCAAAAAAA 

ACCNCCCTTTTGCCCCTTCANCCCNNGGGGNNAANAAAAAAGGNGGAAACCNTTCCNCTN 

CCCCCAAAAAAAAAAA I I I I I I I I AAAAAAMTTTTAAAAATTTCNAAATTTAAA 

Sequence 2545 

ACTTAG G GCGAA TTGGAGCTCCCCGCGGTGGCGGCCGAGGTA CI II I I 1 1 1 C I I I » I II I 

I i I I n AG I I I I I GAAATCCCGTCTCTACTAAAAATACAAAAAAAAAAAAAAAAAAAAAT 

TGGCCGGGCATGGNGGCGCATGCCTGTAATCCCAGCTACTCAGGAGGCTGAGGCAGGATA 

ATTGCTTGAACCCGGGAGGNGGAGGTTGCAGTGAGCCGANATCGTGCCACTGCACTCCAG 

CCTGGGCAACANAGTGAGACTTTNTNTCGGAAAAAAAAAAAAAAAGATCTGGNGGGTGAA 

AATMCCNGAATGAAAATAGCTTGAAAACTCACANGNGGGMGCTCCCTT7TACCCTTTT 






TNTTCC CTTGGCCNGATGGMTCCNNNCCCTTTTTTANAAAAAAAAGGGAAAGNCCNCTT 
mm l NAAAAMNANTGGTTNTNAANATTTTNCNCNCCCCGGGGGGGGGGGNGTTATTA 
AAAAAMNGGGGNCCCCCCCCGGGGNGGGGAAMTNNNTTNAMTTTNTTTNCCCCCCCC 
CCCCCNGGGG 
Sequence 2546 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCTCCATTGTTTCTAGGAAGTAACTAACTTGCT 

TTTGATT TTACA GGCTTGTAGGTGGMTGGGCTTGCCTTGTCTCAGATGAGACTTTGGAC 

TGTGGA CTTTT GAGTTAATGCTGAMTGAGTTGAGACTTTGGGGGACTATTGGGAAGGCA 

TGATTGGTTTTGAAATGTGAAGATAGGAGATTTGGGAGGGACCGGGGTGAAATGATATGG 

TTTCGTCCTGTGTCCCTCACCCMGTCTCATGTTGCAAGTCCCACAATTCCCACGTATTG 

TTAGAGGTGATTGMTMTGGGGGTGGGTCTTTCCTGTGCTGTTCTTGGGATAGNGAATG 

GGTCTCATGAGATCTGATGGTTTTAAAMCGGGAGTTTNTIsn"GCACMGCTNTCTCTTT^ 

NCTGCTGCCATCCATGTAANATGTGACTGGC 

Sequence 2547 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGACCATGACACCA 

AGCATTCCTGTCC CCTCC CCCGCAATGGCAATCAAGTGTCACAGAAAAATAGCCTTTTAA 

MTTACCATGAGCCTTTTATTTTTATTTATTTTTACACCCAGGCTGGAGTGCAGTGG 

GATCACGGCTCACTGTAGCCTCAAACTCCTGGGCTCAAGCAATCCTCCTGCCTTGGCCTC 

CCAMGTTGGNTNAGGGTTGATGAGCCACTGTGCTCAGCCMCCATGAGGTTTTTAGAGA 

TGATCTTGT^AMCCCTTCTCGCCTGTTTTGCAGAAAAGAAAATTGAGATCCCAGAGAA 

ATGAAGTAACTTGCCCAAGGTCATTCAGCAGGCAAGATAGAACTAGATCCCCAAATTGCA 

AACTAGCTGNCCAGAGTTCTTTCCTCAATGAGCAATTTAAAAGC 

Sequence 2548 

CCGCGGTGGCGGCCGAGGTACTAACTGTCTGGGATCGTAGTCGATTAAACAGAGCCACCT 

TTGTTCCTGA GGCM TGCATAAGTCAGCATTTTTCMTGACTGCTTCTTTTTGGMGGTT 

TGGAGATGACTTTTATCCGCTTGCTGAGGAACACACCAATGTCATCACTGTTGCCATAGA 

ACATCmACAGACAACAT GAAG TGCTTTCGCTTGTCTGAGTCAGATATATACAATGGTT 

TGGCTGGTGCCATMGTTCTTTTC^^ 

ATTTCCTATN CCAATAAATG CACACNGGTGAGACTCTTGTTCAAAAACAACX)ATCGCGGT 
CCATTTGGTC 1 1 1 1 I I I I 1TC TTCCATTCNCTGGCCATAAGATATCCAAGGGGGGNGGGC 
AAAAAACTGGGAGTTATTTTGTATGCCAAAAAGACACAGCCAAGAGGACTTGNGGATCAT 
GCCCCC 
Sequence 2549 

ACTTAGGGCG AATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTAC i ITTTTTTTT 
I II II I I I I 1 rGG CATAAAAAGTGCACMGTTTGTCCTATGTGTTAAAACATCTTAATTT 
GGATGTATTTTTATCCAAAATGATGCATACTTGGATGCATTAAAAGCACACCAACAAAAA 
TNCNCNCTNTAACAATAANG GGNNNNT TTNNAANGGNGNCGGGGAGNGGNAACAANATTT 
NGGTANANGGGANNTTNTTNTTTTTTTC^ 

AAn i i i i iCCCCNNCANATTTAATTTGAAAATNTGAAAGNTATNGGAAGGACANNCNCC 
AATTGGMCACNNTGTGCAAAAGTTCNANMCNAMNAAMGATNNTTTTCTNGGTTNGG 
GCNCTNANNCCNTGNAATTCAANATTTGGGGNNGGNNGGNNCCCCN I 111111 I GGGGGG 
GNGGGGGNTTTCCCNCCCCCCCCCCCCCANNNNGGGGGGAAANCN I I 1 1 II I I IAANNAA 
AAAAAA 
Sequence 2550 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGGTGCTGAAATC 

CGGCATGTTCTTGTCACACTGGGTGAGAAGATGACAGAGGAAGAAGTAGAGATGCTGGTG 

GCAGGGCATGAGGACAGCAATGGTTGTATCAACTATGAAGCGTTTGTGAGGCATNTCCTG 

TCGG GGTG ACGGCCCCNTGGGNGGACNNCCCCCANNGGNCNTAAANGGGTNANAACCNTT 

CCNGTTTTCCCCAAANGCCCGNCCCCTTTTCC^^ 

AAANGTTNCCCTAGGNTTTNTTGTCNCANNMNTTTCCCATTTTGTTTNTNTGGGANGAT 
GTTTNGCCCGTCANNTTCCACCAAATAANANTTNCTTTTTGGNAAAAAAATNNTAAANTN 
NNNNCNTNNNNNNAAAA NCCCCC CCCCCCCCCTATNAAAANTNAGGGCCCCCCCCGGNNN 
GNAAMM TMNANTAAATTTTTTNCCCCCCCCNCNCCNGGGGGGGNGGCCCCCCCCNCT 
1 1 1 1 I I I I 1 1 I NTAANNNANCCCCCCCCCCC 
Sequence 2551 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGTGTCTCA(XACATCCTGGCTCCA 

GTGTGGATGCAGAGAGAGTGTGACAGAGGATCTGCCTGCGAACCACCTGGGATTAGTCAA 

GTCCCAAGGTGCCCAGAGTGGGACTAGTTCTTCACAGTGTGGCAGCTGCACTAATCTGTT 

TGTGAGGGMTA TCCAT TCCCTCACTCTACTCTCNTCATTATGGGMTTTNTTTTTGTTN 

CAAMTAAANCCCTTTTGTATAGANAAAAAAAAAAAAAAAAAACCNCCCCCCCNNTTNAA 

ANAANGGGCCCCCCCCNGGGGGNGNANNNNTTNNA A 1 1 1 1 1 1 1 1 I NCCCCCCCCCCCNGG 

GGG 

Sequence 2552 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGGGATCTTTTTGGCAN 

GAGCCCCAAGTAAAGCTGTCAATCCTGATGAGGCTGTGGCCATTGGAGCTGCCATTCAGG 

GAGGTGTGTTGGCCGGCGATGTCACGGATGTGCTGCTCCTTGATGTCACTCCCCTGTCTC 

TGGGTATTGAMCTCTAGGAGGTGTTmNCCMCCTTNTTATTGGGATTim 

NNCCCAAAAAGNGCCGGGTTTTTTTNNTCCCCCNTNNNGGGNAACCCNGGGGGGAAAAAA 

A MNGTGGTT CNNGGGNAAAAAAAAMNTNTTTTAAAAAAAAAAANATNTTTTGGNNNNN 

Aum iii fT GG GANAG ANNNCCCCCCCCCCCCCCNGGGGGGGNNTNTNTNAAAGAANN 

TTTNNNTmNCATTTTTNN^^ 

AAAAAAANCNCCCCCCCCGGGNNAANAA^f^lT^NAMTATTTTTTCCCCCCC 
Sequence 2553 

CTTAGGGCGAATTG GAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCi I U I I M I 

I 1 1 I I 1 1 I I I I I I I I AAGACGGAGTCTCXaCTCTGTCGCCCAGGCTGGAGTGCAGTGGCGG 

GATCTCGGCTCAGTTCAAGCTCCGCCTCCCAGGTTCACGCCATTCTCCTGCCTCAGCCTC 

CCGAGCAGCTGGGACTACAGGCTCCCATCACCACGCTCGGCTAAGTTTTTGTAATTTTAG 

TAGAGACAGGGTTTCACCATGTTAGCCAGGATGGTCTCGATCTCCTGACCTCGNGATCTG 

CCCGC CTCGG CCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCGTGCCAGGCCAATA 

TGAMGTTTTTMCTTATTGGCACMNMGTTTTCATGMCCTMCMTATTTMTTAAC 

AAGTATTCTTCAATAACATTCCCTT 

Sequence 2554 

CACTACTTAG GGCGMTTGGAGCTCCCCGCGGTGGCGGCXDGCCCGGGCAGGTACTTTTTT 

I I I I I I I I I I IGAGACGGAGTTTCACTCTTGTCACCCAGGCTGGAGTGCAATGGCGCAAT 

TAGGGTTCACTGCAACCTCTGCCTCCCGGGTTCAAGCAGTTCTCCTGCCTCAGCCTCCTG 

AGTAGCT GGGA TTACAGGCATCCACCACCGTGCCCAGCTMTTTTTGTATTTTTAGTAGA 

GACGGGGTTTTGCCATGTTGGACAGGTTGATCTCAAACTCCTGACCTCAGGTGATCTACC 

CTCC TCGGCC TCCCAGAGTGTTGGGATTACAGGCATGAGCCACCATGCCAGGCTGCTAAT 

TCTCCI I I 1 1 AGGNGAGTTAGGGGAACTGAGCCTCAAAAAACTTAAACGATTTCTCAAAA 

AACACCTCAAGTGATMAGTGGCCACATTGNAMGGGAGTTTTTATCTTTTTATTGNNGG 

CCCAGGGGTCATTGGACAAMTCATGCTACCTNTTGGATTTTAAAATATTCAATTGGCAA 

A 

Sequence 2555 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGACTCAGAGGC 

CGCCATCAACCCGCCAGATCAACCTGGAGCTCTACGCCTCCTACGTTTACCTGTCCATGT 

CTTACTACTTTGACCGCGATGATGTGGCTTTGMGMCTTTGCCAMTACTTTCTTCACC 

AATCTCATGAGGAGAGGGAACATGCTGAGAAACTGATGAAGCTGCAAAACCAACGNNGGN 

GGCCGMTCTTTCTTCAGGATATCAAGAAACCNAACTTGTNATTACTGGGAAAAGCGGGC 

TTAATCMGGGGGGTGGGCCTTTANNTTTTGGNAAAAAMGGGNGMTTNATTTCTNTTT 

TTGGGAACAAGNAAAAACCTGGGCCAAAAAAAAA 

Sequence 2556 

CACTACTATAGGGCGAATTGGAGCTCACCGCGGTGGCGGCCGCCCGGGCAGGTACTTTTT 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 INAGACAGGGTCTCCCTNTGTCACCCAGGCTGGAGTGCAGT 
GGCACAATCATGGCTCACGGCAACCTCGACCCCCTGGGTCCAAGTGATCCTTCCACCTCA 
GCATT CCACAAGATGATGGAACCACAGGCATGCACTACTATGCCTGGCTAATTCTTTTAT 
. TTTTTGNGCAGACAGAGGTCTCCCTATGTTGCCCAGGCTGGTNTTGAACCCCTGGGCTNA 
AGCTATCCTCCCGCCTTGGCCTCCCAMGNGTTGAATTANNNGGAATGAGCCACTTTTTG 
GGCCTNGGCCTCNANTTMTTTNAAAANGGGNGTrrGNTTTNAACCNCCGCCCCTTTAAA 
AAAMGGGGCACCCCCXXIXaGGNGNGGGGNMTTTTNMNNNNAATTTTTTCCCCCCCCCC 
CCCCCGGGGGGGGGGGNGCCCCCCCCC CI 1 1 1 1 1 1 1 1 1 1 1 1 1 A ANNAAAAAAAAANCCCC 
CCCCCCCAAAAAAAAAAAA 
Sequence 2557 

ACTNCTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACTTCTTGTCAAATT 

CCTTCTTGATATGAGCCGCAATGCCCCTCTCTATGTTGTATTTCTCCAGCGCCTGAGTAG 

CGCACTCCACCGAGTCCTGTTGCATCTCTTCCGACATGTCCGCATTTTTGATCACGGCCT 

TTCGGTCGCCCCGCGTACCTGCC 

Sequence 2558 

CTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACTACTGCTGAGGTCTNCAG 
GACAGMGNCACCTCCTNTGGTAGMCATNCAT(XX;TGGNCCTTNTCAGNCCACAGTTTT 
GCCAGAAATATCCACANGAACAATGACAAGGCTTTTYGCCT 
Sequence 2559 

CCGGGCAGGTAC I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 AGGGGACACGGTCTATCCGCAA 

GCTGNGGATTCTGCCATTCTCATCATAGTTTTGAAGCCGGGGTTCTGAAATCTCATGACA 

GATCTCCAGCTTGAACTCACGGAATGGTTTTTCTAGGCCCTGCTCATAATACAGCTGCAG 

GTAACCAGNGTCTGTCAGTTTGACGAANATCGGTCCCCAGTGCCTGGAGGACATGATGTT 

TTTNTTCTCAGGGATCCTCAACATNATTGGCCACCCGTCACGAGGCTGGGACCGTGCTGA 

TCCAGGCGGGTGAGCATNTAGTTCAATCCAGGCTACTGGGTCATCATCAGGTAGAGTTGC 

ACTGCCAAAGTG ATCAG GGTCATTMTTTGGAGTTGTTTGAGTTTTTCAACAGCATCAGA 

AGTGNGACTGGGTTTTGCTTTGAATCATCNAAACTGGATGGCATCCTTGGNAGATGACAA 

TGAAGGGAATNTTNT7TGGCTTT 

Sequence 2560 

CCGCGGTGGCGGCCGCCCGGCAGGTACTGGGCCGGGTGCAGTGGTTCACGCCTGTAATCC 

CAGCACTTTGGGAGGCCGAGGCAGGCAGATCACGAGGTCAGGAGATCGAGACCATCCTGG 

CTAACACGGTGAAACCCTGTCTNTACTAAAATTACAAAAANTTANCCGAGCNTGGNNATG 

GNNGCTTGNATNCCAANTTNTTCGGAAGCTTAGCNAGAAAATNGCTNAACCTTGGANGNG 

GAG CTTTGAN TGAGCCAAAATCCCNCCNTTGNACTCANCCTTGGGGGAAAAAACCNGNAA 

NTCCi l ll I lAAAAAAAAAAAAAAAAAAAAAAAGGGNCCNNTCGGCCCTNTTTTAAANNN 

TTGGGGACNCCCCCCGNCNNNNNGGNA A 1 1 1 1 1 1 1 1 NAAACTTTTTTTNTNCCCCCCCCC 

CCCCCNGGG 

Sequence 2561 

NCTACTTAGGGCG AATTGG AGCTCCCCGCGGTGGCGGCCGAGGTA CI 1 1 1 I I I 1 1 1 1 I 1 1 
TTGATTGTTATtTAGTTTrTATTTCATAATCATAAACTTAACTCTGCAATCCAGCTAGGC 
ATGGGAGGGAACAAGGAAAACATGGAACCCAAAGGGAACTGCAGCGAGAGCACAAAGATT 
CTAGGNTCCTGCGACCAANGGGGGGGNGGGNGCCNNNNNNNCCCNAAAAAAAAAANNTTN 
GGGGGTT NAAAAAAANCCCCNTNN 1 1 1 1 1 1 1 1 1 1 1 GNCCCNCCCCCANCCCNGGGGGGGG 

"i GNGGGG GGGNTTTG CCNATNNCCGGNANCCCAAAGCGCTCATGCCNCANAAA 

NAANANTC CGNGC CI I 1 1 1 1 1 ICNCTTGNGNCAAACCNCCANTGNGCTTTNGACNNCCAC 
CCN CTTCAI I I I IGGGGGGGGGCAAAAAAAAAAANCCCGGGCNCNCTTTNGGGGGGGGGN 

CCCNrrTTTTTTNNAAAAAAAAAANANNCCCCCCCCCCCTTTTTTTGNGANNAGGATTTTT 
TTTTT 

Sequence 2562 

ACTTAGGGCGAATTG GAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTAC I I 1 1 1 II I 1 1 
i m i ii 1 1 i 1 1 I I 1 1 IAGGGGACACGGTCTATCCGCAAGCTGGGGATTCTGCCATTCTCA 
TCATA GTTTTGA AGCCGGGGTTCTGAAATCTCATGACAGATCTCCAGCTTGAACTCACGG 
AANGGTTTTTTTAGGCCNTGNTMTMNNNAGNTGCAG 

NNCAAAAANCGNNCCCMNGNCTGGAGGACANNNTNNTTTTNTTTTNAAGGATNCTTAAC 
ANTATTGGCCNCCCGTAACMGGGTGGGACCGNCCTTATCCAGGCGGGNGGGCTTTTTAG 
TTCAATCCCGGCTCTGGGGNANTANTNAGNAAAAGGTNCCTTCCAAAGGAANNGGGGCAA 
NMTTGGGNGGTNNTTNGGNGTTTTTCMCCCNNANAAMGGGGGCCGGGGGGGTTTTTT 
TTAAAANAAAAAAAAATGGGGGCNTNTTGGAAAAAANAAAAAAGGGAANN I I I 11 1 I I 
Sequence 2563 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCAGGTGGAGAGAGGGTGGAAAGTGA 

GCAGCGGGCTGGGCTGGAGCCGCACACGCTCTCCTCCCATGTTAAATAGCACCTTTAGAA 

MATTCACAAGTCCCCATCCACAAAAAAAAAAAAAAAGAAAAAGTACTATTCTAGTCTTA 

AGAATAAAGACCNTGTTCAGCCAGGCGNGGGGGTTGNCNCNTGTANTCCNACNCNTTNGG 

GNGNTTGGGGGGGCAAAATCCGGGGGTCAGGGCATTCGAGACCAGCANCCCGGGGTCAGG 

CTTTCGAGACCCAGCCTGGCCAACATGGTGAAACCCTGTCTNTACTAAAAATACAAAAAT 

TAGTTCGGCATGGGTGGTGCACACCTGGTAATCCTAGCTACTCAGGAAGGTGAGGCAGGA 

GAATCACTTGACCTAGGAGGCGGAGACTGCAGTGAGCCAAGATTATGCCAATGCACTCCA 

GTATGAGTGACAGAGCAAGACTTGTCTCCAAAAAAAAAAAAAAAAA 

Sequence 2564 

ACT ACTTA GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGGGAGCTC 

GGATTTTAAGGCAGT AGCC TGCTGATGCTCCCAGCTGAATAAAGCCCTTCCTTCTACAAT 

TTGGTGTCTGAGGGGTTTTGTCTGCGGCTCGTCCTGCTACATTTCTTGGTTCCCTGACCA 

GGAAACGAGGTAACTGATGGACAGCCGAGGCAGCCCCTTAGGCGGTTAGGCCTCCCTTGN 

GGAGCATCCTTGAGGCGGTNTCCCCCCCCCCGGGGCCCCCTTNAAAAANANNCNCCCCGN 

AAAAAAANACCCCCGGGGGNNNCCCCNANNNMCGGNGGGGNCCCTTTTTTGGGGNNNAA 

NAAAANNCCNNCCCCCACCNCAAANAANNACTN 1 1 I I I I I 1 1 1 CNCNTNNTNNAAAAAAA 

AAAAAAAAAAAAAAAAAAAAANNTTTCCNCGGGGGGGGGNTTTIS1NTAAAAAAAAAANNGN 

GCCCCCCCGNNNGGGNGAMN^f^TTTTNAAAATTTTTTNCCCCCCCCCCCCCCGGG 

Sequence 2565 

GCGAATNGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTG I M 1 1 1 II 1 M 1 1 1 I II I II 
GCTGGTAGGAAATGCTTTAATAAAAATGCAATCTCTAAGGGGCCATGGCATCATTAAAAG 
AAAGGATGTCATGCCCAGATCCANAACTTGAAGGTGGCNGGCACCAGCAAGCACCATANT 
TTTGMTNGGCCTTNCTTTNCAGGGTCCTTAN^ 

CTTGAAAAMTGGANNAACNTGTTCNCCNCCCTTGGGmrsrTTAGTNGGGAGGGMCTTT 
TNGTCCANCTAAAAAATNTTGNNCGNGGGCCCANTTTNGGG 

TTGG CCCCNNGGGGGGGCNGGGGGGGNNNCCCCTTTGTTNNNCNNNNTANGNNNAAAANT 

TTTTNNNTCNGGGGNAAAAAAAAAAAAA 

Sequence 2566 

NCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGAAGAAGAGG 

AAGAAGAAGAAGCAGAAGGAATGGAAAGCCTGGAGAAAGAGGATGAAATGACGGATGAAG 

CAGTTGGAGACTCTGCTGAGAAGCCTCCTTCTACTTTTGCCTCACCTGAGACTGCTCCAG 

AAGTGGAGACCAGCAGAACTCCACCAGNTTGTGAACCCCCAACCCTTCAATCAAGAAAAG 

ACCTTTGATCAGGAGAAGACTTCTCGTTTCATT7TGGGGACACATTCAGGA7TTCTTCAA 

MGCAGTGAGTMCATTGMCCCCTCTTGCTTGCTATTCTGGTCTNTCTTCTNATANAAA 

TTGGAANMMCCCCCGGNCCCTTGGAGCCTTTTAMTGCCAMNGNNNGCNCTTTNAAA 

ANTTAGGGNANCCCCCCGGGCGGGGGGGNAJ Mill) I AAMNTTTTTTTCNCCCCCCCC 

CCCNTGGGGGGGGGGCCCCCCCCCNN 1 1 1 1 1 I 1 1 1 1 1 1 NTTNNAAGNNNGNGNANANNCN 

CCCCNNGGGNAAAAAAAAAAA 

Sequence 2567 

CTATA GGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTNCTTTTTTT 

TTTTTNTNGNGGACTTGGGCTT CTCTGC CCATATTGCANTGTTGATGTTCCCAGAGTTCT 

ATCCTTACTCTAMNGATCTCCATTTTTGAGCTTATCCACACAGNGGACTGTGGNTTNCT 
GGTCTTNGNNCTTNTGCCTAGGACNTNCTTTCCTAGGTANCCACATGCCTGCTCTCATTA 

TNTTCATGTCTTMCTCANAGGNCAACTTTGANTTAANGCTTTCTATNACTACATTNAAN 

ANNGTGNGACCTTTCNTGTCTNCCCANCTNGCCTGATNTTTNCTTNTCCATTGCACTTTN 

CTTNTGACTTCT 

Sequence 2568 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGGATT 

TCCGNTGAGTGACCCTTACAAGTCCTTCTTGATCCTGAACTGGGTTAGGTGCCGCTGNTG 

CTGCTCGTGTTGAATCTAGAACCGTANCCAGACATGGNACTGGAGGACGAGCAAAAGATG 

CTTACCNAATCCGGAGATCCTGANGAGGANGAAGAGGAAGAGGAGGAATTAGTGGATCCC 

CTAACAACAGNGAGAAAGCAATGCNAGCAGTTGNAGAAATGTGNAAAGGCCCGGNAGCGG 

TTAGAGCTCTGTGATGAGCGTGTNTCCTCTCGATCACATACAG 

Sequence 2569 

CCGCGGTGGCGGCCGAGGTACATTATGTTTTCGTTTTCATTCATCTCMTAGTTTCTTCT 

TGGAGATTTGGGGGGAAAATGATAGACAGGAGGCAGGACTAGATTGCAGCCCTCACCCGG 

ACAGACAGCAGCTCACAGGGACTCGCATCATGAACTTTGGCTCCAGAACTACTGCAGGAA 

TATACCAGGAAAGCCAAGAGAATCCACAGACCCTCTGAAGGAAGCAGATTGCTCCTTCAG 

GACCCAAGAGACACCCTAAATACTGTGTTGGTATCGTTGGCAGAGAAACCTCAAGACGGT 

TCACATTACAGGACTCTGTGCAGACAACCCTCGGTACCTGCCC 

Sequence 2570 

AGGTACTACAAGAACATCGGTCTGGGCTTCAAGACACCCAAGGAGGTGCGGGGAACCTCA 

GAAGAAAGAAGGGGAACCTGGCGTTCCTGCACGTGTGCCCACGACGAGTTGCCCTGCCTG 

CATCT AAGT GGCTTCTGGGGCTGCTGGGAATTGTAGTTGCTTCCCTGAGGCCACGCCCCT 

GGCTNTTTTAAGGMCCGCCCGCCCAAGGCTCACTCCTTTATCTTTCCTATCCTTTCAGG 

CTATTGAGGGCACCTACATTGACAAGAAATGCCCCTTCACTGGTAATGTGTCCATTCGAG 

GGCGGATCCTCTCTGGTAAGTGCGGGAGTTACTGGTGTNTGGGGCCTGAAATACTGAAAG 

AAGGGTCTTGGGGCCCA 

Sequence 2571 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCNGGCCGGGTACCCCCATNACAG 

NNGNCAANCNGNCTGAGGAAGTGGCNNCTACCANGCAGGNGATCTTNCNGGAGCGGTTGG 

CNGCAGNGCCNGAGTTNCACCGGTNTTGNGCCCCTGTTCAAGTNCTNACCTGANCCCATG 

GCCCTCAC CGNGT CANAGACGGAGTNTGTNATNCGNCTGCACCAANCACACNTTCNNCAA 

CCACATGGTTTTTNANNTNGNCTGCACAAACNCACTNAATGACCNTACCTTGGANAATGT 

NNCAGTGCNNATGGAGCCCANTGNGGCCTATGAGGTGCTCTGTTACGTGCCTGNCCGGAG 

CCTGCCCTACNANCANCCCGGGACCTGCTACACACTGGTGGCACTGCCCAAAGAAG 

Sequence 2572 

TTAGGGCGAATTGGAGCTCX)CCGCGGTGGCGGCCGNNGNGCCCQGGACGCACAGNAAANN 

CNTGTNTTNGTNGGNNTNTCTATNAAAAAGGCAATCAAGAAAGATAATGTGAAAAAGANA 

GGAATTNATAGGTCGGGAANANATGAATGTCAAGACATTTGAAGAACTATAGTAAAATGA 

TCAACACTAAATATACTNAGAGAAANCTTTGTTAATATGCCAATGAGGTNGGCCTGATCT 

TTGAMTAGTGAATAGGAATNCAATGCATTTCCTCAGTGATCACTGATTANGAATGAGTT 

GGTNNGGATCCTTGGGA 

Sequence 2573 

TAGG GCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAGGAAGCCTGGTTGAAGTC 

TTTTCCTTGAGAGGCCTGGATTTGTTCACATATTCAGATTTCMCTGGCTTTATAACAGG 

TTTCCT TATGT GCCTCATATTGGCTCCAGGTTCACTGGGTTTAAAAGGAGCCAAAGCACC 

ATMGGTTTTGGCAAAGGAAGAGTGGCATCTGCTTCTGGGATGTAGGCATTCGGACGCTG 

CTCT GCAG TTGTGTAAACACCATTGGGCAGCTGGCGATTATCTGCTTCCAAGAACTCCTG 

AGACTTTTCAGCGATGGCCAAGGCGTGCATTTCTTCATCTCGAAGGACTTTCAACCATTC 

TTTCTCAATTTCCTTATTGAGTGGCAGACCTTTTTCTA 

Sequence 2574 

TGCCGGTGGCGCCCCGCAGTCCGCTTGTCCGTCCTTNCTCTCTGACTGNGGTACNNCGGG 
GCCTGTCTCTGCAGAGGTCANGGGAGGCNGGGGCCCCAGCACACGTTCCNCAGTGGCAGC 

GNGGAGCGGCAGCTTACGGGTTCGCGGGAGCCCCCGACCCCCCAAGGGCTAGAGGAGCNC 

TCGGCGGACCAAAGAAAGCCCGCGAAGCGGNTGCGCGCCNACCAATAGGTGCGGGGCTCG 

GAGCCNNACAAACTTGCGCCGTACCTCCNTCCGCCCCCGCCCCGNCNCCNCCGCCGCCGC 

CGACTCCCCCTACTCCTGCTCCTGCCTTGGCTCCTCCGCCNAACGTNTCGCACTCCGAGA 

GCCGNAGNGGCAGCGGNNCCGTCCTGCCTGCAAAGAG 

Sequence 2575 

CCACGCGTCCGTTCCCAGNCAGTTCCATCCAAGGGAGMTTAAGTAGAGAAATTTGTCTG 

CAATCTCANNGGNNNGACAAATCTACGACACCAGGAGGAACAGGAATTAAGCCTTTCCTG 

GAACGCTTTGGAGAGCGTTGTCAAGAACATAGCAAAGAAAGTCCAGCTCGTAGCACACCC 

CACAGAACCCCCATTATTACTCCAAATACAAAGGCCATCCAAGAAAGATTATTCAAGCAA 

GACACATC7TCATCTACTACCCATTTAGCACAACAGCTCAAGCAGGAACGTCAAAAAGAA 

CTAGCATGTCTTCGTGGCCGATTTGACAAGGGCAATATATGGAGTGCAGAAAAAGGCGGA 

AACTCAAAAAGCAAACAACTAGAAACCAAACAGGAAACTCACTGTCAGAGCACTCCCCTC 

AAAAAACACCAA 

Sequence 2576 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAGCCTCCTCTATTCCTCCCA 

ATCTCAAGGTTACTCTTAAATACTAGTAAATGCAAAAAGAACTTGTAAAGTGGCAAGGCA 

TGGCCTATCAAAAGTCAGCCCAAGGGCAGTTTTCAGCCCTGCCTCACCTGGGTCTAGTTC 

AGCTGACGGATGAGCTGATTGATGCGTTCACCCCGATAGCCAGGTGTGCCCATCTCCTTG 

AGGMGCCCACTCTATTTTTGGTAGCATGACGGGCCACTGAGAGGTGGAAAGGGCACAAG 

AACCATGAGATCTCCTGGAAATGCTTCCCTGGGAAGGCAATTTCATGAATGAGGTCTTCC 

AAGCAAATGACGCCAAACTTCCCCAGGTGCTCCTCAATCACTGTACCTGCCCGGGCGGC 

Sequence 2577 

AGGTACAGAGTCTTTTGCTTCCTCCCACCCCTAGGGGGAAAAACTGCTTTGTGCTTTGGG 

AAGTTGTCTCTGAAACCCGGGGACAGAGGACGCAGGACAGACTAGGAGGGAGCCGGGAGC 

TGCCCGGCGGGTCATGGGAATAACGCCGCCGCATCGCCCGGTCGGCATCGT7TATGGTCG 

GAACTACGACGGTATCTGATCGTCTTCGAACCTCCGACTTTCGTTCTTGATTAATGAAAA 

CATTCTTGGCAAATGCmCGCTCTGGTCCCGTCTTGCCGCCGGTCCAAGGAATTTCACC 

TTCTAGCGGGCGCAATACGAAATGCCCCCCGGCCGCTCTAGAAACTAGTGGGATCCCCCG 

GGGCCTGCAGGGAAATTTCGGATATTCAAAGCCTTATTCGAATACCCGTCGACCTCGAGG 

G 

Sequence 2578 

CCGCGGTGGCGGCCGCGCGGGCAGGTACCCCGAGTCCAGCGGAGACAAAGGAGTTAGAAA 

GAGACAGAATAAGAGTTTAAAAGGCAGGTCCAGGGGACCGGAGCGTTGGAGGCTTGCTCA 

TGGCCCAGAGCTCTTTGGCTCCGCCCAATTTATTGATTTACMGCTCTTTGTTCTTAGGG 

CAGATGGGAGGGGTAGGAAGGGATGAGGAAAAGGATTAATCAGTGAAGGAGAACTCGTGA 

GTCATTCMTAATATGTATAGTAGTGGTGGTTTCTGTGAATTTCCTTGAGTAAAGGCGTG 

TGTCTAAACTACTCAAGATCTTTAACTTATCGGNATTGAAATGGATGGG 

Sequence 2579 

AGGTACGCGGGATAGTGAMCCCCGTCTTTACTMTTTTTTGTATGm 

GGTTTCACCGTGTTGGCCAATATGGTCTCGATCTCCTGGGTCATGCCTGTGGNCCCAGCA 

CTGTGGGAGGCTGANGCAGGAGGATCATNTTGAGGCCAAAGAGTTCGGGATCAGCCTGGG 

CMCATAGTGATACCCTATCTCTTAAAAMGAAGMGTTTTTAAATTTGAAATAATAANA 

GGTACCTTGCCCGGGCCGGCCGCCTCTAGAAACTAGTGGGATCNCCCCCGGGCTGCAAGG 

GAATTTCGATATCAAGCCTTATCGGATACCGTCCGACCTTCGGAGGG 

Sequence 2580 

CCGCGGTGGCGGCCGGGCAGGTACGCGGGATGATTGMTTTTGTTTCCGCCTAAAATAGT 
AATCTATMGATATAMCTCGAGTTAGGGTTTACATTTTTTACTTATGAACACAGGGCAC 
TAGG GCCAC TTCAGTCTMTTTCCTGCTTTTTAATTACTTTAACACTCCACAGGAGGAGG 
ACTGGTTTTCCTCTGTGACTTCCTAATGTATGGCAAGCAGGACTTCTTCTAATCCACTAC 
CCTCTTCCCCTAGCTTMCTAAGGCTTGCAGTAAMTTATAMTTTCCACTTTCTTTCCT 

ACATTCTCAAATGTAGGAMTGAGGACAMCAACTTCCTCTCTCCMTTTACAACACTAT 

CMCTATTTGTCCTTTATTGTGCATTTCAGACACAGGTGTTTTMTTGNTAATCATGTTT 

TTACACTGCAGTGGATGGCAGGTTTT 

Sequence 2581 

CCGGGCAGGTAC 1 1 I I I 1 1 1 I I I I I I GAGATGGAATTTTGCTCTTTTGCCCAGTCTGGAG 

TGCAGTGGCATGNNCTCAGCTCACTGCAACCTCCACCCACTGGGTTCAAGCGATTCTTNT 

GCCTTAGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCCGTGCCTGGCCCTGTCT 

CTCTTMGAGTAGGTTCATTGTCTGTCTTAGAGTCACTTCTATTGCMCTCATTTTCTTT 

TTCCAGGGCAC AGATC GACCMGCTGCCGGTTCCCTATTCTGCAGGGACAGGGACTATTT 

CTAGCATACCTGCTTTCGTCCACCCAGGCAGGGGTTTGGGGGTGGGTCTffTTCTGTGCCT 

GCAGTCCCCCATTTTGACACTTTGGGTTGCNCACCCATTTmGGGANM 

GGGAAATGAAGGCTTCCATTGGG 

Sequence 2582 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTCTTT Cl I 1 1 1 I I 1 1 I M I I I I 

TTTGAGATGGAGTCTCGCTCTATCGCCCAGGCTGGAGTGCAGNGGCGCGATCTTGGCGCA 

CTGCAAGCTCTACCTCCTGGGTTCACACCATTCTCCTGCCTCAGCCTCCCAAGTAGCTGG 

GACTACAGGCGCCTGCCACCATGCCTGGCTAATTTTTAGTAGAGACGGGGTTTCGCAGTG 

TTAGCCAGGAAGGTCTCAATCTCCTGACCTCCTGATCCGCCCGCCTCGGCCTCCCAAAGT 

GCTGGGATTACAGGCGTGAGCCACCGCGCCCAGTTGTGCATTTCTGGTTTCTAAGAATCA 

MCCACTTGGG CTGTTTTTAGAAGTTACTTCCCATGTTATAMGCTGAGGMGCTTTTT^ 

rTTTTTTTTTl IGAGACAGAG TCTTTG TCNCCCAAGCTTGGAGTGCANTGGTGCAATCTT 

AGCTCCCGGGGTTCMGCAATTTTTCTGCCTTAAGNCTTCTGAGTAAGCTAAAAATACAG 

GNGNGCCNCCACCCCGNTNGGCTT A 1 1 1 1 1 

Sequence 2583 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGCCAGGTACGCGGGGGCACTCAG 

GGAGCTCAGATTTTGAGACAGTNGCTGGCCGATGCTCCCAGNTGAATAAAGCCCTTCCTT 

CTACNAAAAAAGAAANNGAAAAAAGAANACAGGATATCTGAAATTAAGACNGCNGATGGA 

GNNGTTTCTNNAAATGACAGGGNCCAAGGNGNGACCACGGGACCAAGNGGCTGAACTGGN 

ATGAAGTTAAGAAGCAGNAANAAACATCCNATAATATGGTGATCAGNTCAACAGAATGAC 

ATATTNACCATGTNCCCNAGGAGGNGATGACTGAGATTTCAAATT 

Sequence 2584 

ttagggcgaattggagctccccgcggtggcggccgaggtacgctggatagcctcx;aggcc 
agaaagagagagtaagcgcgagcacagctaaggccacggagcgagacatctnggcccgaa 
tgctgtcagc1tcaggaatgcccccccgcgtacttttttt 

Sequence 2585 

CGGCTGCGGCGAGCCGGTATCAGCCTNACTCAAAGGGCGGGTAATACCGGGTTATCCACA 

AGAATCAGGGGAATAACCGCAAGGAAAAGAAACATGTGGAGCCNAAAAGGCCAGCAAAAG 

GCCAGGGAAACCCGTTNAAAAAGGCCCGCGGTTGCTTGGGCGTTTTTTCCAATAGGGCTT 

NCGNCCCCC 

Sequence 2586 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGGATT 

TCCGCTGAGTGACCCTTACMGTCCTTCTTGATCCTGAACTGGGTTAGGTGCCGCTGTTG 

CTGCTCGTGTTGGATCTAGAACCGTAGCCAGACATGGGACTGGAGGACGAGCAAAAGATG 

CTTACCGAATCCCGGAGATCCTGAGGAGGAGGAAGAGGAAGAGGAGGAATTAGTGGATCC 

CCTAACAACAGTGAGAGAGCAATGCGAGCAGTTGGAGAAATGTGTAA 

Sequence 2587 

CTATAG GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC I I I I I 1 1 I I I I I I I I I I 
I I M I I NGGGTATGAAAACTTAGGGACTAAAATTAATATAAAAATTGGCATAATGTTGGA 
TTGMTCTA CATrTTG GCAGAAGTTAAACATTCCCACATAATGTCAAAATTATACATCAT 
GCAGTTCTGTTTTTTTGTTTGTTTTATTTTGTTTTGTT^ 
AGGCTGGAGTGCAGTGGCGTGATCTGCAACCTCTGCCCCCCGGGTTCAAGCGATTCTCCT 

GCCTNAGCCTCCCGAGTAGCTGAGATTACAGGTGCGCGCCACCACAC1TGGCTMTTTTT 

GTATTATTAGTAGAGACGGGGTTTCGGCATGTTGGCTAGGCCGGTCTCTCCTGACCTCAG 

GGGGATCAGCCC 

Sequence 2588 

TAGGGCGAATTGGAGCTCCCCGCGGNGGCGGNCGNNGNACANAATAANGCCTGTCACATA 

TTMGTNTGTMTMCGCATTTATTATTACTTATCAGGGTATGATTTATGAATTGNGGAA 

C CTGN GATTATGGGAGAGTCTGGCTTCAATCAAGGGCTGAAATTCCATTTCCACTGACAT 

GTTTTNCTTCCCCATCCCCCGATTCTGTCCTGCAACAGGGTAACAAGAAGGGGCCCTTAG 

GCCGTTGGGACTTTGATACCCAGNAAGAATACAGCGAGTATATGAACAACAAAGAAGCTT 

TNCCCAANGNTGCATTNCAGTATGGTNTCAAAATGTCTGAAGGGCGGAAAACCA 

Sequence 2589 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATAGTGTCGCGAACTCA 

AATCGGCAT7TAGATAGATCCAGTGGTTTAAACGGCACGTTTTTGCCTATAAAAAAAGTG 

C AAAA AAGATGTGGTTTACMGTTAAAGCTACAGAATCCCTTTTTGCTGTAATTGCACCA 

GTT7TAAAGCCTCTGGACAGAGCAGTATTTCGTTTAAMCTTTGTTTTTCTTAAAAGCTT 

ACAGTGTTTGGCTAATTCTCCTCCCCTTTTTACAAGACGGGGGCCGGAGGGTGGACACTG 

GTGGCAGGTTAAGG GATACTG TCACTTTAAGAAGCCTGCAGATTGAAGTGTAAACATGGA 

GAMTTAGGGGCTGATTTTTTAAACTGTGTGAGATATTAACCAGCCGCCCTGTTATAAAA 

TCAGGGAMTCCAMCAGCGATTTACACCCGATTMCACCCCCTTTATATATTTTTTACA 

AAMTACACTGGGGAAMTAATCNAACCGNTTTCATCTCTCTTGGCTTTTTTGGTTTTTT 
AA 

Sequence 2590 

TTAGGGCG AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTTCTTTTTCTTTTTCTT 

I 1 1 1 I I I I I AATGTGAGACAGGATCTCATTCTGTTGCCTAGGCTGGAGTGCNGNGGCGCA 

ATCTCGGCTCACTGCAACCTCTGCCTCCTGGGCTCAAGCAATTCTCCCACCTCAGCCTCC 

CAMTAGCTGGGATCACTGGCACAAACCACCATGCCCAGCTAATTTTGTATTTTTTGTAG 

AGACAGGGTTTCACCATGTTGCCTAGGCTGGTCTCAACCTCCTGGGCTCAAGCAATCCTC 

CTGCCTCGGCCTCCCAAAGTGCTGGGATTACAGATGTGAGCCACCGCATCCAGCCCCACA 

CCCTCATTTATACCAATTACCTGCCCAGTMCTGTGGACTTTTGCTTCCTCACCCCTGCT 

CTGATCTGGAAGGAGAGGGATTATGTTATAGCTTGTCAGCACAAGTCCCAAGTTCAATAT 

TTCTGCGGCAAAAACTTCTTCAAAAAATAAATGTAC 

Sequence 2591 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTCCCTCTTCCCGCTGAAG 

GGAAATTCACAGAAGCTATAGTTGATGCAGAGCCGAAATATCTGATAGTTGNGCGACCTG 

CTCCACCTCCAAGTCAAAAGAAGTCATGTTCAGGTAAAACTCGTTCTCGCAAACCTCTGC 

AGCTGGTGGTTGGCACTCTGACACCGAGCTCGGTCTTCCTGTCATGGGGTTTCCTCATCA 

ACCCACACCATGACTGGACATTGCCAAGTCACTGTCCCAATGACAGATTTTATACAATTC 

GCTATCGAGAAAAGGATAAAGAAAAGAAGTGGATTTTTCAAATCTGTCCAGCCACTGAAA 

CAATTGTGGAAAACCTAAAGCCCAACACAGTTTATGAATTTGGAGTGAAAGACAATGTGG 

MGGTGGMTTTGGAGTAAGATTTTCAATCACAAGACTGTTGTTGGAAGTAAAAAAGTAA 

ATGGGAAAATCCA 

Sequence 2592 

CTATA GGGCG AATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGCTG 

ACTCCTTTTTCGGACTCAGTCTGCCTGCACCCATGTGATTAAAAAGCTTTATTGNTCACA 

CAAAGCCTGTTTGGTTATCTCTTCACACAGACATGCGTGACACTTGGTGCTGAAGACCCG 

GGATGGGGGACTCCTTCGGGAGACTGGTCCCCTGTCCTCACCCTCACTCCATGAGGAGAT 

CCACCTACAACCTCGGGTCCTCAGTCCAACCAGCCTAAGGAACATNTNACCAATTTCAAA 

TCAGATCTTGGCTTAGTGGCTGAAGACTGATGCTGCCCAATTACCTCGGAAGCCTCCTGG 

ACCATCACAGATACTTTGAGTAATCTCTTATAGTGGAAGGATGCAAAGTTGGAATAA 

Sequence 2593 

CCGGGCANGGTACCATATGTTTCAAAGTAGCTGTTCATCCACAGATAAAGAGATCAAGAA 

ACTCTCATATACATACACTAGGAAATATTCTCCAGCCATCAAAATAATGAAGCAGTGTCA 

TTTAGAGCAACACAGATGAACCGCGGAGACCTGCCTCCTACTCCACCATCACATGGAACC 

CACCACTGCTTCTCCGAAGCTCGCTCTGACCACGCCGCTGCTGCTGCAGGGGCCTCGCAG 

GAAGTGCAGTCTCCCCCGCGTACCTN 

Sequence 2594 

TAGGGCG AATTG GAGCTCCCCGCGGTGGCGGCCGAGGTACTTTGGTATGTTTTTGTCTTG 

AAAAGAGGTTTTCTAGCTAGAAATTCGGGGCCCAGAAGGTCAAGGCTCCAGAGATCCCCT 

TCTCATCATCCTTCAGTTGTCTGAGGAGACAGAGGATCACATGTGTCACAGTGAGTGCTG 

ATTCCCCAGAATCTGCAGTCTTAAGCTCCCCTGCTGACGCACACAGCCAGTTCTCTGCTT 

CATATTCATGTGACCCACTGTGAAGTGACAGGCAGCTGCCAATTCTGTCATATAAGAAGC 

AGCTCTGCCCATGCAACTTTGAAAGTTCCTGCACTTCTCCATAGACTGCTCCTTCATGGA 

MTAAACACATCATCACACTGTGGGTGCCTMGCTMGTTTGTTTGCAGTTTGATGATGC 

CCAGCTCCTTTTCAACTCAGCCTTAATTGGCACTTTA 

Sequence 2595 

CGAATTGGAGCTCCCCGCGGTGGCGGCCGCCGGGCAGGTA C I I 1 1 1 I I 1 1 11 I 1 1 I I I GA 

GACAGAGTCTAGCTCTGCGCCCAGGCTGGAGTGCAGNGGCACCATCTCGGCTCACTGCAA 

CCTCCACCTCCTGGGTTCAAGCAAGTCTCCTGCCTCAGCCTCCTGAGTATCTGGGATTAC 

AGGCACATGCCACCACGCCCGGCTMTTTTTGTATTTTTAGNGGAGACGGGGTTTCACTT 

GTTGGTCAGGCTGGTCTCAATCTCCTGACCTCGGGATCCATCTGCCTGGTCCTCCCAAAG 

NGCTGGGATTACAAGGCGTGAGTCACCXaTGCCCGGCGAGMGCMCTCTTAAMTACTTT 

ATTCCTTCCTCTAGGACCCTTTAAATGGTGAAAATGGGCAGATGAGTAGCAATAAGTGGA 

CCTTTGTTACTCTTCTGAGTTAGAA 

TAAGTGGATCCCCCGGGC 

Sequence 2596 

ATTG GACCT NCACCGCGGTGGCCCGCCCGGGNANNAACCATAAACCGTNGCAGNCTCAGC 

ANATATTTTCCCTTCCTTAAGTCAGTAACTTTCACCTTTTCACTTACAGGMGCACTTTA 

CGGCTTCTCTTTAGCATATGCAMTTGCCAGCATTACCACTCTTGGACTTTGGGGCCACT 

GTTAAGTAAAGTAAGGGTTACTTGAACATAAGCACTGTAGGCCGGGGCATGGTGGCTCAC 

GCCTGTAATCCCAGCACTTGNGGTAGGCCAAGGTGGGTAGATCACCCGAGCTCAGGGAGT 

TCGAGGACCANCCTGGGCCAACATGGTGGAAAACCCCACCTTTACTAAAAAATACAAAAA 

TTAGCCCAGGTTTTGGTGGGCTGTATACCTGGNGATGCGCANCCTATTCAAGGAAGGGCT 

ANAGGGCAGGGAGAAATCGCTTGGAACCTGGGGAGGCCGGNANGTTGCAGNTGAGCCCAA 

GAATCCGTGGCCCACTGCACTCCAANCCTGGGGTGACAAGAAGCCGAGGACTCTGGNCTC 

AAAAAAAAAAAAA 

Sequence 2597 

CCGGGCAGGTACTTCTTGATTTCATCATACAAGACAAGCACAAAAGCACCACCCATGCCT 

CTGAGAACATNGGACCATGCACCCTTGAAAAAAGCTTTGCCTNCTTCATCACGAGCAATC 

TTCCGCCAGCAGTCAAGCCGTGCCTGTGTACAGATGGGGTTTTGCCATGTGGACCAGGCT 

GGTCTCGAACTCCTGGCCTCAAGTGATGCACCTGCCTCGGCCTNCCGAAGTGCTGGGATT 

ATAGGAGTGAGCCACCACGCCCGGCTACAGAGTTGGGTTTTAACAGAAGAGGACCTTGAA 

TGCTGMGCTTNACAGGGCGGCCAMCTMCTCGCTGATTTTTGCAAGACCACAGTGTAA 

AGGTCGGATGTCCACCTGAAGAAGGGGTGGGTGCAACTCTCTGGGTGCTGCACACACCAT 

GAC CMGC CTGGGCATGCAGCACCCCAGCTCCCATNCATTCACACTGGTTGCTTTGTGAG 

GTCATTTTGAGAGGGCTTTCANAGCCTTTTAATGAGAA 

Sequence 2598 

AGGTACCTTTGACCCATCATCTTGGGAGGTGGGGAGGACCNCGAGGGNCCAGGCAGGGTG 

TAGGGGMTGTATTAGNCCAANGAGATTTCCCTCTTCATCCGCAGCAGNGTATCTATTCT 

ATACCTGGCTATGGGAGAAGACCCCTTGCATGGGAGGGACCCCTTGCTATGGCCCCTTTA 

AGCCAGGCAGTGGGGATCTACCTGNGGCCCCGGCCTCCCTAAAGTCATTCACATTGAATG 

GGGGGATGAAGGNTCGGGACAGTGGCTCATAAGAGCCCGAGTATTGAGCCCTAANCTGTG 



GGGCCTMNAAATGTTCCTTTMTAAMCMTC 

TNAAAAAAAANIMAAANGTACCTTGCCCGGGCGGGCCGCTCTANAACTAAGNGGGATCCC 
Sequence 2599 

GGCAGGTACGCGGGTGGACACMTTTCCACMCCTTTCAGATGCTGATGTAGAGCTATTG 

GGAAAGAACTTCCAAACTCAGGAAGTTTGCAGAGAGCAGACAGCTAGAGATAACTCGGGA 

CCCAGAGTTGGTCGACAGATGTTAGGATGTATCCTAGCTTTTAGCTATAAACCACTCAAA 

GATTCAGCCCCCAGATCCCACAGTCAGAACTGMTCTGCGTTGTTGGGGAAGCCAGCANT 

GGGCCTTGGGGGAAGGAAAGCCCATGGCTGTGGTTCAAGAAGAGGGGTGGGGCCTGGGCA 

AGCCCACTTTTCGGGGGAAMACTCCTTCCGCCCCAGGGTTTCTTCTTCTCTTAAGGGAG 

GAG ATTAT TTNTTCACCAAACCCCGCTGCCTTTCATGCCTGCCTTTCAAAAGCTAAGATC 

CATGTTTTG CCTTT GCTT MGAG NAATTACCTGCAAAATCANGCCCCAAGTGGCTTTGGG 

CGMTNCCATTTTAAAAGATTTTCTAGGCCCCTTCMGGGTTTTTGTAGAAGTGTNAAGC 

CCTTGGTGGGGCAAGGGNTTG 

Sequence 2600 

GGGCGAATTGG AGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGGACTTNN I I I M 1 1 1 I I 1 

I I I I 1 1 1 1 1 1 I I I ACGGAGTGTNTGTGGACAGTGCCAACTTTATTGAAGACAGNTCCCTN 

TGAGACTACAGGCCCAGCCACTCANAAGGCTGAGGCAGGAGGATCGCATGAGCCCAGGAG 

TCTGAGGCCGCAGCGAGCCATGGTCACACCACCGCACCCCAGTCTGGCAACAGAGCAAGA 

CCCTGTCTCCAAAAAACAAACACAAAAACAAAACAAAACAGCCAAAAGCAGGCTAGAGGA 

TGCTAMTGAGACTTTCCTAAATCTCCATTTTGGCTGGGACCCAAAGCTGAGGACAAAAA 

TTGGCAMTATATCCATATGTTGCAGA7TTGCTGCCNATCCATGAATGATTTGTTAAGGA 

Sequence 2601 

AGGTACATGGAGTTTTGATCAACGGACTTATNCCNCCTCTCCTAGGAACTAAAATGCCAG 

GGCCAGGCTGTGTATTTCTTTCCCAGGAMTTAGCTTTCXJAGCCCCTTTATATATGGAAG 

GCTGGGTTAAAGTTATGGTTCCAGMGCTTCCAMTCCTGAMTAGATGTTTTAAAGATG 

CAACCTCAAACACCAATGCTGTTGTTAAAGAGCCTATGGGGAATTGCTGCTGCTCTAGAA 

ACTAGTGGATCCCCCGGGCCTGCAAGGGAATTCGGATATCAAGCTTAATCGATACCCGTC 

GACCTCGAGGNGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnAGGGGTTTAN 

ATTTGCCGCCGCCTTGGGCGGTMTTCATGGGTNCATAGGCTGTTTCCTGGTGGTGGAAA 

ATTGGTTTATCCCGCTTCACAAATTCCCACACAACATNAC(3GAAGCX)CGGGGAGCCATAA 

AGTGGTAAAGCCTGG 

Sequence 2602 

TGGAGCTCCACCGNGGNGGCGGCCGAGGNACCAAAAAGACTCTCAAAAACCAATACTCCC 

CCGGGCMGGGAATAGCCAAGTTTGTTGCGGTTTNCAATGAATGACATCAGCCCTGTGTA 

GGTCTCMTCAAAATGGGTTCAGTTMCACCATCAGTTTCTTTCCTCTTCCAGATCCAGT 

TGMTTCTTGTGGGCATTCTGGATAGCTGGAACAAGCTTAGACATGAACCCAGACAACTT 

GCAMTTTCAAGGAATTTCTCACTGGTGNATTTCATAGGATGCTCAGTGAAAGTAGNATA 

AGGAACTTCAGTGGACCATGGGTTCCAGCGGGGACAAGAAAGAGACTGCTCCTCCGGGAC 

TCCCCCAAGTAGAATCTTAAGGGCCTTCTCCTTGGTCTTCTTGTCCAGGGGACATCCCAG 

GGAAAGGTGGAACTTTGCCCAGGC 

Sequence 2603 

CCGGGCA GGTAC GCGGGGGGCATATCAGATACGAGGGCACCATGTAGCACAGCTGGACCC 

CCTGGGGATTTTGGATGCTGATCTGGACTCCTCCGTGCCCGCTGACATTATCTCATCCAC 

AGACAMCTTGGGTTCTATGGCCTGGGATGAGTCTGACCTCGACAAGGTCTTCCACTTGC 

CCACCACCACTTTCATCGGGGGACAGGAATCAGCACTTCCTCTGCGGGAGATCATCCGTC 

GGCTGGAGATGGCNCTACTGCCAGCCATATTGGGGGTGGGAGTTCATGTTCATCAATGGA 

CCTGGAGCAAGTGGCCAGGTGGGATCCCGGCAGGAAGNTTTGNAGACCCCTGGGATCATG 

GCAGGTTCACAAATGGAGGGAGGAAACCGGACCCTGCCTGGGCCAGGCTTGTGCCGGNTC 

C ACCC AANGTTTNGAAGGAAGTTTNCTACAGCCGGAAAGTGGGNCCCTCTGANGAAGCCG 

CTTTTGGTTCTAAGAAGGCTGGCGGAGGGNACCCTTCGGCCCGCTTCTAAGAACTAGNTG 
GGATCCCCCCCGGGCTGGCAGGGAAATTCGGATATCNAAGCCTTATCGATACCCGNCCGA 

ACCCTCGANGGGGGGG 

Sequence 2604 

CTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAGCGTGGAGGGTTTAGGCAGCGTGTTCT 

GATTCTTTGCGGGACGGCGAGCGCATTTGTGCTTTGCCCGCCGCGGCCTAGGAGGCCTTT 

TGAGGCCGCGTAGTCGGTGTTTTTGAACTTACTCTACAGCTTCTGGCAGGCCGTGCGGCG 

CCCTGACCCGGCCTCACCATGTTGGTGCTGTTTGAAACGTCTGTGGGTTACGCCATCTTT 

MGGTTCTAAATGAGAAGAMCTTCMGAGGTTGATAGTTTATGGAAAGAATTTGAAACT 

CCAGAGAAAGCAMCAAAATAGTAMGCTAAAACATTTTGAGAAATTTCAGGATACAGCA 

GAAGCATTAGCAGCATTCACAGCTCTGATGGAGGGCAAAATCAATAAGCAGCTGAAAAAA 

GTTCTGAAGAAAATAGTAAAAGAAGCCCATAAACCGCTGGCAGTAG 

Sequence 2605 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTTGGGTCTGAAGGGGTCAC 

TCAAGCGCTCTGCTATCTCTGCAGCTAAAACGGGTGTCAGGTTTTCAGCTGCTACTAAAG 

ATAATGAGCATAAGCGTTCACCGACCAAGACTCCAGCCAGAMGCCTGCACATGTGACCG 

TGTCTGGGGGCACCCCAAAAGGCGAGGCTGTGCTTGGGACACACAAATTAAAGACCATCA 

CGGGGMTTCTGCTGCTGTTATTACCCCATTCAAGTTGACAACTGAGGCAACGCAGACTC 

CAGTCTCCAATMGAAACCAGNGTTTGATCTTAAAGCAAGTTTGTCTCGTCCCCTCAACT 

ATGAACCACACAAAGGAAAGCTAAAACCATGGGGGCAATCTAAAGAAAATAATTATCTAA 

ATCAACATGTCAACAGAATTMCTTNTACAAGAAAACTTACAAACAACCCCATCTTCGGG 

ACAAAGGAAGAGCAACGGAAGAAACGCGAGCAAGAACGAAAGGAGAAAGAAAGCAAAGGT 

TTTGGGAATGCGAAGGGGCC 

Sequence 2606 

CCGGGCAGGTAC I 1 1 I k 1 1 1 1 1 M I I I AACATCTTTGTTTTTAAAATAG AATGATAG AAC 

TTTGCCAGTCTTTAAGATCTTGGCTTMTTTAATGTATTAATCTGTTTGTGCAAACATAA 

TACCACCATTTAAAAATGTTAGGGGGATGAGTTGCAGTTTTTATAATAG A I I 1II11II A 

AAGTTTGGTATTGTAAAACATTCACACCTCTGTCCCTCAAAATTGATAATTACCGTTTAA 

AGNGCAGNCATTTGNGGTNTAGAAATCTTGTTTTGTTTTGCTTCCATTATTGAGT^ 

CTAAGGGAAAATTGGAGGAGAAGGGGACTGGAATATGAAAGCCCCAAATTCATATAAAAA 

AGTTTGCGGTTNTMGGTTTGTATTAAAAMTAGNATATTATTAANGAAAAMTTTTT^ 

TCACTTGATGTTTTGGTTAG 

Sequence 2607 

ATATGGG CGAATGGACTCCCCGCGGTGGCGGCCGGGCAGGA CI M I I I I I I CI I 1 I I I I I 
TTTTTTTGGAAACTTCCTTTTCTTA 

GCTTGCCTAAGTTTTGGGCAGTGMCTCCTTGGATGTTCTGGGCANGTAAGTGTTTATCT 

TGG GCCTG CAATGAAGCCAGCCGAAGTCCATCCCTTGGGCAAGGGCCGGGCTTGTTGGTG 

GGGTTTTTGGAMGAAGTTTGGGACCAAGGGGTCCTCCTCAAGGGGGAAGCCGGGGGGGG 

GGTTCTTCCCTCGGGCTCTNGGGCCGCCTGCCATATTTCTTCCTTGCCTTGGGCCGAACC 

GCCTGCTTGAATACCTTGGMTTGTTTTCTTGCCTTGCTTGGTTGTTTTACCTM 

TTTCCCTCMTGGTMTGTGGTTTGGTAATTTTAMCCTMTMTCCTTTGGGCTCAATT 

TTCMTCCCACCTCTTGTCCAATCAAGCAMCTTGNTMNGATTCTTTCCCCCAAAAATN 

GMTTTGGCTGGCTTGGGCAAAGGGCCTTGMGCAAATTCNATTGGTTTTMTCTTGCAA 

ACAAGCCTGMCrmCTTTTTTCMGGGTTCCCCACCATTTMGG 

AGGATGTGGAAATT 

Sequence 2608 

CGCGGTGGCGGCCGCCGGGCAGGTCAAACGATGTGTCCGTGATGACTGGTGGGGCTCATG 

TMTCCCCACCTMGAAAAGTAGMAGTGCAACTTTATCTTTTAGGTTAATAAGTGCTGA 

GAGATGGAGGTTTITCCTTCTCATTTTGATGGAGATGCCTAGAAAACCTCGCCTGACACT 

CTTTGTCCAACGCAGGATAGAGAACATAGCAACAGAAAGGGTGAGGCAAAAGGCATGGCT 

GGTN AAA AGGCA CTGCATGTTATTAAGGATGNGGGGCCTGGTCCTGTTCGNTTCACATGT 

TTTTCTT^^^TTTTATACAGAAATAGGAATCTACCAGACAGTAATAAATGCCACTTCTCAC 
AGAAAGTCTGACAGGCTTCCACTGCCTCTGAGAGAACAACAACATGTTGGCTCCATAACA 

TAAAGAAAAACAATGCTGGGTGCGGTGCGGTGGCTCGTGCCTATGATCCCAGCACTTTGG 

GTGGCTGAGGCAGGAGGACTGCTTGAGCACANGAGTTTGAGACCAGCCTGGCCAACANGG 

CGAAACCTTCTCTNTACTAAAAATNCAAAAAAAAAAAAATTAGCTG 

Sequence 2609 

TCACCGCGGTGGCGGGCGGCCGCCCGGGCAGGTAC 1 I 1 1 I I I I I I I 1 I I I 1 1 1 11 1 1 GTG 

GAGACAAGGATCTTGCTCTGTCACCCAGACAGATGTGCAGTGGCATAATCATGGCTCACT 

GCAGCCTCAACCTCTGGGGCTCAAGCAATCCTCCTGCCTCAGCCTCCCAAGTAGCTGGGA 

CCACAGGTGTGCACCCCCACTCCTGGCTMTTTTTTTA 1 1 1 I 1 1 GTAAAGATGGGGNCTT 

GCTATGTTTNCCMGGTTAGTCTNAAACTCCTGGGGTCAAGCGATCCTCCTGCTTAAACT 

NCCAAAGCACTTGCGATTCAGGNGTGAGCCACCMGCCCAGCCCCAGCACCACTTTTAAC 

TAGCTGCATMTCTTGGGTAMCTACTMCCTTTCCAAAGTACCTCGGCCGnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnNATTGCCCGCTTGGNGNAA 

TCATGGCAMAGCTGTTTCCGGGNGAAMTGNTTATCCCGTTACAATTNCCNAAANATAC 

NAACCCGGGAGCNTAAAGNGTAAAGCCNGGGGGCCCANTGGGGAGCTAACTACAATTAAT 

GNGTTGGGCTACTGGCCNNTTCAATGGGGAAAACCNGGNGCCACTTNNTANTGAATCGNC 

ACCCCCCGGGGGAGGGGGTGGTTTTGGCCTTTTCCTTCTNGTMTANTNNTNGCCGGGGT 

NG 

Sequence 2610 

GCTCCCCGCGGTGGCGGCCCGCCCGGGCAGGTACAATGAGATGGATACAATTAGTNAAAC 

CTTAAAATTAAAAAAGCTGTAGACAACAGAAGGNAAACTGGAAATCCATTTACAATTCAA 

MGMCTCACTAATAACAAAATTAATGTTCATCAACTTCATTTATAATCCATTTNGGCCT 

ACATNGCNTAACTAAANTGACACATGTCCCCGGGGGCTGCAGGCGTNGCNCCAATCTTCG 

CTCTGAGGNGCTNTCTTAACCGNNANACCCTGGAAGCGGGCAAGTCTCTTGCTGTGTCGG 

GACCTTGCAGNCCCTGGCCCTTCCGCCACCATGGGAATACCTNGGGCCGTTCTAGAACTA 

GGTGGGATCCCCCGGGCTGNAGGGAATNCNNTATCANGCTTATCGATNCCGTCGACCTTG 

AGGGGGGGGCCCG 

Sequence 2611 

CCGCGGTGGCGGCCGAGGTACCAAGTGTGGGAAGATGTTGAGCAACTGGAACTCATGCGT 

GGCAGGTAGGGATGTAAAATGGCACAAAGACTTTGTAAAATACTTTGGCAAATTNNNAAA 

AAGNAAACACATAGCTACCATACAACCCAGCCATCCCACTCCAGTATTTAACCCAGGTGA 

MTGAAAA CTTA TGTCCAMCAAAGACTTGTACTTTTCTATGATGACCCGGGCCGGCTTC 

TTTMCGNTTTTNGGTGCGAACCGCNGCCCATGTTGGCGGGTCCTTGGTAAAAGACCCCG 

CGTCCTGCCCGGGCGGCCGN 

Sequence 2612 

GGAGCTCCCCGCGGTGGCGGCCCGAGGTACANGAAAGTCTAGATGATCTTGTAGTGCCAG 

AAAGTAAGAAAGTAATAAAAAGATGACAGGTCTGTCACAATGATACAGAAGCCAATGTGA 

CAMGCTCTCAATAGTTAMTCTCGAATTTGAGTAGTAAMGTGACACAGTTTTAGATTA 

TAACCCAAAGAACTAAATAAATATCCATGAGCCCTATTGATATAAATGACAATTAAGGGT 

TTTTTGTTTTGG 1 1 I I I I I G GTTG CTTTG GTT 

Sequence 2613 

GAAACCTGTCGTGCCCAGCTGCATTTANTGAATCGGCCAACGCGCCGGGGAAGAGGCGGT 
TTGCGTATTGGGGCGCTCTTTCCGCTTTCCTCGCTCACTGACTCGCTTGCGCTCGGGTCC 
GTTCGG 
Sequence 2614 

GGAGGCGGCCGCCGGGCAGGTACTTTGTTT C I I I 1 1 1 I IC C I I I I I CI I 1 I I I I AAGACA 

TGGTCTTGCTCTGTTGCCCAGGCTGCAATGCAATGGTGCCACCTCGGCTCACTGCAANCC 

TCGACCTCCTGGGTTCMGCMTTTTCCTGCCTCAGCTTCCAGGTAGCTGGGATTACAGG 

CGCCCGCCACCACACCAGGCCACATCTATGTATTTAGAGACAGAGTCTTGCTCCACCTGG 

GAGACAAAANCNGACCTCCGTTTAAAAAAAAAAAAAAAAGGGAAAGGAAAGGAGGAATTT 
TTGAACCCAGTGCCCGTTCCTTGCAGATCTGCGAACAGTGGGCACCAGCCTCTAGTCTCA

AGTGTGAGGTTTGACTCCAGGACGNTGAATTTA 

Sequence 2615 

CGCGGTGGCGGCCGCCGGGCAGGTACGCGGGTGCTACACGGAGGATGMGCTAAGGAGCT 

GGCTGCGGAGGTGGAGGGTTCAAGACGGCCCCAATGAAGATGGGGAGATGTTCATGCGGC 

CAGGGAACTTGTCGACTATTTCCCAAAACCATACCCCAACAGTGGAGGGCTGNTNGAGNT 

GCCAACAACGGAGCNTNGCCCCCTGACCTCAGCTACATCGTGCGAGCTAGGCATGGTGGN 

GANGACTTCCNTTTTTTTCCTTGTTNACGGGCThnTTGGAACCNCCCCCCGGNNTTTTNT 

NTGGGGNAAGGGTN TTTAT TAAAMNCCNTTTTTTTNCNGGCAAGCCTTTTGANATGGNC 

CCNCCTTTNAACAAA 1 1 1 1 1 1 1 1 ANNTTTTTNGANGNGGGCCCCCCATCTTNCTNTGTTC 

CAANATAGCCAAGGGTTGTGTGCNCCTTCCTGCCGCTGGGCATTTTGANCCAGAACNCCA 

CCATAAAAAACGCNTGGGCTAAANATTGTTNATAATGATGGNTTTGGGGNNNCCCCTNTC 

TANAANATAMGGGGGACATGGGNTTTTTTCTGAAMGGNGGNATTTTTNTNCCCCCCAT 

TNAAAAAATGCNCCCCCCGGGGGAAAAAATTTTTN I 1 1 1 1 1 1 1 I 1 1 CCCCCCCCCCCC 

Sequence 2616 

CCCGCGGTGGCGGCCGGGTACATTGCTATTTTTCCACCTATGTCCACACCTTCTCATGAA 

CTTTCAAAGTCAGAGGCAGNTACCAGTGCCATTAGAAATACAGATTCAACAACGTNGGAT 

CAGCACCTAGGAATCAGGAGTTTGGGCAGAACTGGGGACTTAACAACTGTTCCTATCACC 

CCACTGACAACCACGTGGACCAGTGTGATTGAACACTCAACACAAGCACAGGACACCCTT 

TNTGGAACCGATGAGTCCTACTACGTGACACAAGTCACTCAAAGATCAAACATCTATCCA 

GGCCTNAGCATTCCCTTTCC ATCTT ACTGGAAGTCTACCTGNGCTCGGGACACANGGGAG 

AAGCTCTNTTGAGGCACCACTTTTTGGGAAACCATCTACAGACACACTGTCCAGAGAGGA 

TTGAGACTGGCCCAACAAACATTCAATCCACTCCACCCAT 

Sequence 2617 

TATTNAGAAGTGGTTCCAGGTCTTAGCACAAAACATGGATGCACTTTAGGCCANNCATGG 

TGGCTCACACCTGTAATCCCAGCACTCTAGGGGGCCAAGGTGGAAGGATCCCTTCAGCTC 

AAGAGTTTGAGACCAGCCTGAGCAACATAGGGAGACCCTATCTCTACCAAAAAAAAAAAA 

AAAAAAAAAAAAGTACCTGCCCGGGCGGCnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCTCGAGnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnTAATTGCCCCCTTGGGCGTAATCANTGnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnTCCGNTCACAATTNCACACAACATACNNGCCCGGG 

AGCATAAAAGTGTAAAAGCCTGGGGTGCCTAATGAGNGnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnGAAACCTGTCGNGCCCAGCTGCATTAAATGA 

ATCGGCCAACCCCCCGGGA 

Sequence 2618 

CCCGCGGTGG CGGC CGGGTCTTTAGGATGCTGGTAAACTATGGAATTTCAGACGTGGACA 

TCTGGCAGACA I 1 1 IUI I I AAAATGAATAAAGTGAGCTTATCACCTCAAGGAAAACAACT 

NAAGCCATCT GTTG CCMTGATAAAAATATGAGATTTAAAAAAATGTTGAAGNTTCAGGC 

TAAMTTAAAAT TTTGA AAAACTTATA7TGGCCACCACANGTTTGACACCTTACCAATTA 

TTAATGGGANGGTTTTNTTAAGAGGNCMCAGTGATNTTAACAMTGTGGGTTTTTTTGN 

TNTTGG^JTTATTGGGAGGTTTANCNATTGGGAAGATCTGTAAGACTCAAGNGAACCATGG 

NTNTCCAAATGACTAAGACATGATGGNTACAAAATCACCCCTGGGTAAAAGATCCATTCA 

AAGTGCAGGACAGACCAATGGAATTTAGTGTAACAGAGGTNTGAAAGGTTCATGAAGGCT 

GGGCATGGTGGCTCACX3CCTGTNATCCTAGCACCTGGAGGGCCCG 

Sequence 2619 

CTCCCGCGGTGGCGGCCGGGTAC 1 1 I 1 1 1 1 CTTTTCTTTT C I I I I 1 1 1 1 1 1 1 II I I GAGA 
CGGAATCTTGCTCTGTCACCCAGGCTGGAGTGCAGTGGCGCAATCTCGGCTCACTGTAAG 
CTCCACCTCCCGGGTTCACGCCATTCTCCTGCCTCAGCCTCTTGAGTGGCTGGGACTACA 
GGTGCCCACCACCACGCCTGGCTA C I 1 1 I 1 1 1 I G T AI I 1 1 1 ATTAGAGACGGGGTTTCAC 
TGNG TTAGC CNNGAATAGGNTCGATCTCCTGCCTTGNGATCTGCCCCCCTTGGCCTTCGA 
GCCCI 1 1 I IGACCTNAAGGACCAGCACTGGAATAATGTTTGGGANNCCCNTGCTTTTNAG 
GCAGGCTTAGGGAGCAGGAAAGCATACAGGTGTAGCAGCCTTTCCAGCTGATCCCCATGC
CCTGCTGCACCTGGAGGGCTGGAACAAGCTATTCTCATATTGGGGAAAAGGGCTGA 
Sequence 2620 

CTCCCGCGGTGGCGGCCGGGTACTTNGTTCCTTTTCTTTN C I I 1 1 1 II 1 1 1 1 1 I 1 1 GAGA 

CGGAATCTTGCTCTGTCACCCAGGCTGGAGTGCAGNGGCGCAATCTCGGCTCACTGTNNG 

CTCCACNTCCCGGGTTCACGCCATTCTCCTGCCTCAGCCTCTNGAGTGGCTGGGACTACA 

GGTGCCCACCACCACGCCTGGCTAC1 I I II 1 1 I GTATTTTTATTAGAG ACGGGGTTTCAC 

TGNGT TAGNC CCGGANAGANTNGATCNCCTGACCCTGGNAACTGGCCCCCTTGGCCCTCT 

MCCCTTTTTGACCCCTMGGGAANGTACT^ 

NGGCAAGGCTTAAGGGAGCAGGAAAAGCATACANGGTGTAGCANCCTTTCCAGCTGATCC 
CCATGCCCTGCTGCACCTGGAGGGCTGGAACAAGCTATTCTCATATTGGGGAAAAG 
Sequence 2621

NCCGCGGTGGCGGCCGCCGGGCNGGACCTGTTTTATCCCAGCTGAGAGGCAAGGAGAACC 

TTTGTTTCTTAAAAMTAAGCTGGTTTCAGCCAGGTGCGGTGGCTCACGCCTGTAATCCC 

AGCTCTTTGGGAGGCCGAGGCGGGCGGATCACCTGAAGCCAGGAGTTCGAGACCAGCCTG 

GACMCATGGTGAGACCTTGTCTTTATTAAAAATGCAAAAATTTGGCCAGGCGCCGNGGC 

TTTACCCCCTTATTCCCAGCACTTTGGGAGTCCCANGCAGGTGGATCACAAGGTCAAGAA 

ATTGAGACCCTTCTTGTTTACACCAGGGMCCCCCTTCTTTCCTAAAAATTTNTAAAAAC 

CAAA^fTTGCTGGCATGGNGGGGGCACCTGTAGTCCTAGCTACTTGGGAGCCTGAGGCANG 

AGAATGGTGTGAACCCGGGAGGCGGAGCTTGCAGTGAGCCCGAGATCGCACCACTTGCAC 

TGCAGCCTGGGGGACAAGAGTGAGACTCCCTCTCNAAAACANAACAAAACAAAATTNNCC 

NAGCGTGGTGGCAGGCCCCTTGAGGCANGANGAATCACTTGAACCCCGGGG 

Sequence 2622 

GGGCNAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTGANGACCTTAGAAAAAGGAG 

GAMGGAGGAGAGGCAGATMTTTGGATGAATTCCTCAAAGAATTTGAAAATCCAGAGGT 

TCCTAGAGAGGACCAGCAACAGCAGCATCAGCAGCGTGATGTTATCGATGAGCCCATTAT 

TGAAGAGCCAAGCCGCCTCCAGGAGTCAGTGATGGAGGCCAGCAGAACAAACATAGATGA 

GTCMGCTTTGCCTCCCCCCCCCCTTAGGGAGTTAAGCCAAAAGCTGGACAAATTGCCCA 

GAGCCTGTGATGCC7TCTTAGCAGGGAGAGCAGATGGAAATCCACCTTGTAGAGGCTTCC 

CCAGAAGAACCTCCAMTTCTGTCAGCTNATACCAGAAGTTAGAACTTCTGCCAGAAAAA 

AAAAAAAAAAAAAGTACCTGCCCC 

Sequence 2623 

TGGAGCTCNCCGCGGTGGCGGCCGAGGTACTTCNI I II I 1 1 I I I I I I I I 1 1 1 ICTTATTT 

AGGGGAGAACTTTTACCTTTTCACTTMTGCACTTCTCTTTGGTGTATCTGTNTTGNGNG 

CAACACTACTCTTGCTCTTNAGGGCCATTAANTAAAATAAGAGTTACTCAGGCTGGGTGC 

AGTGGCTCATGCCTGTAATCCCAGCACTCTGGGAGGCCAAGGCGGGCGGATCATGAGGTC 

AGGAGTCMGGGCMNCCGGCNAAATNGGGGAMCCCCMNTTTTTCTAAAAATACAAAA 

ATTCNTCCNGGCAATGGGGGGCNNATGCCTNAAATCCCAANANACCCTNAAAAGGCTNGG 

GNCNGGGAAAANTNTTTNTAANCCAGAANCCCCGGGNGGNNGGGGNGNTGNANATANGCC 

CAATANNCCCCCNTTNNCNTNTNCNCCCGGGGGAANNNTTGGAANNCCTTTNTTNTNAAA 

AAAAAMCNAAAMGGNTNTCNCTNTNCCCCCCNNGGGNMNNNGANACCTCTTTTTTTT 

CTCCNCCCNNNTTTCNCTGGGGGGGGGGNNGAATNTTANTAAAANAANCCNCCCCNGGGN 

GANANAATNTNANT I I I I I I I CACCCCCCCIMCGGGG 

Sequence 2624 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAACATACTGCTTTGATTGATAT 

ATACAAAACACCACCTNCTTTTGATGGAAACGGGGCCCAGAAGCAGATCCTGNGATGAGG 

ATTAATGTGCAATTCATGTAATNTATANCACAGTNTTCAGGANGTGCTCACAGGATAAAC 

CAATGAGGGAGNGGGTTCAGGGGCNGGACAGGGAAGGGAACTAAACTGNGCAAGGGACNN 

TTTNNGGGCANGGNNCGGNCTTAAGCCTANTCCCGTTGGGAGCTCTGGACATACATNAGG 

CTTCAAAGNTNGGNNCAACTGGGNGCAAAAAAACCTGGGCTGTTTATGCCTGCACCCCTT 

TAGGCACCCTAAGCCCTACTTTGAAGTNTTTATTTTAGCTCTCAATGTGTTGAACTTATA 






AGGAAATTAAACGTTTCATGAGTTAAG 
Sequence 2625 

TCCCCGCGGTGGCGGC CGAGGTACAGATA ATTCAAATCTATTATCTAATGTATTAGGTTA 

ATTTCTATTTGTTCTATAC I I 1 1 I I I I I I I I I 1 1 GAGATGGAGTCTTGCTNNGTCGCCCA 

GGCTGGAGTGCAGNGGTGCAATCTTGGCTCACTGCAAGCTCCGCCTCCTGGGTTCATGCC 

ATT CTCCT GCCTNG GCCTTC CAAGNAGCTGGGACTACAGGTGCCCGCCCCGACACCTGGC 

TNATTTTTAAAAAAATTTTTTAGTAGAGACAGGGTTTACCGGGTTMCCAGGATG 

NATCTCTGACCTCGGGATCCCGCCCNTTTGGGCCTCCAAAAGACTGGGGATTACAGGGCG 

GTGAGCCACCG ACCC GGCCTATTGACATTTTTTMGGGTCMGATTTTCTrTTGTGTGTC 

TAGTMTTCGTCTTTTATTGCAAAGATMTrrGCTTATTNGACTNAGAAAAATGATTTGT 

GGGCATACAATATTGTATGTGGTACCTGCCCGGGCNGCCGCTCTAGAACTAGNG 

Sequence 2626 

TTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGATAATTCAAATCTATTATCTAATGTAT 

TAGGTTAATTTCTATTTGTTCTATAC 1 1 1 1 1 1 I I I I I I 1 1 I GAGATGGAGTCTTGCTCTG 

TCGCCCAGGGTGGAGTGCAGTGGTGCAATCTTGGCTCACTGCAAGCTCCGCCTCCTGGGT 

TCATGCCATT CTCCT GCCTNGG CCTTCC AAGTAGCTGGGACTACAGGTGCCCGCCCCGAC 

ACCTGGCTAATTTTTAAAAAMNTTTTTTNAGGAGAGACAGGGTTTCACCCGTGTTAGCC 

AGGATGGNCTCGATCTCCTGACCTCGGGATCCCNCCCNTTTGGGCCTCCCAAGGACTGGG 

GA TTACA NGGCGGTGAGGCCACCGCACCCGGNCTATTGACATTTTTTAAGGNTTCAGATT 

TTCTTTTGNGNGTCTAGNAATTCGTCTTTTATTGGCAMGATAATTTGC 

AA 

Sequence 2627 

GCTCCCCGCGGTGGCGGCCGNGGTACAGATAATAACATCTGATATCCACATGGGGTCTGG 

AGGTGCNAGCCACCTTCCTTTCATCCCACGGTCTCACAGCAGCCCTGGAAAGAGGCTGCT 

CTCTGTTGGAGGCTAAGGGCCAGTGTTGGAAGGAGCTCTGGTGGAAAAGTGTGGTCTGCA 

TGANGGGCTCCCATGAATNAGAGGATAGGGGTGGCNGGTACCTGCCCG 

Sequence 2628 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCAGAAGGTCCCGGCAGCAGCAGGA 

AGAAGACGGACCCCGCGATGAGGGCGGCGGCANGGAGCACCTTCATGTTGCGTTCGGAGC 

CCCGCGTACCCTATGGACAGTTGTGTCCCCAAGGAAGGATGAGAATAGCTACTGAAGTCC 

TAAAGAGCAAGCCTAACTCAAGCCATTGGCACACAGGCATTAGACAGAAAGCTGGAAGTT 

GAAATGGTGGAGTCCAACTTGCCTGGACCAGCTTAATGGTTCTGCTCCTGGTMCGTTTT 

TATCCATGGATGACTTGCTTGGGTMGGACATGAAGACAGTTCCTGTCATACCTTTTAAA 

GGTATGGAAGAGTCGGCTTGACTACACTGTGTGGAGCANGTTTTAAAGAAGC 

Sequence 2629 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACCTCATCCCCTCAG 

TGACTAAGAATTGCAGNATTTMGAGGTAGCAGGMTGGGCTGAGAGTGGTGTTTGCTTT 

CTCCACCAGMGGGCACACTTTCATCTAATTTGGGGTATCACTGAGCTGAAGACAAAGAG 

AAGGGGGAGAAAACCTANCAGACCACCATGTGCTATGGGAAGTGTGCACGATGCATCGGA 

CATTCTCTGGTGGGGCTCGCCCTCCTGTGCATNGCGGCTMTATTTTGCTTTACTTTCCC 

MTGGGGAAACAMGTATGCCTCCGAAAACCACCTCAGCCGCTTCGTGTGGTTCTTTTCT 

GGCATCGTAGGAGGTGGCCTGCTGATGCTCCTGCCAGCATTTGTCTTCATTGGGCTGGAA 

CAGGATGACTGCTGTGGCTGCTGTGGCCATGAAAACTGTGGCAAACGATGTGCGATGCTT 

TCTTCTGTATTGGCTGCTCTCATTGGA 

Sequence 2630 

CCGCGGTGGCGGCCGCCGGGCAGGTACAGATAGCAAAGACTGGGACCACAGTGGAGGGAT 

GCCTAATCCAGACAAGGGCAGGAATAGGCAGGGAAGGCTTCCTAGAGGAAGTGATTTCCA 

AGCTGAMCTTGACAGATGGAACAGMGNTAGCCAGAGATGGGAAAACTATTTTTGGTCA 

ATGGAAGANCAGGTGGTTGAGATAGAATCTGACACATGANAGCAAAAAAAAAAGTCCANN 

GNTGGGAGAATACNGNGTGAGAATAAGACAATNTTAAACTGGCNATATAAGTAAGNGATC 

ATCACAAGGCTTTGTAGGACATAGTAGGGAGTTTAAGACTTTTTATTCTGAGGGCAATGG 
GGAATCACAAGAGGGAGTTAGGCACTNTACACAACTTC
Sequence 2631 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTGCAGGCCTCCTACACCT 

ACCTCTCTCTGGGCTTTTATTTCGACCCGNGATGATGTGGCT 

Sequence 2632 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTGTGGATATCATACGAAAGTGT 

AGTTTCAAAGGAGATGGACAAACGATACCTACAGTTrGATATTAAGGCCTTTGGNGAAAA 

TAATCCTGCCATTAAATGGTGTCCTACTCCAGGCTGTGACAGAGCAGTAAGACTAACGAA 

ACAAGGGTCAAATACATCTGGATCTGATACACTCAGCTTCCCATTGCTGAGAGCTCCTGC 

TGTTGATTGTGGAAAAGGACACCTCTTCTGCTGGGAGTGCCTTGGTGAAGCACATGAGCC 

TTGTGACTGCCAAACATGGAAGAATTGGCTGCAAAAAATAACCGAAATGiAAACCAG 
Sequence 2633 

AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC C I 1 1 1 Hi 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 GAGA 

CGGAGTCTCGCCCAGGCTGGAGTGCAGGGGCNCAATCTGGTCTCACTGGNNGCTNNGCCT 

CCCGGGTTCGTGCCATTCTCCTGCCTCANACTGTCGAGTAGCTGGGACTACAGGCACCCA 

CCACCAAGCCCAGCTACTTGGGATCGCATGAGCCTGGGANTTTGAGGCNGCAGTGAGCCA 

TG ATCAC TCCAGCCTGGGCAACAGANNGAGACTCTNTCACAAAAATTAACTTTACTTAAT 

TACTTTTATTAT TGNCA TA ATCG CTCCATTTTATTGTTGTTTGTTGTTGTTGNAAACCAA 

ATTGTAAAACCATTTTATNTTTTGTTGCAAACCAAAMGTTTCTGAGACAGGTCTTAATT 

TATTTAGAAGTTTATTT 

Sequence 2634 

TGGAGC TCNCCC GCGGTGGCGGCCGAGGTA C I 1 1 1 1 1 1 I 1 1 1 1 1 I I 1 1 1 1 I I GTGATAAG 

GCTAATA AAAAGGAGGCACAGTGAATTTAAGACATTGACA7TTATAGCCGTAGC 

TCGACGCTTACTCTTGGAGGCCCAATTATCATCCCTGTCCATCTTGTAAGTGTCATGTCT 

TCGTCATCTTCTAGACCCCAGCTAACTGTGCCATCTCCTACTCCTTTCTGGCCTTCTTCG 

AGTTCTTCCAACAGTCGGAMTTGCGAGGGACTTTTACTCCCGAGCCCGTGGTGGCTGCC 

ATCTTGCGTCGCTGTTGCTTGAAGGCCGGCCCCTTCTCCCCGCGTCCTGCCCGGGCGGCC 
GCTCTAGAACTA 

Sequence 2635 

CTCCCCCGCGGTGGCGGCCGAGGTACTCTCCGCCTAAGTCCATAGAGAGGCTAACACCCT 

TCTAGTTTATTAGTCTTTG^ITAAACCTATGAAATTCTGAAT^^TAATGGCTAN^^TATATG 

CANNGTGGAGGNCACNCAAAAATGTTTCTCTCAACAGCTGTGATAAGTATAGGCTTATTT 

TGATGTCTAAAGATCTG^ACCTGTATCTGNTTTCATCTTTCAACACAAATTCATGGGAA 

NNTTAACTATGNNCCTGTGNTCNGGACAAGTGTGCATGAGAACATCATNCACTAAGTTTA 

TCATNAAATGGGAAANGAAGCAGACNT7TTAAAAAGCACCCAAACCGGCCGCTCTAGAAC 
TAGTGGA 

Sequence 2636 

AGGTACGCGGGGGCTGACTCTCTTCTCAGATTCAGCCCAGTTGCACCCAAGTAAAATAAA 

CAGTCTTGTTGCTCACACAAAGCCTGTTGGTGGACTCTCTTCATATGGACTCATGTGACA 

TTTGGNGCCGAAGACCTGAGACAGGAGGACTCCTTTGGGAGACCGGCCCCTGTCCTCGCC 

CTNTTTCA TGAG ATCCACCCATGACCTTGGGGTTCCTCANCCCAGCNCGAANGAACTTCT 

TCACCAANTTTTAAATCGGGGACCGCAAAGCCAACAAGAATGAAGAAGTGNCCAGGTATG 

GTGGCTCATGCCTGTAATCCCAAGCACTTAGGGAGGCCGAGGAGGAAGAATTGCTTGAGG 

CCNAGGAATCCCANACCAGCCTTGGACAACATGGCAAAATNCCANTNTTTTCNAATNANA 

TTAAAGNAAANAAATTAAAGTTACCTGCCTNGGCGGGNCGTTCTAGNAACTAGTTGGANC 
CCCCCGG 

Sequence 2637 

TTAGGGCGAATTGGACTCCACCGCGGTGGCGGCCGCCCGGGCAGGTACACGGTCAGTCCG 
GGTCCTAAGGGCGCAGGGTAGGGCATNCCACTGGGAGTTCAAAGGGGAAACNAAGATGGT 
TCCCACTGC TGCTC AGACAGT GTCG CTAAAATTCCCTCACTCATTTTCAAGTCTTGTTTT 
ACATMTGGCTTTTAAAGCAACTTTTGTTAATGCTNCTGATNCTTTAATNCAAACTATTA 




ATGTNCCATTTTGGTGACATCTGGGTTTTCATACANAAAAAAAAAAAGTCAAAAAGGGGA 
MCAMMNATATTTACACATTTTATATATAMCTTAAAAACCTTGTACCTTN 
Sequence 2638 

CGAGGTACTATGGCAGTTTTGCCTCAGGTGCTGAACATTTCTCAGCCCTGGCTAAAAGGG 

AGCAGCACAGGGAGAGAAACAGGATAGGAAAGCAGAATGGCGAGCAGCCTATGGCCCAGG 

GCCTGTAATCCCTTCCCAAGACTAGCTGCTCAGGGTGGTGCAGGGACAGGACCAGACCCT 

GCGCCTATTTCCTGCCTTCTTTCCCTATAGGGAACTCTGTAGGCTGAGCCACTGTCCTGC 

TCTTATGACATTATATCTTGTGCCTTTCTCCTCAGCAGTGAGCAGTGAGCTACTCCTGGC 

CCAGGCCCTAGGGGAMTGGATCAGTCTTTGAGGTTTCTATTTGGGGAGGGGAGTACCTG 

CCCG 

Sequence 2639 

CCGCNTGGCGGCCGAGGTACTTG I 1 1 I 1 1 1 1 1 1 I 1 1 1 I GAGACAAGGTCTCGCTCTGTCA 

CCCAGGCTGGACTGCAGTGACATGATATCGGTCACTGCAACCTCTGCCTCCTGGGTTCAA 

GGGTG ATTCTCGT GCCTCAG CCTCT CAGGTAACTGGGATTACAGGCATGCACCACCATGC 

CTCGTAI I I 1 1 I I IGCGTGTGTTTTTAGTAGAGACGTGTTTCACTATGTTGTCCCGGGCT 

GATCTCCAACTCCTGTAAGTCAAGTGATCTGCCCGCCTCAGCCTCCCAAAGTGCTGGGAT 

TATAGACATGAGCCACCACACCTGATGTCTGATGCTTATTTATTATGTGACCTTAGCGAA 

GTGTGGTAGTCATTAAGTGCTGGTCTATCTCTATACCTTCCCCAGGCAAGGTAGGATTGC 

ACTTNCCGCTCCACTTGTGATTAGGTGGAGCCATGTGACTACTTT 

Sequence 2640 

TGGGCTCCCCGCGGTGGCGGCCGAGGTACTAGAAACTTTCCAAGGAGTCTTGGGTGTGTA 

GCCAAGAGGAGCCATGAGCTATGGACTCCTCAAGCACGGGAAGAGGAGGTGTGTGCTGAG 

AACAGAGAGGCCCTGCCCTCTGTCCACTAGCGAGAATCCCTAGCTGCCCCAGCCCAGTCT 

TTCTCCCCGGCATTCACAAACTTTGCAAGCGTTGGTCCAGGGCCCTTCTCCAGATCTGTT 

NCMCTTT GNAG AGTGAAGGGCTTGAGCATACGGGGGAAGAGAGTCTGCATNANGTTAGG 

GGGAAAACTTTTAAAAGATACCCTCATTGTGTCAAMGAAGTGTGCCMTCTATTTTTGT 

ATCAGCATTGGGAAGNGCACTTTCCCCTGGGGCCGTGTGGGTGNGTGAATGTGCAAGTGT 

CTGAGAGATACTGCATCAAGCCCTAGACCCTCAAGAGCCAGTCCCGCCCTTTTACAGAGC 

ANTCCCTTATCCTGGGGCCATGGGTCAGGCTGACCTTCAA 

Sequence 2641 

CCCCGCGGTGGCGGCCGANGTACGCGGGGGACTTCGGGCTTGTTCGCTGGTGGCGTNNGA 

GCCNAGCCCGGACTGGTCAGGATNGATCACGGACGTGCAACTCGCCATNTGNNGCCAACA 

TGCTGGGCGTGTCGCTCTTCTTGNTTGTNGATCTCTATCACTTACGTGGNCNGTCAAACA 

ATTCCAAGMTGCANGMTGMAGTTGGCGCTTTCTTCCGCCCCANGGTCCCAGGACATT 

AGTCTGNGGCANGATNGAGGGGTNTNGMGGGGCCTTTCACACTTMCTTTATTCCTTTT 

ACCCTTCACMCATACAAAAGGCAACTTACACCTGGGATTrTTNCAAAACAACCTTTTAT 

TTCCCTCAGANGNCTTTCCNTTAATCCCTATGGAACAAAGAANGCTNGNCCACTTGAANT 

AGGGGCCCNAGTATAGGGGGCTTTGCTTTTCTACTTCCNTC 

Sequence 2642 

GGACTCCACCGCGGTGGCGGCGAGGTACGCGGGTATCTGTCATAAGCTCAACATCTGTAG 

ATCAGAGGGCT ACCAG AGGAACCAGTTTTAGAAGATGAACAACACCTNTTAGAAAAGAAA 

TTGCCTGTNACGTTTTGNAAGATAAAGAAGCGGGGANAACTTCGAACGTTGCAAACCTGN 

AAACTGGGAGAAACGANGGCAAGCTCTNTTGGAACAAGCAAGCTGCAAGGGAGCAAGGNN 

CGCCTGNCCCANCATGGAGCCGGCCNNGNCNANGGAAAAGGAAGGAGCGTGAGCGCCAGG 

AGCAAGAAGCGCAACAAGACAACTTGGAACTNGGGGAAGCAACTGGGAAANGCATGCTGT 

NAGCTANNAACCTGCANTATNAGAGGGAGGAGAGGGAGGAAAAGAAATTNGANAGGCGAA 

GAGGCTNGCAAAACTGGGAAACTTGAAAGGCAACGACAACTTNGAGTGGGAACCGGAANT 

CGAAGGCAAGAACNTNCCTAAATCAAAGAAACATAGGAACAAGGAGGGACATAGTTTGNA 

C 

Sequence 2643 

CAATTGGACTCCCCGCGGTGGCGGCCGAGGTACCTTATGTAGCCCAAGAAATTCAAGAGG 
AAATTGATGGGGCTCCTTCAGGAGCAGCGTGCAGATATGGACCAGTTCACTGCCTCAATC

TCAGAGACCCCTGTGGACAGTCCGGGTGAGCTCTGAGGAGAGTGAGGAGATCCCACCGTT 

CCACCCNTTCCACCCCTTCCNAGCCCTACCTGAGAACGAAGACACTCAACCCGGAGTTTG 

TACCTGCCCGGGCTGGCNCGCTTCTAGAAACTAGTTGGATCCCCCGGGCTGCAGGGAATT 

CGATATCAAGGCTTATCCGATACCGTCCGACCTCGAG 

Sequence 2644 

AATTGGAGCTCCCCGCGGTGGCGGCCGGGCAGGTAC 1 1 1 1 I I I I I I I M 1 I 1 1 1 I 1 1GGA 

AACAGGGTCTTGCTATGTTGCCCAGACTGATCTCAAACTCCTGGCCTCAAGCAATCCTCC 

TGG CTTAG CCTCTCAAAGTGCTAGGATTACAGGTATAAGCCACTGCACTAAGTCCAGTGA 

ATCATTTTCCAGACAACA TTTACT GAATATCCGACACTGCCTTCCACAGACGGGGGAGGT 

TTCCCAMGAGGCCGTGCTTTTTGAGAAGCTGGCAAGATAACCCTGGGGTGGAGGGAAGG 

TGGCCCGCAGCTGAGACCACCACCTTACTGTCCCCAAATCCTCACCAGAGGTTGCTGACA 

CTTTCCTNTTCCC CATCC TCATCTACTTTTCAAATTAGAACAGTAGAAAATGGGATGATT 

AGGCTCCTTNNCATTTTTAAAAAACCCCGTTGmCAMTTTTT^^ 

AAANGGGNGTAATCNCCTTNGTAAAATTTAAATT 

Sequence 2645 

CCGGCAGGTACCAATCATATAATNTATATAACATTGCTATCAGACTAAAAACACATTCTT 

AGCTAMGATAACTTACCATTTAGAAGTCAAAATGCAGGGAATCTTACTCCTGTTTCCAT 

TTTNTGNCCCNCTTGCTTCACTCGNGTATGNCATGCTCTATCTCTTCTCCTATGCAGACT 

TTANGNCNGTNGGCCATTMNTCTTGMGAAATTTCCTTCNNTCTTGCTGTCACNTACCA 

NNTTANTTGGTCTGCGTGCAACAAGAAGGNGTATTATANNAAAAAAGTTCTTGCTTAACC 

ATTCANGATTAAATAMNAAAAAAATTCCTTTGTTTNAACATTTTGNTATTTTT^ 

TACACCAAACTTTTTTMTTGCCTTTTNCANAGNNCCTTTCCCTCCAAAAM 

AAAAAATCTTCAATCNACATAAAATCAAACACCTGTATTGATCCATGTTCATGCTAAGCT 

GGGNAA 

Sequence 2646 

ACTCCGCGCGGTGGCGGCGAGGTACAAGCGCTTTGAATATCATGGGCACXDATGACTGTGA 

CCCTACAGGTAGGATTGGATCACTCCATGAGAGTAGCCGGCAGGTTTCTACAATGGCCTG 

GGAATGGACTGATTATTTTTATACATTTTCTGGCCTGAGAGAAAGCCAAGGTCCCCTGCT 

GTTCACAGCAACCCTGCCTGGGAGCTTGGAATCTTGGTAAATCTGCCCCGTTNGGATCTA 

TTGGAGGTAGGCTCACCCTTTTTNGTCTTTTGTGGGAAMTTAAAGAGAAATMTTNTCA 

GACNTATCATCACCTCCAGTGGAACTACAGAANACCTGGACCCANCTGCACTATTTTAAT 

GTAAAAATAACAATATGGCCAGGGTGCAGTGGCTCACGCCTGTAATNCNATCACTTTGAG 

CAGCCAAGGCGGGCGGATCACGAGGTCAGGAGATTAAGACCATCCTGGCCAATATGGGTG 

AMCCCTGT!^rrTTACTAAAATACAMAMATTAACTGGGCATNGTNTTGCGTGCX)TGTNG 

TCCCANCTACTTGGGANGGCTGNGACCAGGGGAATTGCTTTGAACCCCGNANGGCNTAGA 

ATTGCANTGAGCCNNAATCANGCNTCTGACTTCTACCTNGGCGACAGGANTGGGACNTTT 

TNTTAAAAAA 

Sequence 2647 

AGGTACCCTATATTCTTCTTGATTTCTAGCCTTTTATTGGCTCTCAGATTGCCAGAGTTG 

GGACTCAATAGTAAGCANCCATTCTGGTGAGGCGGAAGNGATNCTACCAGGGTGNGTTNT 

CATGACMGCANMTCACTGNGTTTTTCTCTCTACTCTGTGGCATANGACTCTATGCCAT 

AGAGNGACGTGTGAAAGGCTTGAGGCT 

Sequence 2648 

TCACCGCGGTG GCGG CCGCCGGGCAGGTACTTATTTTT C 1 1 I I I 1 1 I I I I i GGGGGTGGG 
GNCCTGGGNANTTTTm" NMG GGGCTTTTTAACNNGGGANGANANCCTGTGCCGGTTCAN 
C CCAN GCCGG TNGTN MTTTTCANCCTTTTTTCANCTNGCTTTGGGTTTTTM 
GTTTTCNANCNTTTTTCTTTNAAANCCTTC^ 

CTTCNAMTNTGGGCCTGGGTCCCTTNTCCCCGGNTTTAAAAAANAACTTNCCAAAAACA 

AAACANTTTTTTCCCCTNG 

Sequence 2649 
CCGCGGTGGCGGCCGCCCGGGCGGGTACGCGGGGGTCTCCAGAGTGAGTGTTCCGGAGAG

CACCGTGTTAGGGAAGGAGGAGGCTGCGGGCTAACCTGCCGGGAGGGAGGATGCTACTGC 

CTGTTTCCTATTAGTGACAACCCACCTCCTAATCACGTCCTGCTTCAAACAAGGTAACAT 

CACAGGACAGCCTCCGAAACMTMCTGTTTGMTATCCTTMTCrrCGGCMCTTCAAT 

MCTCCCCAAAATATATGTMTCAGAGMTTMCTTTTACMCTTTTGGTTATTGTTTGC 

TTGAACCATAAAGCAGAGCTCTTTCTGGGATAAAAAAAAAAAAAAAAAAAAAAAAAAAGT 

ACCT 

Sequence 2650 

AGGTACTTCACAATACAACTCTTGCAGAAAGTATGAAGACACTCTGTGATGGTGGTGGCA 

TCCACGMGTAGCCGGCGCATAGGCAGCAMCAATGTGTTCATTCAAGTCTTTGATCTTC 

ACTCGAACCTCCTCCTGACCCTCCTTTCCCAGGGGAGACTACACAACGTCGGCGACACAA 

CGCGCAGGCCCCCGCGTACCTGCCCG 

Sequence 2651 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATAACAAGCAATAATAAA 

AGTCCTCTGCCGAAGACAGAATAGTGTAAMCCT7TCCTTCCCCTGCCCTTACAGATTTC 

TAGAGAAGATAAT1TGGGAATAAAAGGTATAAAAGGTTTTTGAGAAAAGAGAGATCAAAG 

GACATTCAGAGTTCGGGAGGATATCTGGGAAGCAGACTTAGCTTAGATCCTAAAGGGTGT 

AGAAGATCTGCATTGCTGGGGAGGAGTGTGGGAGTGGTAAACCAGTCAAAGAATTCACAG 

ATGTGCTGGGAGGAGGAGAAGGGGAGCAGAAGCAGGAAAACAGTCTGGCTGAAACACTGG 

TTGTATTTCAAAGAGCAGTAGCAAATCAGGGTGGATAAATGTGTGGGCCTAAGCAGCTGG 

TGAG CATTAM TGTGTMGMTCAMTTGTATCCTTAGCAMCCTCTGAAGATTTCTGAG 

TAAGATTTTTATGAGTGTTGGTAAATTTATATAGGATGCATATGGGTAGTACCTGCCCGG 

GCGGCCCGCTCTAGATCTAGTGGATCCCCC 

Sequence 2652 

CCGCGGTGGCGGCCGAGGTACATAGTGTCGCGAACTCAAATCGGCATTTAGATAGATCCA 
GTGGTTTAAACGGCACGTTTTTGCTTATMAAAMGTGCAAAAAAGATGTGGTTTACAAG 
TTAMGCTACAGAATCCCTTT TTGCTG TMTTGCACCAGTTTTAAAGCCTCTGGACAGAG 
CAGTATTTCGTTTAAMCTTTGTTmCTTAAAAGCTTACAGTGm 

CCCCI I I I I ACAAGACGGGGGCCGGAGGGTGGACACTGGTGGCAGGTTAAGGGATACTGT 

CACTTTAAGMGCCTGCAGATTGMGTGTAAACATGGAGAAATTAGGGGCTGATTTTTTA 

AACTGTGTGAGATATTAACCAGCCGCCCTGTTATAAAATCAGGAAATNCAAACAGCGATT 

T ACAC CGATTAACACCC CCTTTATA TATTTTTTACAAAAATCACTGAGAAAATAATNAAC 

GTTTTCATCTCTCTTGGCTTTTTTTGGTTTTAAMAGT^^ 

NTTAAAAATTTAA 

Sequence 2653 

CACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCGCGGGCAGGTACACCAA 

ATGGATTACAAGCAGCATCCAGCAGAAGACAGACCCCCCAACCCTGCCCACCAGGGCTCA 

CACTCTACAAAACCCTGAGGGCCTAGAAATCTGTAAATGCATCGCCAAGCACTGGGGCTG 

ATTTGCAGTAATTCTGTAAGCAAGGCAAACATGATCTAGCTTTGAAGGCAGCATGAAGGC 

AGCGGGTTGGTGAGAATAATCTCTCCTTAAGAGAAGAAGAAACCTGGGGCGGAAGGAGTT 

TTCCCCGAAGTGGCTTCCCGCGTACAGMCACAGMCTTATTTCTGTCAGTTATTTAATA 

CATTGAAAATTTAGTGAAATGTTCAMGAGMTAGATGTTTCCCAAMCAACAACTTTAT 

GTTAAAAATAGTCATTAAAAGATCTGTTGTAATTAAAA 

Sequence 2654 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGGTAAAGGTATAGT 

AGTTGGCAGCAGAATGGACCCATTGAGGATAACTATAAAATTACNGAAAATATTTAAATG 

CAACTTATTGCTATAGAAGCAAAGAGGACTAAAGGGCAAAATTCTAGAGAGTGGTAAATC 

TCAGAAAGCACAAGCATAAAATGCAGCTCTGGGGGCCCTTTCCACTTCTGGCTATAGGGA 

AGAACCTGAATACTGAACTTGATTCAGGCAGAGGACCATAACCTGGGGGTCAGGGGAAGG 

CATGGGGGGGACCAGAAACCAGAAGAACGATCAAGACTGCAATGAAAAAAATGGATACAT 

TAGGAGCTTCAMCACATATMTTTCTCAAGAAATTTCCAGATTCTCATGCTGCATAGGG 
CAGGAGCCTGAAMGCTMTTTGAGMGATMT^
TTTGCAAGTACCTGCCCGTGCCGGC 
Sequence 2655 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGGAGGTACCACTGAATCCAAGG 
CTCTCTTGGGTAGCCTATGTGCCTCTTGGATGGTATGTGGAAGCCAAGGACTGTCTGAAC 
GTGCTGAACAAGAGCAACGAGGGGAAAGAATTACTCGTCCCACTGACGAGTTCTATGTAT 
GTCCCTGGGAAGCTGCATGATGTGGAACACGTGCTCATCGATGTGGGAACTGGGTACCT 
Sequence 2656 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCCCCGGGCAGGTACTTGAACAGTAGGAGG 

AGGTGGTTCCTCATTCGTCTCCCGGGAGCGTCCTCTTCTCAGTCAGGCTGGCACCATGAC 

CCAAGGAACCTCGGCGAGTGACGGATAAACACCAGTCGGCCCGCGACTAAGAGCTGCGCC 

CCCGCGTACGCGGGTCACCAGGGTCAGTTTCTTTMTGATGGTTTCCAACTGGCCTAATA 

CATTAAGTAAGACTGGCTGATAACATGACCAGACAGACATAAAGACCCTGTTGGGAATGA 

CATTGAACTCTCAAAGTCAAGATTTCTTACACAAATCTATCAGCTGGAGAAAATGAAGGC 

AGTGTGGTATATGTGTGCAAATAAGGACATTATGAAGCTTAAATATGGAATGTCTCTTGG 

ACCCCCGATGTCATCTGNATTCTCTTTTTCTTCTTGTACCCT 

Sequence 2657 

CGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACCATCTTGGCTCACTGCAA 

CCTCAACCTCCTGGGTTCAAGCGATCCTGCTGCCCCAGCACCACCCCCTCTCCAAGTAGC 

TGGGACTACAGAAGTGCATCACCATGCCCAGCTAGCTMCTTGAATTTTTAGTAGAGACA 

GGGTTTCACCAGGTTGCCCAGGCTGGTCTTCTGAGCTCAAGCAATCCACCTNCCTTGGCC 

TTCCAMGTGCTGGGATTACAGGCGTGAGCCACCGCACCCGGCCCATTTTATATATTATT 

TTTGCATAGCCTTCATTGTCTTAGCAAAGTCAGAAGAAGACCANTAACATAATGTNATTT 

AATTAACAACAACGNCAACAACAACAACAAAACAACCCNGGATGGGGGCAGTGGCTCAAG 

CTANTMTGGCCGNCC TTATGAA AGNCCAAGGGGNGNGGATTGGTTGATTCCATTTTAAA 

ACCAMCTGMNCCCTNTTTTTTANAAAAAATMACCGTTAAAAAMTCCTTGGTTTTTG 

TTTAMNGNGCCGGTCCCCCCCCCAAGGAGAAAATTTTTNGGGGGGGGGGNGTTAAAAAA 

AA 

Sequence 2658 

GGGCG AATCGGAGCTCC CCGCG GTGGCGGCCGAGGTACTTTN I I I 11 I 1 1 11 1 I I 1 1 1 I I 

(Ml! iCTCCANAGGCTAGTTTTTCTTCAGTCCTTAANAACTCTGCTCCTTATATGGGCT 

TCGGTGGCAGTCATGGGGCAGCACCGCAGGTCTACAGTGGGGTGGAGGTGTTCGGTCCTT 

GCGGGGCTTCATGACCTTGATTTNTGGCGGTCGGGGGGCANCACCCNCAGGTCTACAGTG 

GGGGGAAGGNGTTCGATCCTTGTGGGCTTAATGACCTTTGATTCCTGACTACCGGGCTGT 

GAATNGGCACAACTTCACACAAGTATTGTGNTTTCNCATCNACCCTGGGAAAGAACCCTG 

CCCCGGGGGGGCGGGGTTT 

Sequence 2659 

TTAGGGCGMTTGGAG CTCCCCGCGGTGGCGGCX)CGCCCGGGCAGGTACTTTNTTTTTTT 

I 1 1 1 1 1 I I IAI N I M I I I 1 1 I M I 1 1 I 1 1 I M 1 1 1 I 1 1 1 1 1 1 l M I M I 1 11 f 1 1 1 1 1 1 

i i I 1 1 1 I I I 1 1 I I I I 1 1 I I INAAAAAI 11111111111 I NAAAAAAAAAANCCNCCNCCC 

CCCNMNNNCCCCCNAAANAAAAAAAAMNNNNTTNAAMAMTTTCNCGGGNNNTTCCC 

CCCNNAANNTTTNT NTNGGC NAAAAAAANNNNNNN I 1 1 I 1 I I I NGGGGCCCNNCCNTAAA 

AAAAAMNTTTAAMTTTTTTGGGGGGGNCCNGGGNANCCCCAATTTTNANNTTNGGGGG 

GTTNAAAAAMNAAANGGGGNAANNAAAAAMNGGNCCNNNNAATTNGNNTTNTNAAAAT 

TNCNCCCNNNAAMNTTTTTCTNMN^ 

GGTAANGG GGGGCCAGGGGGGGNTTTTCCCAANTTTTNCCCGGCCCAAAAAAGGAAAAAT 
I M 1 1 1 I I I GTTGGGGGGAGGG AGTTTTNAAAAA 
Sequence 2660 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGGAACAGGGATAAG 

TTCTTGGATAAGGTGCCAACATACCTA TAAM GCTGATTTTTGAGTAAATTATTGATTCT 

AACATATGTMTGGATTTGGTGTGATAATTTTCTGATCTTTMCTATAAGTGACTTTTTA 
TTCTCCACCAGAAAAGATAAATGACTGAGAATGTAAGTCTGCGCTCTGATTAACACAATG

GAGAMCGGAAAAACTATC TCTGT TAAAAACTGATTCCTGTCATTCTTCTGATATCAAAT 

MGAGGMGGAAMTAMCTTTTTGTGTGTAGATAGAAAAACATACCTGAGGCCAGGTGC 

AGTGGATCACGCCTGTAATCCCAGCACTTTGGGAGGCCAAGGCGGGCAGATCAGCTGAGG 

TCAGGAGTTCGAGACCAGCCTGGCCAACATGGTGAAATCACGTCTCTACTAAAAATACAA 

AAATTATCTGGGTGTAGTGGTGCGTGCCTGTAATCCCAGNTACTCGGGAGGCTGAGGCAG 

GAGAATCACTTTAATTC 

Sequence 2661 

TCGGAGCTCNCCGCGGTGGCGGCCGGGCAGGTACTCTGGCTGCAGACTGACCTTGCTCAG 

GTCCGAGAAGGATGGGGCAGCCACTGGAGTGGATGCCATCTGCACCCACCGTCTTGACCC 

CAAAAGCCCTGGAGTGGACAGGGAGCAGCTATACTGGGAGCTGAGCCAGCTGACCAATGG 

CATCAAAGAGCTGGGCCCCTACACCCTGGACAGGAACAGTCTCTATGTCAATGGTTTCAC 

CCATCAGACCTCTGCGCCCAACACCAGCACTCCTGGGACCTCACAGTGGACCTTGGGACC 

TCAGGGACTCCATCCTTCCTTCCCMGCCCTACATTTGNTTGGCCCTTTTCTGGTGCCAT 

ACACCCTTA 

Sequence 2662 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGAGGGAGGGACATACAAT 

ATTTMTAGGATATTTCTACAGAACAATMCTTATATTATGTCCTTGTAAAAATCTGTAC 

CTCTTTAAAACATTTAACTG AAAC ATCCA I I I I I 1 1 I I AGCTTTGCTAATCAAAATTGTT 

TTAAGAATTAAAACTAGGTTGTAACTAATGTCAGTACCTGCCCGGGCGGCCGCCCGGGCA 

GGTACATAGGCATCCTATTCACTGCACCCTGTCACACCCGGCACCCCCCGCCCCGCACAT 

TATTTGAAAGACTGGGMTTTAATGGTTAGGGACAGTAMTCTACTTCTTTTTCCA 

Sequence 2663 

AGTTCGGTGTAGGTCGTTTCGCTCCMGCTGGGCTTGTGGTGCACNGAA(X)CCCCGTTTT 
CAGCCCGACCCGGNTGCGCCTTATTCCGGTAACTATTCGTTCTTTGAGTCCCAACCC 
Sequence 2664 

AGGTACTGCAACTGCCAGAACTTGGTATTGTAGCTGCTGCCCGCTGACTAGCAGCTGGAC 

TG ATTTT GAATAAAAATGAMGCATTAMGGGTTTCCCTACAAMCATTTTTCTTTAAM 

TACTTTTGAAAATGGCTATMNCAGTTGACTTTCACCCTTGGAGAGCATCACACTGTGTG 

AGGTTCAGTGATTGTTGAC CCTNC CCAGCCCCTNCTGCTTCTTTAAGTTATCTGTGTGCG 

TGCNCTTCCTCTCAATCTTNTTTTGCACCGCTCATTTNTTTTTCTCTGACCCATGAAGAA 

AAG G AAMACTTT ACTG ATTG ATAAANTTTTT AAAAN AA 

Sequence 2665 

GGGCG AATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTA A I I I 1 1 I I I I I I I 

TTTTTTGGTAGMGTGGTGTCTCACTATATTGCCTTGGCTGGTATTGMTTCyrTGGGCTC 

AAGCAATCCTCCCCTCTT GGCCT CCCAAAGTGCTGGGATTACAGGCATGAGCCACCTCAC 

CCAGCCCACTTATTCATCTTTTTGCCTGCAAGCTACACCACCAAAGCCCCAGGTCAMCA 

TCTTTCTCCACAGACTGTGAGAAAAGAGCCTTTCTTCCTCCTMTTTGTMTGGCTTTCA 

GCTCTATNTGTCTAGCTTCAATCCTGACATCTGCAGCTTACA 

Sequence 2666 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAATGAGGCTTGGCTCTGTGTGAT 

GCACTCTMCCAGCCCACACTTCAGCAGCGGCATCATTTGGATTCAGGGAGAGTTTCTGC 

AGCATGAAGGAGTAGGAAGAACACAGGGTGAGTTCACAGGMTCCCTACTTTTCCATGTG 

ACCTTGTTTGGCCTGMGTGATTTTCTTGACCTCTCTGGGCCTGACTTTCCTCATCTGTA 

AAGTGGAGGGTTTGAAAATATCACAGAAAACAAAATACCTTAAAGGATGCTCTGGCACAC 

AGCAGGTATTTCCATATMTGATACCTCCCATTCCTTTTTATGTGAGCTATATCCCCTGA 

AAC CCAG GTTTGACTAAATTGAGACCAACTTTCATAATATACACAATGACTGNTAGATAT 

GMTTTTGGTGTGGTGAAGATGGGGAGTGAAAAAAGTAGAAAAAGTCAAATCTCATTGAA 

TAAAAAAGGG 

Sequence 2667 

NNGGCGGCCGCCCGGGCAGGTACTTTCTTTNCT I rTTTTNTTTTTTI i GAGGCAGAGTCT 
NGCTCTGTTGCCCAGGCTGGAGTGCAGCGGTGCGATCTTGGCTCACTGCAAGCTCCGCCT

CCCGGGTTCACGCCATT CTCCTGC CTNAGCCTCCCAAGGAGCTGGGACTACAGGCTCCCG 

CCACCACGCCTGGCTMTTTTTTTGTATTTTTAGTAMGACGGGTTTCATCGTGT^ 

AGGATGGTCTCGATCTCCTGACCTNATGATCCGCCCGCTCTGTCTTCCCAAAGNGCTGGN 

ATN ACAG GGCCTGANCCATTGTGCCCAGCCAAANTGTNCCTTTGNAAAGTTNGCGAAATC 

AGATTTTGTTTCCCAATAGAACCAAMTTTTTATGAGGGATGCTAGCATTTT^ 

ATANTMTTAGTTTACAACCTGMNAMTATTATGTTTTGTANTAGATAAATATTAAGGT 

GNGGATTTTAA 

Sequence 2668 

CTAT AGGGC GAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCTTATTACAT 
ATGATTTTTATTAGTTTCTGGAGGCAMTTTMTTTTT^ 

TAAAAGAAATAGTTCTCAAAAAGACAACGATGACTGGGTGTGGTGGTGTGTGCCTGTAGT 

TCAAGGCTGCTCGGGAGACAGAGGCAGGAGAACCACTTGAGGCCAGTTCAGTCTAGCCTG 

GGTAACATAGCAGGACCCTGTCCCTAAAATAATAAAAAATTTAAAAACCACAATAATGTG 

AGTTACAAAAAAGTGTAACTTATGAAAAGTCCGTATTTATATTGAC 

Sequence 2669 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGCAGGGGT 

GGGAGC ATTTA TGGTGATCAGATGTGCCTGGCAAGCCCCTGTTCAGACATTGCTCACATT 

CCAMTGTTTTCGTGTAGAATATTGCACAGGTCTGGGGACGCTCTACCTGTGCCCTGTGA 

GTGTTAATAATGGTGGAGAAAGAGTGTAGCTGTGCCCTTGAGAGAGAAGGTGAGGGAAAG 

AGTGCACCAGTCAGCTGACCGTCAGCTGGCTAGGCTCTTCACTGAGTCCTATGTCGCAGT 

GCACAAATCACTGCCCATCANGCCTCAGTTTCCTCATCTGGTAAATGGTGATAACATCAA 

TCTGCCCCCCCGCCAGGGTGCTGTTATGAGGGTCAAAAGTGGTAGTGGAGGGTAATACTG 

GNTGAGTCCATTTGTGTGTGGGAGGMGAAAGGCTTTACATTNACCTGGTACCTTGGGCC 

GCTTTMGAACTAGGTGGATNCCCCCGGGCTGCANGAAATTTNATATTAAAGCTTATTCG 

ATTCCCGTCCACCTTNGAGGGGGGGG 

Sequence 2670 

AT AGGGC GAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCTTATTACATAT 
GATTTTTATTAGTTTCTGGAGGCAAATTTMTTTTTATTTTAAM 

AAAGAAATAGTTCTCAAAAAGACAACCGATGACTGGGTGTGGTGGTGTGTGCCTGTAGTT 

CAGGCTGCTCGGGAGACAGAGGCAGGAGAACCACTTGAGGCCAGTTCAGTCTAGCCTGGG 

TAACATAGCAGGACCCTGTCCCTAAAATAATAAAAAATTTAAAAACCACAATAATGTGAG 

TTACAAAAAAGTGGTMCTTATGAAAMGGTCGGAATTTAATATTGACCTTTGGTAAGAG 

CTCMTTANTTTCAGGGAAAGGCAAGGGAGTATCACCATTCTGAGTAATACAATTTCAAC 

TMTCTTTATTTCTTCTACTTGAAGTCAGTGCCTATCTACCACAAACATTCCTATATCAG 

TGTGCAAATTMTTTNGAGACAATAGGCTTTTTACGACMTGMTTGGTTACTTTNAA 

Sequence 2671 

ACGGCGA A TTGGAG CTCCCCGCGGTGGCGGCCGAGGTACTTN NI rTTTTTTTTTTTTTTI 

mrm icatttttataagaatatataaaaaatgatataaanggacatttacggtagtg 

GGGGAAGGCATATATNTACGTTAAAAGGCAGGACATTTTTAA 
Sequence 2672 

TACGAGCCATTTACAAAAAATCGACCGCTTCAAGTCAGAGGGTGGCNGAAAACCCGACAG 

GGACTATTAAAGAATACCAAGGNCGTTTCCCCCCTGGGNAAGCTCCCCTCGTGCCGCTCT 

TTCTNGTTTCCCGMCCCCTGTCCGCTTTANNCGGGAATACCCTGGTCCCNGCCTTTTTC 

TTCCCTTTTC 

Sequence 2673 

CCGCGGTGGCGGCCGAGGTACGCGGGATGTGGTCTAATCAAAGCCATCTCAATTTGTAGA 

TGAAGAAGGCAAGGACTAATGACAAGAAATGAATTGTTGGCCGGGCATGGTGGCTCACGC 

CTGTAATACCAACACTTTGGGAGGCCAAGGCTGGTGGATCACCTGGGCTTGGGAGTTCGA 

GACCAGCCTAACCAACATGGAGAAACCCCGCCTCTACAAAAAAAAAAATACAAAAATTAG 

CTGGGCATGATGGCGGGCGCGCCCCCTAATTCCAGCTACTCCTGAGGCTGAGGCAGGAGA 
ATTGCTTGAACCCAGGAGGCGGAGGTTGCGGTGAGCTAAGATCGCACCATTGCACTCCGC

ACTCCGGCCTGGACMCAAGAGCAAAMCTTTTGTCTTAAAAAAAAAAAAAAAAAGTACC 
TGC 

Sequence 2674 

CCGCGGTGGCGGCCGACGNACCNGGTNCTCTGTGGGATACNCATTAGAGTTGCTCGNTGG 
ANATGGMTGATGGTGGGGTGCATTTNANCATGGCTGAI^GCTTTCTGCTTAAGGACCTG 
ATGTATTMTGCTCtCCANGTCATTCATATTTGGGGGAAGGAACAAAGAAGGGTACCTN 
Sequence 2675 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGTAGCATGATCTTGGCTTA 

CTGCAACCTCCGCCTCCCTGGTTCAAGCAATTCTCCTGCCTCAGCCTNCCGAATAGCTGG 

GATTACAGGCATGCACCACCATACCCGGCTMTTTTTGTATTTTTAGGAAGAGATGGGGT 

TTCACCATGTTGGCCAGGCTGGTCTCGAACTCCTGACCTCAGGTGATTCACCCGTCTTGG 

CCTCCCAAAGTGCTGGGATTACAGGTGTGAGCCACTGTGCCCGGCCTCCATTACACCTCT 

TTATTCTAGTTCAACTCAGACCGTGAAGTTAGCATACAGGTCCTCAGGAGTTTGAGGCCA 

CTTTCCMGGATAAGGGCCACCTTCMGGGCACATCTTGCCCTTAMCAATTAMTTTNT 

GMAGCTTTTGGGAAAAGGGGNNGAACATGCCTTTT 

Sequence 2676 

ACTTAGGGCGM TTGGAGCTCCCCGCGGTGGCCGGCCGCCCGGGCAGGTAC I 1 1 1 f 1 1 1 1 

II I I I I I I I I I I I GAG ATATAAATTGGTATGTTCACATGTTGGTAAGAAAGATCTAATAG 

CTAGAAAAGGAACTGCTGAAGTTGCAGGAAAGAGAAGAGAAATNTAAGGAATGAAACCTG 

AA AAATA GAGGGCAGAGTCAGAGGGGTGGAGAAGGAAGGGGCCCTGACGGGAGGNGGACC 

ACTTTTTNCAAGGCACATGCAGGCAGGCACCTNTNTGGAGGGTCCCATGGCTGGCTATGt 

GTGGCAGTGCATGAAATAGTCACTGAATATGCTCCCTTGCTGTCCTCTGCTTGTGCTCAC 

CGAGGAGTGACTGTGTCTACTGGAMCMGTTTGCCTTTTCACTGGTCTAAATAAANGGA 

TCMTGACCTCAAATCTAMGAGCATTAGGAAACCTTNTNAAGCCANAACTGTACCCTTG 

GCCGNTNTAAAACTAAGGGGGATCCCCCCGGCTGCAG 

Sequence 2677 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGGCTGCGTGCGGCGGG 

AATCATGGCTGCTCGCAGAGCTCTGCACTTCGTATTCAAAGTGGGAAACCGCTTCCAGAC 

GGCGCGTTTCTATCGGGACGTCCTGiSGGATGAAGGTGGAGTCTTGCTCTGTCGCCAGGCT 

GGAGTGCAGTGGCGCGATCTCGGCTCACTGCTCTGACTACACCCGGATAACAGAAGACAG 

TTTCTC AAAG CCTTATGATGGGAAATGGAGTAAAACAATGGTGGGATTTGGGCCTGAGGA 

TGATCATTTTGTCGCAGAACTGACTTACAATTATGGCGTCGGAGACTACAAGCTTGGCAA 

TGACTTTATGGGAATCACGCTCGCTTCTAGCCAGGCTGTCANGCAAACGCCAGGAAGCTG 

GMGTGGCCACTGACCGGAMGTTGCANGAAAGGTGTTTTTTTAAACCCAGG 

Sequence 2678 

NCCCGGGCAGGNA C 1 I I 1 I I I M I 1 1 1 1 1 1 1 I II M 1 1 I 1 1 I I AATGGGGATTTGTCTAT 
TAAMTAGGTTGTNGGAAGNACCCCANAGGACCCTTNTTGGATCAGGAATCAATAATCCC 
ACCNAGACGGGGNCCCAGCATTTATTGGGNCGCAGCATAAAACTTGGCCACCTCCTTATT 
GGGGNTGCCCTGGGCTGAGTCAAACCACATTTGGATGCANCnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnhCTGCAGGMTTTCCATTATTNAAAGCCTTATCGGATNCCCGTTCGAC 

CTTCNAGGGGGGGGGGCCC 

Sequence 2679 

TAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACTAACANGGGTGTCTGGTCTT 

AGMGCCTCCCTTCAGATCCCAGCTGACCCTGGTGACTGCCTGGCCTTGATGTTGGCTGC 

AGCCTTCTGATAGAACCACATGGATTCCACCCACAGCTGGCCAGGCTTGTTACATGGGTC 

MGGGMTACAAATGGCCCCCCCCCAGGGAGCAGGTGTTGGCCTCAGTTTTCAGGGACCC 

TTGGTGTTGCTCCTTACCTAGAGCCCATTAATCTACCCCATNAACTCTCTTGCCATGAAA 

AGCCATCTTCCAGGAGCCCTGTTTT 

Sequence 2680 

AGGTACTCAAAGACGMTCATGAAAAAGAAAAAMCTTTATTTCAAACAGGTTCAGTGAT
ATATGTGGGTGCTNCAGCAMGGCTGGTTGTGGCAMGTTTCATTTCAAACTGTATGATG

TGGGCTGGGCMGGTGGCTTCACGCCTGTAATCCCAGCACTTTGTGCCCGCGTACTCAGC 

TGTGTTCATGTGGNGGTCTGTGGAAAGAAAAGAAGACTCGTTTGGAAATGAAGCTGTCCC 

TTTCCAAGCAAGTCTTCTGGTGGCTTTTCTTCTCTCAAAAATGGGATCCCGATAAAATAT 

TTGAATAGGAGCNGAATTGGTAGAAATGTTCGTGCCTGTCACCCCAGAAAAGCCTTGCCT 

GGTTTT 

Sequence 2681 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTGGTTTAAT 

TTCACAGTGAGGGAGTCGGTTGGTGATCTCTAAGAAATCCCCAAGCACCTGGTGTTGGGA 

AA GTCCCTC AAATAAAGAAGTGTTCTTTTC I 1 1 1 1 1 I 1 1 CCTCTCTCTCTTTCTATTTAT 

TTATTTTTTGAGATGGAGTCTTGCTCTGTCACCCAGACTGGAGTGCAGTGGCAGGATCTC 

AGCTCACTGCAACCTCCACCTCCCGGGTTCAAGCGATTCTCCTGCCTCAGCCTCCCGAAT 

AGCTGGGATTACA GACAC CCACCACCACGCCCAGCTMTTTTTGTATTTTTAGTAGAGAC 

GGGGTTTCACTATGTTTTGTCAGGCTGGTCTCGAACTCCCAACCTCAGGTGATCCACCAC 
TTAACCT 

Sequence 2682 

CCTCAGATTTTGGGCCTAGGAAGGTAGGTGATTTAAACTCACTGAAAGCATGTACACCTT 
GCTGTTGCTGCTTGCTGCCACTGCTGCTGTTTCATCTGTTCCTGCTGCCGCTGGAATCGT 
GGAGGTAAAGACTTCTGAAACTGTTTGAAATAGCCAGATAATACAGC 
Sequence 2683 

GCGAATTGG AGCTCCCCGCGGTGGCGGCCGAGGTACTTTC i MINI I CTTTTTCTTTTT 

CI 1 1 1 I I I I I AATGTGAGACAGGATCTCATTCTGTTGCCTAGGCTGGAGTGCAGTGGCGC 

AATCTCGGCTCACTGCAACCTCTGCCTCCTGGGCTCAAGCAATTCTCCCACCTCAGCCTC 

CCAMTAGCTGGGATCACTGGCACAAACCACCATGCCCAGCTMTTTTGTATTTTTTGTA 

GAGACAGGGTTTCACCATGTTGCCCAGGCTGGTCTCAACCTCCTGGGCTCAAGCAATCCT

CCTGCCTCGGCCTCCAAAGTGCTGGGATTACAGATGTGAGCCACCGCATTCAGCCCCACA 

CCCTTATTTATACCAATTACCTGCCCAGTMCTGNGGACTTTTGCTTCCTCACCCTGTTT 

TGATCGCTTTAAMCTMGTGGGATCCCCCCGGGCTGCAGGMTTTCGATATTCAAGCTT 

ATTCGAATACCCGCCGCCTTCGANGGGGGGGG 

Sequence 2684 

GACTACTAT AGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTTC 

TTTTCTTTTCTTTTC I I 1 1 I I I I CI I I I I I 1 1 GAG ACGGAGTTTTGCTCTTGTTGCCCAG 

GCTGGAGTGCAATGACATGATCTCAGCTCACCACAACGTCCACCTCCCAGGTTCAAGTGA 

TTC TCCT GC CTCAG CCTTCCCAAGTAGCTGGGATTATAGGCATGTGCCACCAAGCCTGGC 

TMTTTTCTATTTTTMGTAGAGATGGGGTTTTCTCCATGTGGGTCAGGCTGGTCTTNGA 

ACTCCTGACCTCAGGTGATCCACCCACCTCGGGNCTCCCAAAAGTGCTGGGATTACAAGG 

CCGTGGAGCCCACCGNACCCAGCCAAAGGCCAAAATNCTNAAGTCTTTTG NI M I NIM 

TCAMTTGAGGGGTNGTTATTTMCAACAAGCTTGGATCATTTGAGNCCMTTNTTTTGG 

TTCCCCGTTTCCCCACCCGCCAATTC Ni I I I I I I M I C 

Sequence 2685

AGGTACMCATTTAGAGMCCTTAGCTGCCCAGAAAMCTCAGATTTTCTGCTTTACAAA 

AGMTAAAAATCATCGMTTTATTACCCTGGACTTTATTGGAAATCAGTGAAGAATATTC 

ATACCAMTACCAGGTTTCACGMCNTTNCCTCTCTCTCTTTCNTTCAAGGTAAGGGGGT 

TTGGNCAAAAGGNTTGTTT^^^TTTTNGMAANGCCGCCCNGNTTGGNNCNAGGTGGNNCC 

TCT TTAAG GGTTGGGCNNAC GGGG GGTGGTTMGGAAAAMTTCAAATTTAMNCNTTTT 

TATATTTTCCC GGGGGC TGNGTTTTAANCTTTAAMCCCCCTNGGTTTTNCCAAAGGGGN 

AATTCCCAAGGTnTTTAAAAAAAGGCCNN l I I I I I I I 1 1 I GGGGACCCTNGCCCNGGNC 

GGGGCCCT TTTANAA AAAAAGGGGGGGNTNCCCCCGGGGGGTGGGANGGAANTTTTGGAN 

NTTTAAGGCTTTTTTGGGTANCCGGGGAACCCTTTNAGGGGGGGGGGG 

Sequence 2686 

CCGCGGTGGCGGCCGAGGTACTCCAACCCAAGCAACAGAGCAAGACCCTGTCTCAAAACA
AMCAAAACAAMCAMCAAAAAAMTGAGGTAGGCATGTTTTATTCCCATCTTACAGAT
GAGGAGACTGAGGCAATAATAATTCAATGGCTTATCTAAAGTCACAAAGCTAGTAAGGAG 
CAAAATCCAGTTNT GTCTGCTTCCAGCCCACCTTGTCCCACTTGCTTCTTTATT 
Sequence 2687 

TACTATAGGGCGMT TGGAGCTCA(X)GCGGTGGCGGCCGCCCGGGCMGGTACTTTTTTT 

I N 1 1 1 1 I I I I I I 1 1 I GGGTATATAAACTATTTATTAACAGACAAGGCCTACAGACTTAT 

TTCTTCTTGGACACACCCACGGNGCGGGCACGGGCGGNNMGTNGGTTTTGGTGTGCNTN 

GCNTTNGNACACTAM GGGCC CAAAAAGTTGACGCANTCCCTTTTNTGGGCCCNMTTTT 

TTTAANTCNTTCNAAGGTTTTTNACGGGAGN 

GCTTGTATTTTCCATCI^rTTTANATCCTTIvrrTGTTCTG^ 

TGTACCTNGGNCCGCTNTAGAAACTAGTGnnnnnnnnnnnnnnnnnnnnnnnTGGATATT 

AAAGCTTATTCGANNACCCGTCCNNCCNTTCAANGGGGGGGG 

Sequence 2688 

AACACTACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACAGTG 
GAGAAAGAAGGGGGACCCCACCAGGGCTATGGAGAGACAGTAGAGGCAGGACTGAACCGT 
CAGCAAAGATTAAAAGGATGACCTGAGGCTGGCAACCACAAAACC 
Sequence 2689 

CACTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAGAGTCT 

TTTGCTTCCTCCCACCCCTAGGGGGAAAMCTGCTTTGTGCTTTGGGAAGTTGTCTCTGA 

AACCCGGGGACAGAGGACGCAGGACAGACTANGAGGGAGCCCGGGAGGATGGGCTGCANC 

TGTGGAGGAGGGTTTTAAAGGAGAGAGGGTCGGAGAGCAAAAGGCCTNAANAAGCCCANA 

AGCMGTTGAGAGAGGGTGGAAMGTGAACCACCGGCTTGGGCTTGAACCCGACACGCTT 

TTCTTCCAATGGTTAAMTAGCCACCTTTTAGAAAAAMTTTNACAAAGGNTCCCATTCC 

NCNAAAAAAAAAAAAAAAAGGAAAAA 

Sequence 2690 

TATACGACTACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGACG 

GGAAGGATTCCTGCCAGGGTGATTCTGGGGGTCCGCTGGTATGTGGAGACCACCTCCGAG 

GCCTTGTGTCATGGGGTAACATCCCCTGTGGATCAAAGGAGAAAGCCAGGAGTCTACACC 

AACGTCT NGAG ATCACGAACTGGGATCCAAAAAACCATTCAGGCCAAGTGACCCTTNACA 

TGTGACATTTTACCTTCCGACCTACCACCCCACTTGACTGGT^ 

TAMCCTTGGCTTTCCCTTNTNTNNTGGCCAGNTTTTNACCCTGATGCGTAAAAAACGCA 
ACCGACGTGAGGGGTCCTGANTNTCCCTGGGNTTTACCCCAACTCCATNCTTNGATTAAT 
GGGGG 
Sequence 2691 

CACTACTTAG GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGGCACGGTA C I I M II II I 

I I I I I I I I I I IGGTAGGAGATGAGGTCTCATTATATTGCCCAGGCTGGTCTCAAATTCCT 

GGCCTCAAGTAATCCTC TCGC CTAGGCCCATAGTATTGGGATTACGGGCGTGAGCCACTG 

CGCCCAGGCTTTACTAGTTTTCCATCTATCTTTAGGCCTNCTCANATTTCTTGTGGGGCT 

CCACTTCCACCAGTCACACCTTTAA 

Sequence 2692 

TACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGCACTCAAA 

AAMTTATTTCTCAATTTGTTTGCTCATACTATTG A I M I N I I CTAGAATCTAAATAAT 

TGTGAATTAACTATGATAGCAGTATGCACAAACAGTAAGTGAAATCAGACCTATTAATTC 

TGAGAGGAAGGAGGTGTCAGGATTTTCACAGAAGAAGAGCACTNAGGCCAGGCGCAGTGG 

C TTAT GCCTGTMTNCCAGCACTTTGGGAGGCTGAGGCAGGCNGATNAAGGNGGNCAGGN 

GTmANACCAGCCANCCMACNTGGGGAMCCCCGNTTmA^AAMNTCCAAMT^ 

CCCCGCCTGTA NTCCC AGTTACTNAGNGGGGNTTGGGNCAAAAA AI I I II II I A CNCCCG 

GNGGGNGNGGGTTTTNCAGAGNNTATAMTCGCGCGNTTNGCTTTNAMTTTTNGGGNAA 

AAAAGAAACGAGTTCNCTTTTGAAAAAAAAAAAAAAAAAAAAANGG 

Sequence 2693 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATAAGCCTAA
ACAATTTCACCTANGTAAMTATTGATGTCATMCCAMCTATATGGCCCCGTTTCATAA

AGGTTACTATATTCTATAGAGAGTGAAGAGGTGGCCTTTCTATCCCAGCTTACCCTATTC 

TTGTTATTGTTCAAATTCTCCTGAAGCTTGCATAACTAGCTGCCATCAGGTAAATGCTAT 

TGGCTAGCAGAAGACTGCAGTTCTGTTAATATTAGAACCAGCAGGGGGAACTTGGGAACT 

TGACATTAAAAATCTAGAAACGGAATTTTA 

Sequence 2694 

CCGCGGTGGCGGCCGCCCGGGGAGGTA C \ I M I I I I 1 1 I I 1 1 1 11 1 1 I GGGCGGGGGTCT 
TTATTTGAGTTTAGGCATGATTCGAATGAAGAGGATCATGCTAATGAAGATGAAGCAGAC 
GATAATGAGCGTGGCCCAGAGCAGCCAGTTGACTGACT 
Sequence 2695 

GAGCTCCCCGCGGTGGCGGCCGAGGTACACCTGTGGTCCCAGTTACTCCAGAGGCTGAAG 

TGAGAGAGTCTCGTGAGCCCAGAAAGTTGAGGCTGCAGTAAGCTGAGCCATGATTGCACC 

ACTGC ACTGT AGCCTGTCTACAAACAAATAAACGAAAAAACAAAAAGACTTGTGAAAAGT 

GCTGATTTTTMTTAGGAAAAGATTAMCATTGGATAGTCATGGMTTGTTTACTGAACA 

TTAGAAATTGGTTGCAAGGGTCTATGCTTCTGTAAAATAAA 

Sequence 2696 

ACTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATATATGAATCACTA 

AGAGTTCAGAAATTAAGCTCACTTTAAGAAAACTCAGCCAGGCACAGTGGCTCACGCCTG 

TAATCCCAGCACTTTGGGAGGCCAAGGCGGGTGGATCACGACGTCAGGAGTCCAAGACCA 

GCTTGACATGGTGAAACCCTGTCTCTACTAAAAATACAAAAATTAGCCGAGCGTGGTGAC 

ACACGCCTGTAATCCCAGCTACTCAGGAGGCAAGGCAGGGGAATTGCTTGAACCAGGGAG 

GCGGAGGTTTGCAGTGAGCCCAGATCGCGCCATTGCACTCCAGCCTGGACAAGAGAGCGA 

GGACTCTGACTTCCAAAAAAAAAAAGGAAGAAAAAAAAAAGTTCCCTTGCCCGGGCCGGN 

CGNTTCTAAAAACTAGGNGGGNTNCCCCCGGGGCTTNNANGGAATTTNGATTATCAAAGC 

TTTNTCGANNNCCCGTCGACCCTCGAGGGGGGGGC 

Sequence 2697 

CTACTTAGGGCGAAT TGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACI NiMH 

I I 1 1 I I I I I 1 1 I I I I IGGTANGGGACCGGGGTTTTACCATGTTGGCCAGGATGATCTCGA 

TCTCCTGACCTCATGA NCTGC CTGCCTCGGCCTTCCAAAGTGCTGGGATTACAGGCATGA 

GCCACTGNGCCAGGCCTTTTGTCCATTTTTTATTGMCTGCCTATTNCTTCTTACTGATT 

TGTAGAAAGTCTTTATTCTAGTCTGGCAGGTATTTTNTTTCMCACTTCCAAGA^ 

ATTGGCTTTGGNAGCTAATGCTTCATTTGANAAAATGCTACCTGTATTATTGATGCCCAT 

GTGA 

Sequence 2698 

CACTACTTAGGGCGATTGGAGC TCNCCGCGGTGGCGGCCGCCCGGGCAGGTACAATTTCT 

TTTTTCI I 1 1 1 1 1 1 I 1 1 1 1 I M I I I 1 I CTTTTTCTTTGAGACTGGGTCTCGCTTTGTTGC 

CCAGGCTGGAGTGGAGTGGCGTGATCTTGGCTTACTGCAGCCTTTGCCTCCCCGGCTCGA 

GCAGTCCTGCCTCAGCCTCCGGAGTAGCTGGGACCACAGGTTCATGCCACCATGGCCAGC 

CNACTTTTGCATGTTTTGTANAGATGGGGTNNNACAGNGTTGCCCAGGCTGGNCTTAAAC 

TCCTGGGCTCAGGCGATCCACCT^rTTTTANCCTCCCAAAGTGNTGGGATACAATTGNGAG 

CCACCACGTCCAGCTGGAAGG GTCMN AA TTTTTT ACATTTTTNGNAGCACAATNTGGAT 

TTTT CACCC ANCCmCCCCTTCTTTITCCCTTTTT^ 

NCNNTTTTNTTTTANAAAAAMAAANCNTTTTTTNNCCNAAAAAAAA 

Sequence 2699 

CTACTATAGGGCGMTTGGAGCTCN CCGCGGTGGCCGGCCGCCCGGGCAGGTACTTTTTT 
» M I I I 1 I I I I I I I I IGI I I | I | | | | | | | | | i | | | | I CCGACCAATATGGTTTATTTNTG 
CCCCAGCCMGCTTNTTTGGACCCTGGCTGGGGGGAAAGGCACCCCAGGCACCGGCAAGT 
TCCAGTCATTGCAGATCCTCCAGGTNTAGGNGTGACTGGAAGTANCCTGGGCACTGNTGC 
TGGACCGTNGGATTCTCCTTCCTTNTTCCGCCGGCGGGTGGTCACCAGGACACCGCANAT 

CAGGCATGTGATGAGTCCCAGGAGTCCTGCCAAGCCGATGAGGATGACAGCCCAAAAGGG 
AAGGT
Sequence 2700

A CTTAG GG CGAT TGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTCAGTGGATGCAAATGA

AATTTTATCTTTTGAAAGTAAGAGGAACTATGGCTGGGCGGGGTGGCTCACACCTGTAAT 

CTCAGCACTTTGGGAGGCCAAGGTNGGGCGGATCACCTGAGGTCAGGAGTTCAAGACCAG 

CCTGGCCAACACAGTGAAACCCTGCCTCTACTAAAAATACAAAAATTAGCCAGGCGTGGT 

GGCACGCTCCTGTAATCCCAGCTACTTGGGATGCTGAAGCAGGAGAATCTTTTGAACCTG 

GGAGGCAGAGGTTGCAGGGAGTTTGAGATTGTCGAGTATAGTGGATTGAGTGCTGTGGAG 

CCGAGATTGCCAAGTGCAGTGGATCGAGTCCACTGCACTCCAGTCTGGGCAATAGAGCTA 

AACTCAGTCTNAAAAAAAAAAAAAAAAAAAGTACCTGCCCXaGGCGGCCGCTCTAGAACTA 

GGTGGATCCC 

Sequence 2701 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTTTGCTCACCAA 

GCTCTTGAGGCCMGCTTGATACGGCTGTTGTTTTTTTCTACATCGTAGCCAGCAGCCGC 

AAGCGCTTTCTTAAGCGCGGCCAGAGAAAACGCCGCTGCGCTCCTTAGAAGCTGCCACTG 

CCTTGGTGATAAG CTCAGA TACTGGGGGTCCGGATGCTTTGCGTTTCCCAGCAGTTGCGC 

CTGCCTTCTTCGCCTTTTTCTTCACMGGTGTTTTTTCTGCGGGTGCAGGAATGGTAGGA- 

GCMGTGGAGCAGTCTCCGACATGTTTTTGTCTTCCCAGAAAAGACAATAAGTAATCTCA 

AACTGTCAGAACAGCATGTCCCCCGCGTACCTGCCCGGGCGGCCGCTCTAGAACTAGGTG 

GATCCCCC 

Sequence 2702 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGAGGAAGGGGAGG 

GCTrGGTGTTCCCTACGCGMGGTCTGACTCAATGCTTATTCCCTCTTCTTCCTCTCTCT 

GCAGACTGAAGAATTGAACCGGGAGGTCGCTGGCCACACGGAGCAGCTCCAGATGAGCAG 

GTCCGAGGTTACTGACCTGCGGCGCACCCTTCAGGGTCTTGAGATTGAGCTGCGGTCACA 

GCTGAGCATGAAAGCTGCCTTGGAAGACACACTGGCAGAAACGGAGGCGCGCTTTGGAGC 

CCAGCTGGCGCATATCCAGGCGCTGATCAGCGGTATTGAAGCCCAGCTGGGCCGATGTGC 

GAGCTGATAGTGAGCGGCAGAATCAGGAGTACCTGCCCGGCCGCTCTAGAACTAG 

Sequence 2703 

CCGCGGTGGCGGCCGAGGTACGCGGGTGTGAGGAGGTGGGGAGACCACCCCACCCCCATG 

TCCACCATGACCCTCTTCCCACGCTGACCTGTGCTCCCTCCCCAANCATNTTTCCTGTTC 

CANANAGNTGGAGCTGAGGTGTCTCCATCTATGNCTCAACTTCATGGTGCACTGAGCTGT 

AACTTCTTCCTTCCCTATTAAAATTAGAACCTGAGTATAMTTTACTTTCTCAAATTCTT 

GCCATGAGAGGTTGATGAGTTAATTAAAGGAGAAGATTCCTAAAATTTGAGAGACAAAAT 

AAATGGAACACATCAAAAAAAAATTAAAAAAAAAGTA 

Sequence 2704 

TNAGGG CGAA TTGGAGCTCCCCGCGGTGGC GGCCGAGGTA CI 1 1 I I 1 1 1 1 1 1 1 1 1 1 1 I 1 1 
ii 1 1 1 1 iui I I CI 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 INGCATCAAAAAGCT 
TTATTTCCATTTGGNCCAAGGCTTGTTAGGGATAGTTAAAAAAGCTNCCTNTTGGCTGGN 
GGGAGAGGCTTAGGCANAANCCCTNTTACTTTGNANGGGGCCCTT 
Sequence 2705 

CCGCGGTGGCGGCCGAGGTAC I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 GTTTG AGACAGAGTCTC 

ACTCTGTCACTGAGGCTGGANTGCAGNGGCGCGATCTGGGCTCACTGCAACTTCCACCCC 

TTCGNTNTAAGTGATT CTNCTG CCTCANCCTCCNAANTAGCTTGGATCACAGGCGCCCGC 

CACCACACCCGGCTMTTTTTGTATTTTTAGTAGAGACATGGNTTCACCAMTTTTTAAA 

GAAAAATAAAGGTGCATGATCAACAATCAAACTTNTAGGACCGTCCCCTANCAGGAGAGC 

AGCAGCAGNAGCAGCACACANACCTGCCC 

Sequence 2706 

CACTACTTAGGGCNATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGAGCTNACCACTT 
CTMGAAACTCCAANAAAGGAANCATGTGTNTTNTATTCTGACTTAACTTNATTTGTCAT 
AAGGTTTGGATATTAATTTCAAGGGGAGTTGAAATANTGNNAGATGGAGAANAGTGAATG 
AGNTTCTACCACTCTNTACTAATCTCACTATTTGMATTGAGCCCAAAATAACTATGAAAG
GAGACCGAAAATTTGNGACAAAGGATTGNGAAGAGCTNTCCATNTTCATGATGTT
Sequence 2707 

NCACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGGGGAGTGGGGG 

TGAAGCGTGTTCTCTACATAGGCAACACAGCCGCCTAAGTCACAAAGTCAGTGGTCGGCC 

GAGGTACTTTGGTAA GGAGA TAGGGAAGGAATTAAGGCTATTACTCTGAAGAAAGTTGGG 

GGGCCAGGGCTCCTATTTTTTTGCTGAGGAGATGGAAGATCAGGGCTTGTATTCAATAAG 

AATG GGAGGGGCCAGGGGATGCCTGGCAAAAGCCTTGCACTGTGAGGTGCAGGTAGAGGC 

TTTTATTCTGGTGAGAGGACATGGACTCTCTCTCTCCCCTCAGGTAACTGTGCCCTGTAC 

CTGCCCG 

Sequence 2708 

GM TTGGAG CTCC CCGCGGT GGCGGCCGAGGTCTNMTT^nTTTNCCCTTNNNNGAGACG 

ACCI 1 1 1 1 IGGANATTTTTTTATTrTTTGGCAAATTTGATCTTACCCTTTACCAGTTCTA 

TAATTTGGNTAAAAGCTGATTATGTCCTACAATGNCAAAGTCAGCTAACTGNCGTCTACT 

TAAGACTTNTGGNCATTTCCAACTTATAGAGGAAGGGNGNCTCTAAAATCTCTTCTTCAG 

AAGGCACCTCACTTNTCANACTTAAAANNCCACATCAAGTGTTCCATTAAAAGAAGATAN 

GGCATTCTGAGTGCAAACAAATGGGGGCTTNTTAAACTA 

Sequence 2709 

AGGTACATGTCTCTTCTCCTCTGACTCTGGGAATGMGATTTTTCTACTCCCACAGGCTT 

GAACTCTCCTTATAGGGAGTGGTCTTNCACATGGCCCAAAAATCAAGAAGGAAGGTCAAG 

AATAAAAACCTCCCAAGGCTTGAAAACTAAGAGCTGGCACGTAAGTACCTGCCCGGGCGG 

CCCGCTTCTAGAACTAGTGGGATCCCCCGGGCTGCAGnnnnnnnnnnnnnnnnnhnnnnn 

nnnnnCGTCGACCTCGAGGGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnAGTGAG 

GGG7TMTTGGCGCCGCTTTGGCGTAATTCATGGGTCATTAGCTGTTTTTCCTGTGTGGA 

AATTGTTATTCCGCTTCACAATTCCACCACCAACATACCGAGCCCGGGGAGCATTAAAGT 

GTAAAAGCCTGGG 

Sequence 2710 

CCGCGGTGGCGGCCGAGGTACACGAGAANAGGAGACCCCAACACGCTTCAAGCTTTGAGT 

GGAG AGGACACAGCCTCTGCTGNGACAGGGAACANAGGGATGCGGAGACCCTGAAGATGC 

TTTTGGACAGTGGTCTGAGGTTGGGACAGTGGCAGGAGATACCATTCACCCAGGATCTCC 

AGGACAAGAGATCAGCCTGGCAGTTACATGTG I 1 1 1 1 1 1 I CAAACTGGTTGCCAGGTTGG 

CATGAGCGATGACATCAGGAGATTCCGACCTTTTCTTATTGGANGGGACCGGACTNTGTN 

GGAC CTGGGGAGTTNAGTTGGACAMNMGMCCTTTTAAMGGGGNTTTCNNTTTTGNT 

TTTrCCCCCCCNCNAAATTMrAAMNACCTTTTTTNTTTNCCCCAAGGGGNCCAAAAAAA 

CCCNGCCCCCCCCNNNNTN7TTTTTGNGGGGGANNGGGNNANTTTTTGGANAAAAATTTT 

TNTCCCCCCCCCCCCCCCC 

Sequence 2711 

CCGGG CAGGTACCTTAT TACAT ATGATTTTTATTAGTTTCTGGAGGCAMNGGAATTTTT 

ATTTTAAAATCAAAATCTATTTTAAAAGAAATAGTTNTCAAAAAGACAACNGATGACTGG 

GTGNGGNGGTGNGTGCCTGTAGNTCAGGCTGCTCGGGAGACAGAGGCAGGAGAACCACTT 

GAGGCCAGTTCAGTCTAGCCTGGGTAACATAGCAGGACCCTGTCCCTAAAATAATAAAAA 

A ATTTA AAAACCACAATAAATGTGAGTTACAAAAAAAGTGTAACTTATGAAAAGGTCCGT 

AATmATTATTGGACCCTTTGTTAAGGAGCTCAANTAATTTTCAGGGMGCAAGGGAGG 

TATCACCCATTTCTTGAGTTNAATACCAAATTTTCAMCCTAMTCTTTTAATTTT 

Sequence 2712

CCGGGCAGGTACCCCCTGACACTCCAGAGCTACTGAGAAATACTCCTTTGTCTTTCTGCA 
CTTGTAGCCAGGCTGAGCTTTGTGAAAGAAGATTGTGTATGTGTGTAC I 1 1 CI I I 1 1 1 1 
Sequence 2713 

CCGGGCAGGTGCGCGGGT GGGTGA CGCTGGTGAANTGGCCCANGGAAGTGCACATGTCTC 
TNCCTGCTCTTCCAGGGTGATTTTTTTGATGTTTGGCTCTTGGTCCTTGTTTCXJACTGGC 
TTTCCATCCCCATGGGGGAGAAACAGTGGCTCCTGGGAGCAGAAAAGGAATTGAGGTGGG 
CAGGCAGAAGAGCCTGGATTGCTCACTGTTTTGGGAAACTTACTTNNAGANNTANANAAG
ATCCNGG AAMCCCAAAAN CCTTCTTGACCAAACCATTTTTTGGGG 1 1 I 1 1 1 1 I IGNGGG
CCNAAAAAI I 1 1 1 1 1 II I TTAAAA AAAAAAAGNNNGTTGNGGANAAAAACCCCGGGGGGG 
GG GGGGGG TTTC I I I I I 1 1 1 1 I NGTNGGGGGGGGCCCNNAAAAAAANGGGGTTNAAAACN 
CCCTTTTTTNCCCCCCNNNNNNNNNNN I I I I 1 1 I I I 1 1 NAAAAAAAAAAAAAATTTTTCC 
CCNNCAAAMNNANNNNNTTTTTTNTTNNNCNNN I I I II I I 1 1 1 1 1 1 NGGGGGGGGGGGN 
NNAAAAAAAAAAATN 
Sequence 2714 

CCNGGCAGGTACNCGGGGGCGCGTGAGCAGCTGCAGCGGCAGAGGCAGCATCCAGCGGCG 

GCGCCAGCAGTTCCAGTCCGTTGCTTTAC I I II 1 1 GCTTCACCCGACANTTAGTCNTTTA 

ATGCCCCGAAAGNAGGAAAAGGTCCTTCCCAGGAGGAAATAACNAGTAGGGGGNCAAAAA 

NGAATNGGGATTCCCCAAAAAAGGGTAAAACCTTAAAAAANCAANNNGAANGCNCCCACC 

AMGGGMCGGGTTCTTGCCCCMGGAATTTGTTCAGNCGAAAMCCCTTNNCTTTTCCN 

ACCC C AAAAA AAACCCCTTGTNNANCCCCCCAANAAACCCCAAAAGGGAAAAAANAACAT 

TTTTGCTTTTAGGAAMGGGMCCCTNNTGTTNGGNN 

GNNGGGGTTTGGCCCTTTAAAAAAGGGGGGAAAAAGNAAAAAGGGGAGGGGNNAAAAAAA 

AAGNNCCAAGGGGNAAAAAGGCCTTTTNGNAAAAAANGG 

Sequence 2715 

CAGTAAATGAACTAATCTACAAGCGTGGTTATGGCAAAATCAATAAGAAGCGAATTGCTT 

TGACAGATAACGCTTTGATTGCTCGATCTCTTGGTAAAATACCGGCAATCCCTCCNGCAT 

TGGGAGGGM TTTTGG ATTTTCCMTGGAGGAATTCCTAANTMCCTGGTTTTGGGNAAA 

AAAMCCGNCCT7TTTCAAAAAAGGNANGGGNCCAAAAAATTAAMCCTTTTTTCCCC 

NGGTNGGGGGCCCCCCCTTTTTCCAMAAMTTTTTGGGTTCC^ 

CGNAAAGGGGGTTNGGGGGAAAAATTGGGAAAAGGGAAAAAAAAAAAAAGGGAACCCCCA 

NACCCCCCCCAAAI I I I I I 1 1 1 I I I GGGTTTAAAAGGAAAAAAAGGGGGGTTTGGGGGNG 

AAAGGAAANTTGNCCCTTGGGGCCAAACCAAGGGGGGGGAGGGGGGAACCCCCAANGNGA 

ATTCCAAAAACCCAAAGGGGGGCCCTTTTTAAATTTTTAANGGAAAAAAGGGAAAAATTN 

NGGAAAAACCTTTAAAAAGGGGGNGGNGGTTCCTTTAACCCCCCATTGGGAAAANTTAAA 

M I MMII l 

Sequence 2716 

ACI I I I I I I I I I 1 1 1 1 II I I I GTCAGAAAAACTCCTTTATTACCATCTCCCTATTACATT 

TCTATTCTAGGGTTAGTGTTMTCTCAGGGTCTTATTTCTTTAAGCACNGCNAAAAAGGG 

CCTTTGGTAGGAMGTTAGGTGGTAMGGTTTTTCMTCCTTTCNATTTTTNTAATCC^ 

ANGGGGAATTGGAAAAAATTCCANAAAGGNCCATAGGGGNAGGCCNTNGTTGGGACCTTA 

GGGGAN CAAATT MCCAGGTTTAGGGTCCTTAGGGGGGCAATAAAACTTTTCCCCAAAAC 

CCAACCATTTTTMG GGGG GGAAAAMAGGCCCCCCAAGTTTTTGGGGAAAACCCACCNA 

GGAGGCCTMTTAGGTTTTGGGGCCTTNCAAMCCCCCCTTTTMTTTTCCAMTTGGAA 

NCCAMTTTC CGGGTNGGTTTTTTTCNAAC^ 

I I 1 1 1 I I II I IGGGTNGGGGGNNAAAAAGGTTTCCTTTCCT7TGGTTAAAGGNTTGGGAA 
NCCCCCAANCCCCCTTGGGMTTNAACCCTTNGGCCCCCTTGGGGGGGGGGMGGGTTTT 
TTAAI I I I I 1 1 I ICCC 
Sequence 2717 

AGGTACI 1 1 I I I I I I I I ICI 1 1 I I I I I I I I I I GGTGGGGGTGTATAGGTCTGGGGAGNGC 

CTTTCAGGTGCTGCTCCATAGACATGTGTGTGCCCTGTATAGAAACCGCCCCCCATTGTN 

TAGAAAAAACNNCCAGACNCTTNTCCCAAANAGNCNNGNTTCTTNACCCATGTAAANNGG 

AAGGNTGNGCGTAATTAGGAAAACCAAGGGCTCAACCANANAGGGCCAAACCAATTTAAT 

TAAAAATTCCGGTTCTTGMGACCCTTTGNCCAAMTGNATTACCTTGTCNCCCMTTTT 

ANAGGGGMGGCCC^TATNTANNmrrTNTCTCmGTCTGGTGGGCCCCCCCCMCTTT 

CMGGTTTGCCTTNGGNCTNCTTACCTTAAAANTNAGNNAAACCAANGGCGGNCCAATNA 

AAMANGGCCAAAAGGGGGNCTTTGGTTNAAAAAMTTAAAAAAAAAAGGTTTTTANTTT 

TTAANCCAATTAAAAATTA 

Sequence 2718

TCGCGGTGGCGGCCGAGGTACANAGTCTTTTGCTTCCTCCCACCCCTAGGGGGAAAAACT

GCTTTGTGCTTTGGGMGTTGTCTCTGAMCCCGGTNGNACAGATTGTACCGTCATTGTA 

CCAGGACCTTACNGCANCGGNNAANGCNCCGGGNGCAGGGCATTNGGGGGNCCTTGGCCA 

AGTCATTGlsrTTNGNGAMTGNGGAAGGNGGGGATCTTTTACCAAGGAAAGGGGTATGGAA 

NGTTANGNAGGNTTCCCNGGTGTAAAGAAAAGGTCCCAATGTNATNAGGACCCCCCTTGG 

GMGTTAAAANGTCCCCCCATGGTAAGCGGTCCCAAGAGGGTTNTTGTANTNTAANGGGA 

AGNGGNGATNTGNGGGNAAAAAAAGGGATTGNNAAGANCCCAAATTTACCGNNGGANATC 

CCTTTGGGGNGGNGCCCCTTTGGNGTGGAAAGAGCCCNCCCGATTNTAAANCCAAACCCG 

TGCCTTA CmCCCIsriTCNCCNNMTTGGTTTTTAAAAAMTTANGNACNCM 

TTTTTTTMNGGGNAMAAAAAAAAAMTTTTTTN^ 

CCCCAAANTTNCCCAAACCAAAAAAAAAAAAAAAAAAAAATG 

Sequence 2719 

GCCGAGGTACAAACGGGTTTCCCACCTTGCCAGGGATCCTGGGGACAACAGATGTAAAAC 

TCCTGAGTCTCTGTGTGTGTGCCTGAGTGGCCAGTCTGCCAGAACTCCACACAGCTCTGT 

GTATTGAACCCAAGGCCTTGGTGGCCTGGGCTCATGAGCAGGTCTCCTGATCCATGGATT 

GCAGAGATCCATGGGAGMGCATGATTTCCCAGGCAGGGTCGCACATTCACTCATTGCTT 

CCCTTGGCTGGGGATTGGGGTTCCTTTGGCTCTGTGCCACTCCTGGGTGGGCCATTGCCT 

CACCCTTCTTTCCTCAGTTCTTCGTGGGTTGTTTGCCTAGTCAGTCACAATGTGAGAACC 

TGGCGATTTCAGTTAA 

Sequence 2720 

AGGTACACTCGCCAGCGGTTTTGCCACAGGAGTGTACGGGAACAAAGGAGACAGGCTCAT 

TTATAATCTGACGCGGNCACCCTNCTGCTGCGTTCGGTTTCCATTGGCTGGGACNGNACC 

TCACCTTCTGTATTTGTCCCGACTGGCTAGCACTTAGAACTTTTTAAAAGAGGCAAAGGC 

ATACAGAGANCAAAGGAAGGAGGAAGTNACTTGTGGAATATTGAGAAAGGTAAAAACACC 

TTTAAATAAGGAAGAGGAACAGGCTATGACCTAATGCTTGTTNGGATCAGTATAAGCATG 

TTAGGGCAMTATTTANGCTAMTTGTGGGAGCTMGAACATAAAGTATATTGATTTTTT 

ATTATGGCTAGCA 

Sequence 2721 

AGGTACAATTTAATTTTTCTGCTTGCCCNGGAAACAAAGCTTCTGTGGAACCATGGAAGA 

AGATGAAAATGAGACTGGCAAAGAACAAATGCTGAATCTGAAGAAGAGGACAACTTTGGG 

CAAAT AATCTGCATACTTTTAATTGGGAATAAGATGGAAAATATGAATGCTAAATCAAAT 

TTTTTAANNNATACACCACACGATACGACTCCCCGCGTACATCTTTGCTGTGGCTCACAG 

ATTGTTCCTCCCATTCCCCTTGCCGCTTTTTGCCTATCGATGGGTAGCMGAGTCTTTGA 

AATAAGCCCATTTGAGCCCTGGATAACAAGGGATAAAGTGGAGCGGATGCACATCACAGA 

CATGAAATTGCCTCACC 

Sequence 2722 

CCGGGCAGGTAC I I I I I 1 1 1 I l I I I I I iT Cl II I I IN i I I I 1 1 1 I I I I I I I II 1 1 I 1 1 I 

I I I M H M I I I I M M I I 1 1 I I I I 1 1 i 1 1 1 1 I I I 1 1 I I NTTTN I IN II I I I 1 1 I I GGG 

GGGNAAAAAANNTTTTNNNAANANNNGANAAAAAANNGGGGGAACCNNNNGGGGGNNCNC 

NCCCCCNN AAAAMA NGNNCCCCCCCCNNMNNNAAAMANNANTTTTCAAAAANNAAAA 

AANGGGAANTTTTTTANCNAAAAAMNNTTTNTNTAAANNNNNNTNTNNGGGGGNGGNGN 

TTTTNAAAAAAAAAAAAAAAAAAAAAA 

Sequence 2723 

GAGAANGACACCATGTGCCTCAGAAACTGCTCGGTCAGACGGTGATAGCGAGCCACGCAT 

TCACAGGGCCACTGCTGCTCACAGAAGCAGTGAGGATGATGCCAGGATGATGTCTGCCTC 

GCGCCTGGCTGGGACTCTGATCCCAGCCATGGCC7TCCTCTCCTGCGTGAGACCAGAMG 

CTGGGAGCCCTGCGTGGAGGTATGTGGCTGGAGTCAGCTCCTCTGAACTTTCCCTCACTT 

CTGCCCAGAACTTCTCACTGTGTGCCCTGGTTTGTTTATTTTTGCAAAAAAAAAAAAAAA 

AAAAGTACCTGCCCG 

Sequence 2724 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTCCAATGA 



atggtaactgatccaggcacgttatcacacttcx;tagtcatctccacctttcxjtgtattg

cctgtggcttgttgtttaagattaagaatcaaagagattaagaagtatcacttcaagtct 

tgctctgctcacttctatgtttgcagtcaaattattccttatgttggtgacctaaagaga 

attactttcattcatttcatttcccccgtagcagatggaagtgagaaacctctgagaaaa 

tgaam catcctt mccactatctttcccttttatttgattattttatgtcagaaatttg 

caaaagi 1 1 1 1 1 ictcctcxttctcttccttgttgcttaactttttaattcatgccatat 

gcagatatccaattatgtgcatcctgtgaataaaccacgtcttggtcactgtcatatttt 

gaaccatctcatcagagatgaataata 

Sequence 2725 

CCGCGGTGGCGGCCGAGGTACGTATTACTG77TCCCATTATATG7TACAAATGAAATGAA 

CACAT TCTCATMGTTAAAAAMTATAGMTATATATTTTTCTTTTTCCTTTT CI 1 1 1 1 1 

TTTTTCTGAGACAGAGTCTCGTTCTGTCACCCAGGCTGGAGGCTAAGGTGGGAGGATCAC 

TTGAGGCCAGGAGTTTGAGATCAGCCTGAGCAACATAGTGAGACCCCCATCTCTAAAAAA 

AAAATAAGAAATAAAAATCAGCGACGAGGCATAGCGGCTCATGCCTGTAATCCCAGCACT 

TTGGGAGGCCAAGGCAGGCAGGTCGCTTGAGGTCAGGAGTTTAAGACXJAGTCTGGCCAAA 

TGGGG GAAACC CCTTTTTTCTTTAAAAAANTNC C 1 1 1 1 1 1 1 1 CCCCCCCATNGGGGGAAN 

NCCCCNTTTTTCCCCNCCCNCCNNNGGGGGGGGGGGGGGGGGAACCCCCCCCCCCCC 
Sequence 2726 

CCGCGGTGGCGGCCGCCCGGGCAGGTAC I I 1 1 I I I 1 1 1 I 1 1 1 1 1 I 1 1 I I I I CCAAAACAA 

MCATGCTTAGCATGCACACTTTTACCACTTTTTTCGAGNGGAAAGTTTATTGGCAATAT 

TAMTTTCACCCTAGATAGGATATGAGMTGTTTTGATAAATCACAATTTATAGTATATT 

MTGCCATGTGAGAATTTTGTTTCCCAAGTAAGAGCTCACATGGAACTTGGTCATTAAAC 

CTTAAAGAA ACCTTTC TCACATATCTATAGGCCTCAAATTGAAATAATCTATAAATGAAT 

TTGTAGATrTCTTTTTAGTTTMTTCCTGAGTATACAGGGCAAAAGCTTATNTCCTTTAT 

ATAAACTTCTGCTTTGGTCT AAAACTGATATATCTTCACG7TGAGGTTTCATCTGAAATG 

CACCACGTTTGCTGACTTGCTTCAATATGAATTTGTATGGCTATAAAATTGNGC 

Sequence 2727 

CCGCGGTGGCGGC CGAG GTA C 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 CGGGACTTTAGATA 

GCTGGATTATTCTTCTTTTCTCTTTTTGTCTTTCATGTAGTTATTAATGAGATGGATATA 

AAACCAACTACTAGGTNCATATCCCCAAAAAATGAAGTCAATATGTTGAAGAGATATCTG 

AACTCTCATGATTATTGCAAGACTATTCACAATAGCCAAGATNGGGAATCAACCTAAGTA 

TCCATCAACAGATGAGGNGATAAAGATAATGTAGCATATNTATACAACAGAATNTATTCA 

GCCTTAACANCAACAAAATAAATNTTGTCATTTAAGACAACACAGATAAACCT 

Sequence 2728 

GCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACAACTCCACTAACCTGGGAACCTT 

GCCCCCATCCCCCATAGCANCCACAGCGAGACCTGCCCAAGGAGAGTCTGAGCTCAGACA 

TGCCTAGCCCTGCCCCAACTTGATGGGCCTTCCTATCTACCCTGGTAGCTGAAGGCAAAG 

GACATATACCCTTGGGAGTTCTAGGGCCCCGCCCATCGCCAGTTCCTCTCCATACTACCA 

CAGCTGATGCTCTCTGGGAAGTGCCACCTCCCAGCAGCAGGCCAATCAGCACAAAAATAG 

AACATTAAACCACCAAAGCTAANAACCCTCAGAGAATCCATTTACCCCCCT 

Sequence 2729 

CCGCGGTGGCGGCCGAGGTA C 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 CCCTGTATTTTTCTTTTTCCC 

GAGATGAAGTCATGCTCTGTTGCTCAGGCTGGAGTGCAGTAGTGTGATCTCGGCTCACTG 

CAACCTCCGCTTCCCAGGTTCAAGCGATTATCCTGCCTCAGCCTCCCAAGTAGCTGTGAT 

TACAGGCATGCATCACCATGTCTGGCTAGTTTrTGTATTrTTAGTAGAGACAGAGCTTCA 

CCATGTTTGCCAGGCTGGTCTCAAACCCCTGACTTCAGGTGATCCACCTGCCTCAGCCTC 

CTAGAGTGCTGGGATTACAGGMTGAGCGACCACACCTGGTrTGTTTCTTAAAAAACATC 

TTATATTTCTCTGCTTMCGNGCTCMTGTTGAACATGTGAAATATAATAACTTTTCATA 

ATCTCTTCTGATTCTGGCTCTATATCATCTTTGGGGNCTGGTTTTAANGGAATTCTAATT 

TTCTTGGCAAT 

Sequence 2730 







CTATAGGGCGAATTGGAGCTCCCCGNGGTGGCGGCCGAGGTACACGAGTCTTTTGCTTNC 

TCCCACCXJCTAGGGGGAAAMCTGCTTTGTGCTTTGGGAAGTTGTCTGTGAAACCCGGGG 

ACAGAGGACNCANGACAGACTAGGANNGGAGCCGNGAGGATGGGCTGCANCTGTNGAGGA 

GGGTTTCANAGGANAGNGGTCNGATAGCACCAGGCCTGAGAAGCCATAGCCTAGGTGGAN 

AGAGGTTTGAAAGNGACANAGCGGGCTGATTANCTGCCGTANACNCTTTNATNCCATGTT 

ANATAAACATNNTTNAAAAACCCTTCNTNTTGTTCCCAATNNTANAAAAATTAANCCCGG 

GCNATGGG GGGG CMTGCCCCCTNGTNATTCNCTNTTTTAATNAANAGAAAGGGNTTGNCC 

CCGGGTNANTTTTANTTTTNTNACTTTNNGGGNGGTTTNTIvrrTTTATNNTTNTNANAAAA 

ANAATTGGGGNGNT^^TCAMNNNATTTNNTTGTTNNAATNT^TTATATTN 1 1 IT I II I 

TTNANCTNNCCNTNNNAACAAAAI 1 1 1 1 1 1 1 1 NTNTNNCATATAAAAAAA 

Sequence 2731 

GGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACTAGTTATTTTAAATTCCACTCATA 

ACTTATCGGCCAAAAGTAGTCACATGGGTCCACNTAATNACAAGNGGAGCGGGAAGTGCA 

ATCCTACCTTGCCTGGGGAAGGTATAGAGATAGACCAGCNCTAATGACTACCACACTTNG 

CTAAGGTNACATAATAAATAAGCATCAGGACATTATGTGTGGNGGCTCATGTCTATAATC 

CCAGCNCTT 

Sequence 2732 

CTATAGGGCGNATTGNAGCTCCCCGCGGTGGCGGCCGTGNGCNCGGAGNTGGTATTGACA 

TAGCCTTTGTAGAAACAGTGCTTGAGTTCCGCTTCNTCTTCGGAATAAAACTTGGTCTGA 

TTCACCCCGGGCGTCCCGAGGAGGGTGACAGTGAACAGTGGAGCGATAMTCCGGCATTG 

GCGGTGAGATTAAA 

Sequence 2733 

AGGCGGGCGGCNGCCNCNCNCCNGGTACCTGATAAAAATTTANTNCTCCTTGGCCAGGCA 

TGGNNNCTCACACCNGTAATCCCAGCACTTTGGGAGGCCAAGGCCAGCANGTTGCTTGAG 

CTCAGGAGTTTGAGACX5AGCCTGAGCAATATGGCGAAACCCCATCTCTACAAAATATACA 

AACATTANCCAGGTGTGGTGGCNAACGCTTTCAGTCCGAGCTACTGATGAGGCTGAAGTG 

GGAGGATGGCTTGAGCCTGGGMGTGGAGGTTTCAGTTGAGCTTGAGTTTNATGCCATTG 

MCTCCAGCCTTGGGCGACGGGNAGACNCTNTTTTNAAAGAATTTGGAAAAAAAAAAAGG 

GAAAAAAAAAANNNN ANGN CCCTTGGGCCGNNATTTNAAAAAANATGCNCCNCCCCCCCC 

GGGGG^^TTNNGAMTTTTMA^J ^^TTAN AAGTTTTTTCNNNCCCCCNGGGGGGGGGGGGG 

GGGGGGGNGCCCC NNAA NNNC I 1 1 1 1 1 1 1 1 1 1 1 ATTTAAGGGGGANGGNCCCCCCCCNAA 

AAAAAAANATNTTT^f^^^TNTTTNTNNNCCCNTTNNNNANNNTANAAAAAAAAAAAAAGN 

GGGGGGGGGGGGGG 

Sequence 2734 

CCGCGGTGGCGGCCGAGGTAC I 1 1 1 1 1 1 I I 1 1 1 1 I 1 1 1 I I I 1 1 1 1 ATGAGATGGAGTCTC 

GCTCTGTTGCCCAGGCTGGNGTGCAGTGGTGTGATCTCAGCTCACTGCAAGCTCCACCTC 

CTGGGTTCACGCCATTCTCCTGCCTCAGCTTCCCAAGTAGCTGGGACTACAGGCACCTGC 

CACCACACCCGGCTMTTrrGGTTTTGTATTTTTAGTAGAGACAGGGTTTCACTGTGTTA 

GCCAGGATGGTCTCGATCTCCCGACCTNGTGATCCGCCCGCXJTCAGCCTCCCAAAGTGCT 

GGGATTACAAGTATGAGCCACTGAGCCCGGCCTCTCTGTAGCTTTTAAGATGTTCTTAGG 

TGACTTATGAGAATGAAAAAATGGAGAATTTTCCGTCTTCCTGCCATGAAATCAATTAAT 

GGCAATTGCTACTGAAAGCTGTTCTG 1 1 1 1 I 

Sequence 2735 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACI 1 1 II I 1 1 1 1 1 1 1 1 1 1 IGG 

GGGGTTGCTATTATTGTTATTGTTTCTAGATGAATGTGNGAAGCCAAATTATTCAGTGTC 

TTAAAGACCATATTAAAAACTCCTGCCAGGCGCAGCAGCACGTGCCCATGATCCCAACTA 

CTTGGGAGGCTGAGGCAGAAGGATCGCCTGAGTCCAGAAATGCTGGGCTATAGTGCACTA 

AGTCAATTGGGTATCTGCACTAAGTTCX3GCATAAACGTGAGGGACCACAAGGTTGCCTAA 

GGAGAGGTGAACCAGACCAGCCTGGAAACAGAGCAAGTCAAAACTCCTGTCCTGATGAAG 

TAGTGGGACTGCACCTGTGCATAACCACTGTACCTGCCCG 

Sequence 2736

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGACAGACGAGATCTCGAT
CGMGGCGAGATGGCGGACGTGCTAGATCTTCACGAGGCTGGGGGCGAAGATTTCGCCAT 
GGATGAGGATGGGGACNAGAGCATTCACAAACTGAAAGA 
Sequence 2737 

ATAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTATTAAAAAAACAGTTTAT 

GTGCAAGGT GTATAA GAAAAGTAAAACATACCTTTGGTAAAAAGATTATAAAGGGGCATA 

AGAAT GTGG A1 1 1 1 1 ACCTACATTAMGGGTTAAAMCAATTATTGTTTTAAAAGTTTAA 

GCAAGTTTTAAMCGTT MTTAT AMGAAMTTCTGTGTGTAMCATATTAGCTAAAGTT 

AAMAGGTATCATCCAGTTTTTCTGTGAACTGGACATTAAAGTAAAAAATGCCACAGGTT 

TTTCTTAMGCATCAACCTGCTCTTTAACAAAAATTATAAAAGGTTAAAAAGAGTCTATA 

AMTCTTACCTTATGGTCAAACATGAAAAATTGGATAMTATGTCTCMGGGTTTATTAA 

MTTTCAGrTTAAMTTMTMCACACTAATATAAAGGTAAMTTTAGCTTATCTGGTAT 

AAAAATCATACNAGAAACATTATTAAATATNAAATGGGGT 

Sequence 2738 

TCTAGGGC GAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACI I I I I I I 1 1 1 1 1 I I I I M 

1 1 M 1 1 1 1 IGAGACAGNCTCCCTCTGTCACCTAGGCTGGAGNGCAGTGGNACGATCTTGG 

GTCACTGAAACCTCCGCCTACTGGGTTCAAGCAGGTCTCCTGCCTCAGCCTCCCCAGTCG 

CTGGGATTACAGGCACATGCCACCACACCTGGCTAATTTTTGTATTTTTAGTAGAGACGG 

GGGTCTNACCATGTTGGCCAGGCTGGTCTTGAACTCCTGACCTTAGCTGATCCACCTGCC 

CTGGGCTC CCAAA GNGCTGGAATTACAGGCGTGAACCACTGNACCCAGCCCNTTGACCTG 

TT^f^TTATTATTTTGNGGTTMTGCCA 

NTCACAGGGGGG AANTTTTTTTAAAACAANAATTCC 1 II I I II I I A AGGAAAAAAAGGTT 

TGTisrmrraiiTGCANCc™ 

ATTCCI Mill ICC 
Sequence 2739 

TNTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTACGAAGCCATCTTGGCT 

CTGTGGAACCAGCTCTACATCAACATGAAGAGCCTGGTGTNCTGGCACTACTGCATGATT 

GACATAGAGAAGATCAGGGCCATGACAATCGCCAAGGTATGTCCTCAGGGCCACTTAGGC 

TGCCTGGAGGGAGGG CAGC GCTGCCCCCCGCAGTGCCTGTGTCCAACAGTTCAACCTTCT 

TGCTGTGTAGCAGTGCTTTTGTGTCTCGTNAAGCAAGTCAGCTCACCCCTCCTTAGAGGT 

TCTGGTCTGTCCAATAGAGAACGGGNGGGATTAGCATATGGCTGATTATGAGAGAAAGAA 

GCMTNCTMTTTAGGGTGGCCTGACAAGCAAGCCAGAATTGCCTGTGGAAAAGTTATTG 

CACTCCTGTAAGAATTCTGGACCCTATTGCTCCTTTGATGTAATGGAAAGTTAG 

Sequence 2740 

TNTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGAAGTGA 

ACTGAGGGCCACCCTGGGAGGAAGCCGACTAGGCGAATTCACTTACTGACCGGCCTGGGC 

TGCTCTGAGACATGGAGGAAGCCAGTGAAGGTGGAGGAAATGATCGTGTGCGGAACCTGC 

AAAGTGGGGGTGGGGGGAGTTAANANTATTATCCCCANATTGNGGGGCGGGATCCTGCCC 

CGGGGGGAAAACTTTGGAACATCTCCGCAATAAGACAGA 

Sequence 2741 

GGGCANGGTACCCACTTGGATGACTGGGGAGAAGGGCTGGGCTGCTGCTGGGGAGTAACA 

CAGGCCTTGGGGCAGGGTTCAGGAGTTCATTAGTCTGGAGTCCAGATCGCCACCCAGGGC 

CCAGCCTGATGTAGTGTTCGCGTTCTCTCAGCGCTGCAGTTTTCCGATAAAGGAGAGGAC 

TCCTGTGTGCCAGAGCTCTGAATGGGAGCCTCTTCTCAGTCCAGCCAGGCAGAGGGTGAG 

GCTGCCACCTTATGGCCACTGGGGGAATTGGCTCTGGGCTTGGACTCCAATAAGGGGCCG 

GGAGCTGCAGAGACCTCCAAAAGGTCTCTTAACTAGTCCTCCAAA 

Sequence 2742 

CCGCGGTGGCGGCCGAGGTACTGCCATACCTGGCTAATTTATTTTTTGTGGAGATGGGAT 
CTCACTTTGTTGTCCAATCTGTTCTCAAACTCTTGGCTTCAAGTGAGCCTCCTGCCTCTG 
CCTCC CAAAA TATTGGAATTATTGGCATGAGTCACCATGCCAGATCAAGAAAATATTTAT 
GTATAATTTTATCATACCTCATTGGTCCTMTGTTTTTTTGCTTGTTTAGGTCCCTTCTA 







GAGATAGGAGAAGAGAGAGATCCCTTTCTCGGGGAGAGAAATNCAAGCCGTCCCGATCCT 
CTTAGGGCTNGGA GGTAA AATCNTTTGATMCTTTGTATTNAAAACTTTGCATCCATAGT 
ATGCTMGGChnTNTTTTANCCCAMAMTTTTN^ 
AAAAAGGGCCCCNAANACCCAAGNNAAAGGGNNGGNN CI nTTTTTI 
Sequence 2743 

CCGGGCAGGTAC I II N I H I 1 1 1 I II I TTTTT Cl II 11 1 11 1 I 1 1 M 1 1 I 11 1 1 1 I I 1 I 
M I I 11 11 1 1 I II I 1 1 1 I I I I I I M I 1 1 1 1 1 1 1 1 1 I 1 1 1 11 1 1 M I I N 1 I I 1 1 I I 1 1 1 I 

nrm iccccnggnnnnaaamttttaanngnnnggaaaaaaaaaaancnggggaaaang 

NNCCNTTNNTNNNAANCCNTNAANNAAAAAAANNCCNNNAAAANNAAAAAAAAACCCCAA 

MNNNNNhTTNNNAAAAAMNCNTNGGNNNAAAAAAAMGGGGGGNGNNANTTTNNNNCCC 

CAAAAAAANGGTNNGGNNGGGAAAAAATTNNTTNNCAANCCANAANTTNANTNCAAAAAN 

CCNTNNTTCCCNTGNCCNTNNAAAAAAANGGGNACCCCCNGGGGGGANANAATANNAAAA 

AAATTTANTTTAACCCCCCCCCCAA 

Sequence 2744 

GGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACI 1 11 I I I 1 1 11 I I I I GAGACGGA 

GTCTTGCTCTGTCGCCCAGGCTGGAGTGCAGTGGTGCGATCTCGGCCCACTGCAAGCTTC 

GCCTCCTGGGTTCACACCATT CTCCTG CCTCAGCCTACTGAGTAGCTGAGACTACAGGCA 

CCCGCCACCACGCCCGGCTMTTTTTTGTATTTTTAGNAGAGAGGGCGTTTCACTGTGT^ 

AGCCAGGATCGTCTTGATCTCCTGACCTCGNGATCTGCTCGCCTCGGCCTCCCAAAGTGC 

TGGGCTTACAGGCGTGAGCCACCACACCCGGCCCAGTCTTGTGTCTTAAATACCATTTCC 

ACATTGACACCTCCAGAATATTTATCTCTAAACCTGACCTCCAGAAGGTGCAAGCANATT 

GTGTAAACCATGTCTCCACTACACGCCTAAAGGCA 

Sequence 2745 

CCGGGAGGTGTCCGAACAGGCAGGTTGGTGGGTTAAAGGTCTTAATCTTGACTCGAGATC 

TCTCCCCGGAGTTCACAGAGTAGGCGACGAAGCCGAAGCAGCTGGAGCGCGACCCGGAGG 

AGTCTGACTTCTCGTTGTCTTCATMTTTTCATTCGTTGCTTTCTTCATGGACTTGCGGC 

TGGGGGAGGATCCCCGCTGGTCGCCGAGCANGCGGGCGGGTAAAGGTAGGCCGCGAGAGC 

CAGGTTATNGAGAGGAGAGGAGGC 

Sequence 2746 

A GGTAC CTGTGACTAACAAGGGGTCTGGGAGGATCTGCTGCTCCCATGCCCTCCTTTGTG 

TGTTTTAMTCTGTTTGAGCCTTCTGGGCTCCTGCGAATTAGGGAGTGGCAGCTCCTCAG 

TCTMCTCCTATTGNGACCAGGTTGCCTMTTGGCCCTTTGGTTTGGGCACCCACTGTCC 

TTCTGCGTGGTTGGATAGATGCTGCTCCCAATGTCCCTGATCTCTTACAGACCCCTCTGA 

TTCTTCACTCTTGGCTrTGAGAGCCCCTGATGCCCTGCAGTCTTGACTGAGCTTCTAATG 

GTTGATCAGACCCTTGMTGTTGAGCTCTTTCCATACTAGACTTGAATATTCTCCTG 

Sequence 2747 

GGCGGCCGCCCGGGCANGGTACANGACATTTTCAMGTTGCCAGTGTTACTTTAATTGGA 

CTGCCTTCGTAATTCATTGCCTCTGCTTCAACAATGTGCAACTCATCCTTTGCACCAGCC 

CCTAMCTGACCGTTCTTAMGATAGCTGGTGCTCATTTTCATCATTATCCACCTTAAAG 

NGATMTCTTTGTCGGCCTTTAGTTCACAACCGAAAAGATAGTTCTGGGGCCTNAGGGGG 

CTCATGTCCATGTCCATCGMTCTTCCATNGGGNGGCGGCACNCNNTTTTTNTAGNAAAG 

AAGGCGGNCGGAANATAAAAAAAACTNTNCTCCAAAAAAAGACCCGGNGCANGGAGGGNA 

NTCANACCAGGGGCCCCCGTTTNCCTNGGATCNTTTNATAAAAAAATNGGGGGNCCCCCN 

CNGNGNNNGGNNGNAAATTCTMTNNTAMNTTTTTANCCCCCCCCCCCCCCCCTNGGGG 

GGG 

Sequence 2748 

CCGGGCAGGACTTGGGAAGCTGAGGCATAAGAATCACTTGAACCCGGGAGGTGGAGGTTG 

CAGAGTGAGCCAAGATCGCAGCACTGCACTCTAGTCTGGGTGACAGAGCAAGACTCTGTC 

TCAAAAAAAAAAAGACTAGAGAATGTCAGGGAACACATGTGTATATTTAAACAACTTCAC 

TTTGCATTTAAAAACTCGAAGGACAGCGAAGGTGAAATCAATTCACGGGCCACCTAAACT 

TTTCAGTCTAGGACTNAGAGCTTGTANCCATNGATCTGTNGGGNTTGNACCTCGGGCCGN 
TTCTANAACTAGTNGNATCCCCCGGCT
Sequence 2749 

CCGGGCAGGTACAACATGAGACATGACGCCCTTCGGGACACATGCCTGAGGTAGTGACAA 

TCCMCTTTGGMGAGTGGMGCCCTAGTTTCAMTTCMGCATGCTTTGAGTATAAATT

MGTTTACCTCTTTTTGCACAGCAACATGGCCMTC^ 

GAAAAGGAATCATACTGCTAAGAATTCAAACTTCAGCAGTCATAGGTAAAGTAAGGGAAG

TTTTiSi TA MCCNT ATTTTTAGCCCCCNTACCCNGAACCCTNGNAMTTTTNGCNAGGG^ 

TTTTTCMTTTTTCNAGGGACAGGTTGGGGTTTCNCT^ 

NACCNTGGAANAAACCAGACCCTTTTAAAAAAGG 

Sequence 2750 

CCGCGGTGGCGGGGTTTCGCCATGTTGGCCAGTCTGGTCTTGAACTCCTGACCTCAAGTG 

ATCCACCCACCTTAGCCTCCCACAGTGCTGGGATTACAGGCATGAGTCACCACACCCGGC 

CAGTAGMCTCTTTAMCCTGAAAMTCAGTCMCTTTGCAGACTAGAGGAGGATGTTGA

AC ACCTATGTGTGTA 1 1 II I II I CTTTACCAACTATGCACCTATTTTTCAGACACCTAAA 

AGTMTGTCTGTGAMCAGTGGGTTTTCTTTTT 

Sequence 2751 

CCGGGCGAATTGGAGCTCCCCCCNGTGGCGGNCGCCCGGGCAGGTACAATGCTTATAAAA 
TTCAATAATTTGTATTAAAATACAAAATCCNATAACAACCAGGAGTTCTTCGGAAGAAAA 
AAAAAATCACAAAACAACCCCAACAGTGGTGAAGAACTA 
Sequence 2752 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCAAAAGCTCCCAGATTATTTGGAAACCA

TGTTTTCTTCCTAGTCCATGGTAACAAGAAAAAGCCANTGGAAGCATCATTCCAAACAAT 

AATCTCCAAAGATGGTGGCAACCAAGTGTCAAATGGGGACTGCAGGCACAGAAGAGACCA 

CCCCAAACCCTGCCTGGGTGGACGAAGCAGGTATGCTAGAATAGTCCTGTCCTGCAGAAT 

AGGGAACGGCAGCTTGGTCGATCTGTGCCCTGGAAAAAGAAAATGAGTTGCAATAGAAGT 

GACTNTAAGACAGACAATGAACCTACTNTTAAGAGAGACAGGGCCAGGCACGGTGGCTCA 

CGCCTGTATCCCAGCACTTTGGGAGGCTTNAGGCGG 

Sequence 2753 

GGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGCAGAGACAATGGAATAAACAGCCAAG 

AATGTTAAAATAGCTATTATAAATATGATCCTTATGCTCATAGAGAAAAGGAAAAATATA 

AGTATAATGTGAAAAGAAATGGAAGATATAAAAAGAAATGCAACTTCTAGAGGTGAAAAA 

TATGTCTGAAATGAAAACACCATATAGATGGAATTACCAGGAAATTAGACACTGCAGAAG 

AAAAAAATCCATGATGTTGAAGTATATTGCAATTAAAACTATCCAAAATGAAACTGAGAG 

GGGGAAAAAAGCCCCTGAAATGTATGAAGAAAGCCTCAGTGACCTGTGAGACAATATCAC 

AATGGCCTAACATAAGTGTAATTTGGAATCCCCAGAAGGGGTAGCAGGCCCGAAAAAAAT 

AATTGTTTGAATAAAGGGNCAAAAGGTTTTCCC 

Sequence 2754 

CTATAGG GCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTT AI I I I II I 1 1 I I II 

TTTTTTTGAGANAGTGNGTGACTNTGNCACCAAGGCTGGAGGGNNGNGGNAAGANCATAG 

N TCACTGAAGCCTCCATNTN TGGAGTTNAAGGGATCCTCCCGCCTCANTCTCCNAAGCTA 

AAI 1 1 I I I M 1 1 I I I I I n 1 1 IAGNMNANACATGGTTTCACTATNTTTGCCCAGGCCAG 

GTCT AMCCTTNCTNACTTNMNGCMNTNNCTTCTNCCTTTAANCCTTCCCAAMGGGN 

TTTTGGNATANNNT NANACN CAGNCCCTTNTNCCCTGGGCTCTTTCNTTNTTAAAGAAAG 

GACAC TTTAM TCANTTTTTNCNCNCNCAGAGATNAATTGNGCAACAAACANTATTANGT 

TGGGGTTTTATTTNAGGGTTCCTNNCCCGGNNNGGTCCCTTTNANAAAANNTNNGGGAAN 

CCCCCCCGGNGTNTGGGGGGANTTNTNANNNTNCMNTTTATTTTATTNCCCCCCCCCCC 

NTCG 

Sequence 2755 

CGCGGGGGCGGCCGGGAANGGNCCNGGAANGGNAGTCANGCAGGGAGCGTCTGTCCGAAC 

GGAGGCTAGGTAAGAATATTTCACCATGAAAATGTTAAAAGACATAAAGGAAGGAGCTAA 

ACAATATGGACCCANCTCTCCTTATATGAGAACGTTATTAGATTCCATTGCTCGTGGAAA 
TAGACTTATTCCTTATGATTGGGAAATTTTACCTAAATCTTCCCTTTCACCCTCTCAGTA

TCTACAGTTTAAAACCTGGTGGATTGATGGAGTACCTCGGCCGCTCTAGAACTAGGGGGA 

TCCC 

Sequence 2756 

CTACTTAGGGCGAATT GGAGCTCCCCGCGGTGGCGGCCGAGGTA U I 1 1 1 1 1 1 1 1 1 I I 1 1 1 
II 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 IGGAGNGAAACAGGAGTGCTTTATGGTCTGAGTGGAGTGTTTG 
GGAGGAGTNCCTCCCGGNTCCTGCCTTGGGGCTCACCTCCCG 
Sequence 2757 

CCGCGGTGGCGGCCGAGGTACATCTTCTCCTAAAAACAAGGGTAGAGCCAATGGAAAGTA 

ATGGTTCTG7TACATAGAATGAGTTGTCGCGTTGATC7TAAATGATGTATTGGTAGATAT 

ACTTCCCAAGTGGATTAAAAAGTTAAAACTTACAGCATAACAAAGTATTAGACTTACTGA 

GGTG ACTTG AA TATCTCCTTTTG ATTTTCACTCT A I 1 1 1 I CTTTTCACCCATGGGAAAAT 

GATMTTTTTTAATAAACCAAGGCTCTTACCATAGCTGAACTTTAAAACTTAGACTGTCT 

TTTCTGTAAACGATTCTGAGGCAAAGGGAAATGACTAGAAGAGGATGAGTAAACAATAAC 

CTGAAATGGGAAACTCG AGGGAAGCACAGGC 1 1 1 II I 1 1 G GTTTGTTTTGGTTGATNCGT 

TTTTTGGTCTTTG 

Sequence 2758 

CTAC7TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGAGGCCGGTGGGCT 

CCCTGACATCCCTTCCAGGCAACCTGAAAGCACTGAAATAGCTTATGGCCCTGTGCCAGG 

GACCTTGGCCCAAGCTGCTGACCTCCAGGGTGGGGAGGGAGCTACCCCCAGGAGAAGAGT 

CACTCAGACAGCAGTATGAGCAAGCCAGCCAGCAGCTCCGTGCCTGCACCCAGCTCAGGG 

GAATCCCAGGGGGTTCAGATGCCCAGGAAGGAAAAGGGGACAGCGCTACTGCTATGGAAT 

GAGACXJACCACTTCTCCTGTTGTCCTTCCCAGCTTCTCCCCAACXJTCCCCTTTTCCTAGT 

TTATAAGACAGGAGAAAAGGGGAGAAAGCAAAAAGCTGGAAAGAAACAGAAGTAAGATAA 

ATAGCTAGACGACCTTGGCGCCCCACCTGGCCTGGGNGGG 

Sequence 2759 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAGAAAATTAGCAAGAGACATTTTCTGCATTGT 

GAGAAATCMCATAGACACCTTAMGACCCCTTTGAGAGTGTGGCTTTTTGAACTTTTCA 

GATTTTGCTCAGTGACCTGCTAACACTTACGTGAGAGGCTCCAGGTGTAAATAGAATCTA 

ATGGCAGAATCTGTAAGTGTAAACAAGCATCTTAGGAGTGAGAGATCAAGACCACAAAAT 

GTCCAGAGCTATGACCACAGCTATACCTACCCATAAAATACGATACTGGAGTAGGGTATT

TTTGTCTTTTTTTCTTACCTAAGAGCTAGCTAATCAGGACAGGTGATGCAGGTTCTGGAG 

CTCTACCAGGGCAAGTTCTATTTTCTTTTTTTGAGACAGAGTCTCACTGTCGCCCTG 

Sequence 2760 

CCGCGGTGGCGGCCGGGCAGGTAC 1 1 1 1 1 1 1 1 I I 1 1 I 1 1 1 1 1 1 1 1 1 1 CGAATATTNCTTT 

CAGTGTTCTCAGATTGACTTGACCAGCCTAAGACAGATGCCAGGGACATCCTCTTNTNTG 

CCTNTNAACACTTCAGTCAGATGGGAATATGGAAGGATCATATNCAAGAGGATCATA7TT 

TNTGAAGCCAATCCATTANATGTCAGGAAA 

Sequence 2761 

AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC7TGTAGATGTTCTTTGGTATCTGGGCA 

TTGMGAATTAGGTATTTATTATAATCTTCACAGTCTGAGCTTGTTCGTAGTGGTCCTTC 

TTGGAAAGGCTTTCTAGATATTCAGAAGGACCGTAGGTGTTGTGATCTAAGCTGTATCTG 

CTTTAGAGGGCACCCGAAGCCCCGTAATGCTCTGGTTCTTGCAGACTTCTGGAGATACTG 

CCTTGATGGTCTTGGATAAGATTTGGAAGAATCCTCTGGATTATCAGGCAGAGACTCTTA 

TTCTCTTTCCTCACTTTTTCCCAGAGTCTTTCTATGCTGAGCTCTCTGGAGCTGGGGGAG 

G 

Sequence 2762 

TNNCCGCGGTGGCGGCCGAGGTACTGTCCAACCAAAACTTTCCACNGNGAAAATTTTCCT 
GGGTGAGCCTCCCAGAAAAGCCCAGCTTAGTGTAAGCCAAAGACCTCCCAAGTCTGTCAC
CAA I 1 1 I I ICCCTATTACTCACCTGATCATGTGGGCAATATCCCAGTTGGTCTCTGTAGA 
CAATGGTCCCTCTATTTCAACACCTTT7TCGGTGACAGTGGCGA7TTGAAACTGGAAGAA 
AGCAGAGCATTCAATAATGCCCACGCCTTAAGTCCTTAAATGAAAGGTCAGGTGGAGGTC

TTCCCCAATGTGAAAATAGGAGTCACACAAGTAAGGCGNATCTGTTCTTCAAAGCATAGG 
CTC 

Sequence 2763 

AGGTACTATAATCTCCTTCTCCACTGGCAAACAGATTATACCAGAAAACAGCAGTTCCCT 
ATGGAG AACAATTTCACA ATTATCCCCAAGTATACMTGCCAGGTMTTGCTTTCGGTAG 
TT ATT AC I 1 1 I t I I 1 1 I I I GAGACAGAGTCTTTGCTCTATCGCCCAGGCTTGGAGTGCAA 
TGGCTCGATCTTCAGCTCACTGCAACCTCC 
Sequence 2764 

GCGGNCGAGGNACAANCNACTTGGGGGGGCANAAAACCNGCCCCCCCCACGANGAGAAGG 

GGACNANGAGAANNNTTTACACACAAGNGGGGGANNNCCCCCNAAAAAACCGGGCGCAGN 

GGCNNACACNNNGNAANNCCAGNACNNNNGGGAGGCCGAGGCTNTTTNAAAAAAATTNNT 

TTNGAGGGGGGGGGGGCCCCCGN 

Sequence 2765 

CCGGGCAGGTACGCGGGNTTCCGCGGGGCTTGCTGGGAAGAGAGGCGAAGCCAGGTCACC 
TTTCAAGGACCCAGAAGTAGGGTTTTGGCCTAGGTAACCGGGGCAGAGATGTGGTTCGAG 
ATTCTCCCCGGACTCTCCGTCATGGGCGTGTGCTTGTTGATTCCAGGACTGGCTACTGTT 
TNCCT 

Sequence 2766 

ATAGGGCNMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTACCCTCTGCTTTTGCAAG 

GACTCTACTGTGTCATCTTAAGTGAGACAGGTGCATATGTATACTAGTAAAATTCTCTGC 

TTTCCTAGCATTGGACAAAAACAAAMTCAACAAAAGAATTGCCTCAGTGTCTTAAACTG 

GGATCCTTACTAGTTGACTAGGCACTTAGTTACTGAAGGATATGTGTGGAATTCAAGTTC

TTTTCAACCTATAAGAAATATCCGGCCAGGTGCAGTGGCTCACGCCTGTAATCCCAGCAC 

TTTTGGGAGGCCGAGGCAGGAGGATCACAAGGTCAGGATTNTCAAGACCNGCCTGGCCAA 

CATAGTGAAATTCCTGNCTTCTACTAAAAATN 

Sequence 2767 

CGGCCGCCCGGGCAGGTACTATTACTAGGTTCATTGTTTCCAGAGGGGTGAAACGGGGCT 

TTGGAGAGGTTAAATAACTTGCCCAGGGTCACACAGCTATTAAGTGGTAAAGCTGGGATT 

TACATGAGCCCAGACAAAGAACCCAAGAAGCTAAGCTATTNTCTTGTAATACCTNCAACA 

TAGGAGGCAAGAAGTGAGGTATTATACAGGTTGAGGAGATA 

Sequence 2768 

CCGGGCAGGTACGCGGGCATCAGCTCCGTGGGMCTCACGAGCCTGGGGAGAGTTCGTCA 

TCCCCACATGGAACCTCAGTATGGCCAACAGGCAGCCCTCTGGTGGAAAAATCAACAATC 

AGTCCTTGCANCAACTTGATGGAACNCTAGTTGACCACCATAAAGATGTCAAGCCAGGCA 
GCA 

Sequence 2769 

AGGTACTACTGCTGAGGTCTCCAGGACAGAAGTCACCTCCTCTGGTAGAACATCCATCCC

TGGCCCTTCTCAGTCCACAGTTTTGCCAGAAATATCCACAAGAACAATGACAAGGCTCTT 

TGCCTCGCCCACCATGACAGAATCAGCAGAAATGACCATCCCCACTCAAACAGGTCCTTC 

TGGGTCTACCTCACAGGATACCCTTACCTTGGACACACCCACCACAAAGTCCCAGGCAAA 

GACTCATTCAACTTTGGCTCAGAGATTTCCACACTCAGAGATGACCACTCTCATGAGCAG 

AGGTCCTGGAGATATGTCATGGCAAAGCTCTCCCTCTTCTGGAAAAT 

Sequence 2770 

CCGGGCAGGTACAGTTGGACCTGCTGGCATTCGAGGCCCTCAGGGTTCACCAAGGCCCTG 
CTGGCCCCCCTGGTCCCCCTGGCCCTNCTGGATCTCCAGGTGTAAGCNGTGGTGGTTAT 
Sequence 2771 

CCGGGCAGGTACTGTAGATTGAGATATAACAAAAAGATGATTTCTGAAATAATATTGGGA 
AAAGAATTTACCGGATGGTATTTTGTATCTTTGCACACCTCTGTTGTTATGATGCTATAA 
GGCAATCTTGTCCAACCAGCAGCTCACAGGCCACATGCCACCCAAGATGGCTTTGAATAG 
GGCCCTATACAAATTCGTAAACTTTCTTAAAACATTATG AA A I I I I I I I I II I II f I II I 
ACMTAAGGCTCTTGAAMTTGTCATTACTTGTGTTTTCCTATACATTCATCTGTGTGAA

AGCCTTTTTCTTCTTTGATTTAAAAAMTTAACTATACAGTTMTGGTTTAGMCT^ 

ACTAC 

Sequence 2772 

AGGTACACTCAAAGGCTATGTTCTCTCTCAGGACTCAAGATGAATTACTGGCAGAATTCC 

TCACTACATCCTTTAATGGAAACCTCTGTCGGTCCTCATCATCAGTGCTTTCATATTCTG 

ATTCTTCACTAGATAATTCCTCATCCTCGTCTGGCAAAGGAGGTTCCTCAGGTGGCTGAG 

GAGATGTGGGTGGAAGAGGTGCATGCAATGGCATATAGTCTTCATACATGGGAGGTCGCG 

CAGTAATTGGTCCAAMGGTGTGGGCAMTTCATTTTATTCATAAGATGAAGGACCTGTN 

CCTGCCCGG 

Sequence 2773 

CCGCGGTGGCGGCCGAGGTACTCAATGTTTAGCTCCCACTTAGAAGTAAGAACATGCCCA 

GCACTTTGGGAGGCCGAGGTGGGTGGATCATGAGGTCAAGAGATCNAGACCATCTTGGCT 

AACAANGTGAAACCCTGTCTCTACTAAAAACATAAAAAATTGGCCCGGGTGTGGTGGTGG 

GCACCTGTAGTCCTAGCTACTCGGGAGGCTGAGACAGGAGAATCGCCTGAACCGGGGAGG 

TGGAGGTTGCAGTGAGCTGAGATGGTGCCCAACAAGAGGGAAACTNCTT^JTNAAAAAAAA 

AAAAAMGAAAACACAGGTATAATTTCCTACCATGTTCTTACNGGATCATMCTTTAAAT 

GGTCACCCGCGTNCCTTGCCCGGGGCGG 

Sequence 2774 

CCGGGCAGGTAC 1 1 I 1 1 1 I 1 1 1 1 1 1 1 1 1 1 I I 1 1 1 1 AG AATAAAAG ACC ATTTTTATTCTA 

ATTTTAACTCAGAAATTATTATGCTTATTCMTTTMCACTTTCGCTGAAAGGTTAAAGA 

GATAAGAAGGACAGATTATMTTGCTTAATATTGCTATGGTAACTTCCATTCAAATACCT 

GTGAGTCACCAGAAGTCAAAAAGGTAGCCAGCATTGCAACACAGGATGGATCATGCAACA 

GAAACTAGCACCAGGTTACCTTATCTTATMTATTATTTGCTGTTAAAATGAAATTTTAA 

AACAGCACCAAMGTTAAGTTGGGGCTAAAACTGTTGTGCAGGAAAGATTTCATATAGCA 

G 

Sequence 2775 

TGGCGGCCGCCCGGGTNGGTACCAMGCCAGATCCTCCTGTTTTGTAGCAGGAAGCCCTT 

TATTAGTTNNTCTCTATCAATCCATCTTTNATAATNNCCAAAATAGGATAAG 

Sequence 2776 

CCCACGCGGGCGAAATTTTGGGGAGGGCAAACACCCCCCGCGGGNGGGCCGGGGCCCGCC 

TCCNAGTAAACCTAAGAGGGGGAGGNTNTTCCAANNCCCCGCAGNTNNNTTAAAANCCCC 

ATCAAGGCCCNAANCGGAANAACCCGGGGGGGACCCATCTNAAGGGGGGGGGGGGCCCCC 

CGNGGNACNCCCAAGCCAANNANGGNNNCCCCCCGNAAAGNNGGAGGGGGGCGNAAAANA 

GGCAGCCGCCCNCNNGCCGGNAAAANCAAAGGGGGNCCAAAAAGNCAGGGGGGGACCCNG 

GGCGGGGGGAAAAAAAGGGAAAANCCCCGGCNCCAACCAAAANNNCNANCAACCAAACCA 

ATAACCAAAAGCCCCGGGGGAAGGCCAAATAAAAANNGAGGGGNNAAMNGCCCCCGGGG 

GGGGNGGCC 

Sequence 2777 

ATGGGGGAAAACCCCGGGGGCGGCCGCNCNAGAACTAGTGGAGGNNTCCAGCCCGCAGGN 

AANNAAACCACnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnNCAA 

CNGGNCCCCCCAGNGAGGGGGAAAAAGCGCGCCGGGCGGAANCAAGGNCAAAGCAGGNGC 

CCGCGGGAAAAAAGGNANCCGCCCACAACAACCACACAACAAACGAGCCGGGAGCAGAAA 

GAGGAAAGCCCGGG 

Sequence 2778 

ATAGGGCGA ATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTAC 1 N I I I i 1 I 1 1 I 

minimi iC CNCCCN MCCAGTTCTMCCGCANGNCAATTAGTTTTTNAAAAGTCTCT 

CTCTTCTCATGTTTTTTCCTGCTTTGAMCTCMGNGNTCTCTTTTACTTMTANATCAA 

GATCCAACACTAATTTGACATTAAATCAAGACTCTNTACAAGTNGACTATCNCTAACNGG 

CTGGGGGACTTCTGCGGAA 

Sequence 2779 
ATTGNAGCTCCCCGCGGTGGCGGCCGAGGTACATTGCTGGCCTTTGCCCAAATTATGCTT

NCCCCATTTGGTATGACCTGACACCATTGTGATCAGTCTGATGACCTGGCAGCATCCCAT 

CTGCCTACCCAGTTCACTNTTGCTCCATTTCAGGGCCTCTTACAGGCAACTNCTNACATA

TATTTTTGACACTGACTCATGCCTTCTNAGGCTNAGCTAACATCAGCCCATTCATTGATN 

CAGCAAGACAATTTGAGAACCACTATGTGCTCTGCCAGAGTGAATAACAAAACCGCAAGT 

ACCTNCCNGGGCGGNCGGTCTAGAACTAGT 

Sequence 2780 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTA C I 1 1 1 I I 1 1 11 
I 1 1 1 I I 1 1 1 1 I I NTNCCTAACCAGTTCTMCNCMGTCMTTAGTTTTTCANAAGTCTCT 
CTCTTCTCATGTTTTTTCCTGCTTTGAMCT 
Sequence 2781 

GGGGGGGGNTTTGGGGGGGGAAAAAGCCCCGGGCGGGAGNGGACAGAAAAAGANNTNGGG 

GNNGNGAGGGGGTTAGGAGNAANCCCNNGNGGGTTTTCCNAAACCCCCAGNGANGCCGGG 

NCGGCNGNGCGCGCANNGGAGGGCAGNGCTATGNANCCGNCNNGGGNACNCCCAGGCCCN 

GGGNNCCACGNGGCCCNNCCGGCACGNACNGNGCCGCNAAACCNCGGNAGGNCAGGCGAN 

GNANCCCNCNGCANAANNGAAGGAAAAAAAGGGGAAAAGCGGGGGCAAAGAACCAACAAC 

NCNGGGGGAACCACAAGCNAGGGAACCCCGNGAGAANAANANANCNGGGGANAANNNNGG 

GGNGGANAGNAANAACAAAGGGAAAANGGCCCCCGAGGCGNGGAAAACCCAAAAAGAGAG 

GNNAACGGGGGGCGGNNAANNGNACCANNGGCGNAAAAAAAAAAGANAAAAAGAGGGCNN 

GGNGGGGGGGNCGNAAAAAANAACACACGGGACCCCNNNNAAGGGGGANACCCNCNCGGG 

GNGCCGGGGGGNCCCCAAGNAAAACCNNAAAGGGAGGGGNANNCCCCCCCCCGNGGGACC 

CGNANNAANGGGAAAAANCCCCGGAANGAAAGNCAAAAGGCCCNNGNAAACCGGNAAAAA 

ANCNCGGGGNCAGAACCCCCCNCANAANGGGGGGGGGGGGGCCCNAGGGGNAAACCCCCA 

AGGCCCGGGGGGGGGNGNNCCCCCCNNNNCAANNNNNNNNNNAAAGGGGNNNCNAAAAAA 

AAGCCAGCCGGCCNGCGGGNNCGGGAAAAAACNAACGGG 

Sequence 2782 

TCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCATGCCGATCTCTGAGAAGTTNTGTTGCA 

CCACTGTGAAGGTCTAGATGCMGCTTGGCTCCCTCAGAMGGCGCTTCCCTTTTGCATG 

GCTGAGGATCCTTGAAGGAACCTGGTCAGTCTCCGGTTCAGCTTCCGACACCAGAGTGGA 

ACCCAGTAAGCACCATCAGGAATGGATTTCACTACMGTGTGGATMCTCTGATTTTCAA 

AGGAGTAGTTACTTGCAMTTACATCCTTGCTGAATTCAGGAGGTATGAAACCCTATTTT 

ACCATGTTAGAAAACAGCCCAGGATTTTCTCATTGCTCTGCCATCATATATGTCTATGAC 

TTGAGCCCTTATTTTTCCATCTGCAAAACMTMTGCCTATATGTCTTTGCATATAGATT 

TGGAAATCTTCATTCAAGGTTTAGTAGGATCA 

Sequence 2783 

CCGCGGTGGTCGAGCGGCCGCCCGGGCAGGTACTTTTCTTCTTGAAGTGAGTTTAGATCA 

CGTTTCAGCAAACGTCCTCTGGAGCTCCTTCCTCTGGGGAAGGAGCAGCGGAGGGCTGTC 

ATCACCACTGCACTCTTCCAACCTTCTCTTCTGCTCATTCACCAGTTCCATCCTCATCTG 

ACCATATTACTCCAGGMCTTCCCTTTTCCCTTGTAAATTCTGTTCCATTTTTGGCCAAA 

CCATGGTTTGCCTCTGCTGCATTAGTTTG AAGTCA I I 1 1 II I I CCAGACATTGTGGCGGT 

AGTTCCCAGATGTCCGAAAAGATGMGTTATCAGTGCACGAGCAGCCAGGGCTTTTCTTT 

ATGTTGATTCACCTTAGGACACAGCCCAACAATGCCTGCAATTGTCATTAGCCCGTGAAG 

AGGTTCGCTTAGGGCTGCCACAGCACAGAGTCCCAGCTGCGCCCGTTTCTTCCCCCCCC 

Sequence 2784 

CCGCGGTGGCGGCCGAGGTACTA A I I 1 1 I I I II I I I I I II I i I I 1 1 GTATTTTTAGTAGA 
GACGGGTTTCACCTTGTTAGCCAGGATGGTCTCTATCTCCTGACCTTGTCATCCGCCCAC 
CTCAGCCTCCCAAAGTGCTGGGATTATAGGTGTCAGCCACCGCGCCCGGCCTGAATATCT
TTTTTATTTTTAAGCTTTCAATAMTCTTACT 

TTTMTGTATACATTTTGATGAGGTTGGACATATGCATACACTNGTGGTACCTGCCCG 
Sequence 2785 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACI I I I I I I I I I I M I I I I I I 




GAGACGGAGTTTCACTCTTGTCACCCAGGCTGGAGTGCAATGGCGCAATTAGGGTTCACT 

GCAACCTCTGCCTCCCGGGTTCAAGCAGTTCTCCTGCCTCAGCCTCCTGAGTAGCTGGGA 

TTACAGGCATCCACCACCGTGCCCAGCTMTTTTTGTATTTTTAGTAGAGACGGGGTTTT 

GCCATGTTGGACAGGTTGATCTCAAACTCCTGACCTCAGGTGATCTACCCTCCTCGGCCT 

CCCAGAGTGTTGGGATTACAGGCATGAGCCACCATGCCAGGCTGCTMTTCTCCTTTTTA 

GTGAGTTAGGGMCTGAGCCTCAGAAAACTTAMCGATTTCTCAGAAAACACTCAAGTGA 

TAMGTGGCCCCATTGGAMGGAGGTTTTTATCTTCTCATTGGCAGGCCCAGNGTTCATT 

GCACAATATCATGCTACCTCTTGGAATCTTTTAAAA 

Sequence 2786 

TCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAATTTTTTTTTTTTTT

1 1 I I I 1 1 I GTATTTTTAGTAG AG ACGGGTTTCACCTTGTTAGCCAGG ATGGTCTCTATCT 

CCTGACCTTGTCATCCGCCCACCTCAGCCTCCCAAAGTGCTGGGATTATAGGTGTCAGCC 

ACCGCGCCCGGCCTGAATATCTTTTTTATTTTTMGCT7TCMTAMTCTTACTGACATC 

TAATTGATAMAGTTGCACATATTTMTGTATACATTTTGATGAGGTTGGACATATGCAT 

ACACTCGTGGTACCTGCCCG 

Sequence 2787 

CCGCGGTGGCGGCCGAGGTACAATACAATCTAGATGACGGTGCAGACTAAGTCAAGAACT 

AAAGTTGTGCAGTAACCCGAGTTAAGGCATGAATGCGGACACACACATGCACACACACAG 

CACCCATGCTATCAAGACACAGGATTTTTTCAGTTGCCTCATGAGAGGCAACCTGGGCTT 

GGCAGTTAATCAGAACTGCTGAGCATTCCAGAAAATGCCCCCCACGACTTTATGCTAACA

GCTGTGTGTATGTTTTAATCAAAAAATTAAAGAAGAAAAAAAACCTAAAAAACAAAGAAA 

AAAACAAACAAAAAATCACCAAAAACCTAGAAACCCCTTAATCTCTTACAATGGCTCTTG 

AGCATGGMCTCATGTAGCAGCATCMTGGCTGGCTCTTTMCMTTTGGAAATAAAAGG 

TTGGTTTACTATGTATTTCTTTGGTAGTCATCACTACCAAAGTT 

Sequence 2788 

CCGCGGT GGCGGCCG AGGTA C I I 1 1 I II 1 » 1 J I I 1 1 M 1 1 I 1 1 1 1 GCTTTNATGTCTTTA 

TGTATTTA1 1717171 IGAGACAGGGTCTTGCTGTGTTGCCCANGCTGGAGTGCAGTGGC 

CTGGTCATGGCATNMGGCTCACTGCAGCCTGGACCTCCTGGTTCAAGNGATCXDTCTTGT 

NTGAGTCCCCTGANANAAAACCCCACCCCCNNCTACANMATTTTNTGGAAACANGGGCN 

NNAANCTGTTNCCTANGCNTGTNTGGAACNCCTGGGCTCAAGGGANCTTGTANCCTTANC 

CNCCTAAAANAGCTGGGANTTATAAGGCATGANNNAATTGTANCCTGNCCCGGCGGGCCN 

CTTTANAAAMGTNGGAANCCCCCCGNCCTTGCAGGAAATTTCATTATNCAACCNTNTNN 

NAATCCCCCCCCCCCNCCAN 

Sequence 2789 

CCGCGGTGGCGGCCGCCCGGGCAGGTA C 1 1 I II I 1 II 1 I I 1 1 I I I I 11 M 1 1 1 I M M AG 
NAGANATGGGGTTTCACCATATTGGTCAGGCTGGTCTNGAACTCCTGACCTTGNGACCCA 
CCCGCCTTGGTCTCCCAAAGNGCTGGGATTACAGGCATAAGCCCCCGTGCCCGGCCCACA 
TGGTATTATTTTATATAAAACGCAAGTTMGTNTTTATGTGTC 

CTTTNTCAAAAAGAANTTNACTTATTNGTTAAAACCNTTTTTTGNCCTAAGGGCCAAATA 
NGNGAAMA AAANCCC NNATTNNNCTTTTTANATTTTTNriTGGCN 
ATTTNNCNNNTTTTTTTGGNGNNANTAAAAAAAAAGGGTTTTTTTN 
Sequence 2790 

TACTATAGGG CGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTA C I 1 1 1 1 I I I 

I I I 1 1 I I I I I IGAGACGGAGTTTCACTCTTGTCACCCAGGCTGGAGTGCAATGGCGCAAT 

TAGGGTTCACTGCAACCTCTGCCTCCCGGGTTCAAGCAGTTNTCCTGCCTCAGCCTCCTG 

AGTAGCTGGGATTACAGGCATCCACCACCGTGCCCAGCTMTTTTTGTATTTTTAGTAGA

GACGGGGTTTTGCCATGTTGGACAGGTTGATNTCAAACTTCTGACCTCAGGGGGAATTAA 

CCTTCTTNGGCTTCCA AAAAGGG TGNGGGATNAACNGGGTTGNGCCCCCCCCCCCCNGGG 

GGGGGA AAAAATTCN NTTTTTTTNGGGGGNNGGGGNNNCAANNNCCCCCXNAAAAAAAAA 

NNNNAAAI I I I I I I I I ANAAAAAANNANNCCTTTNTNAAAAAATGCCCCCCNCCCCAANG 

NNGNNGGGNTTTTTTT 




Sequence 2791 

GGTCTTCTATGTGGGTGTCAAGGATATGCTCCCTCACGGCTTCCGAAGGCCCCCAGCAAA 

AGATCTMCATTCTTGCTCAAAGTTGCGAGAGAAAGTAGCACATGGAGTAGCTGGGTTGG 

GGCGGCGGCCTCTTCTCTTCAGCTCCCTTAGCTTGGCTCCGTAAGTGGATCACTTGCCAA 

ATGCTTTAGATGATTGCCTCTCAATAATTGAAAGGTGGTGGTAAGTTGTATTGCTGCACT 

GTCGGTGTTMGAGAMTTACTCTCACAAGAGCAGAGGCCTGMGATTCTTTCTTCTGAA 

GGGATGATGAGCCTGGACTCTCTGGACTCCTAGATTATGAACTCCTGCAGTGGACCATGT 

CCTAI 1 1 I I I GGAGGGCGTTGGG 

Sequence 2792 

CCGCGGTGGCGGCCGAGGTACCTTCAATACCTTTAGTTGTCTCCCACACACGCGTGTGTG 

TGTGAMTCTTCTACMTATCTTCCCTTTTTTAGACCATGTTCACTGTCAAAAAGGTGCT 

TTAAGAGCAGTCTTTGGCTGGGCACGGTGGCTCACACCCGNAATCCCAGNACCCTGGGAG 

GCCGAGGCAGGCGGATCACCGAGGTCAGGAGATCMGACCATCTTGGCTAACAGGATGGT 

CTTAAGGGACAGNGAAACCCTGTCTTNAACTTAAAATACAAAAAATTANNTGGGCGTGGT 

GGGNACCGCNCCTGTAATCCCANCTACTCAANGAGGCTNAAGGCAGGANTAATCACTTTG 

AACCTGGGAGGTAGAGGCTTGCAGTTGAGCCAAGATTGCACCCACTGCACTTTCANTCCT 

GGGCCGAANAANAGCCANGGACTTTCATTTTTAAAAAAAAAAAAAAAA 

Sequence 2793 

CCGGGCAGGGACCNCGGGATGGTGNCAACTTATGACAGGACCCATGGGCCCTCCCNATGC 
ACACAGNACTNTTGGMTCTNATCCTTTTCCATGGCTCTGGCTCACACTTNCACAGNATT 
TACTCCTAAATATGCCCCCTGNGTTCA 
Sequence 2794 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACANATTTGAATGGCTTTGA

CTTTTGGCAGCTGCACAGNGCTAGGACTGGACCATGAAATATCTNTGGGCTNTGNCAATN 

ACATTTGGGTTAANCTAANCCTGATCCCATGTGTTCCTGGAAGAGAAGCCCCATGACATT 

CAAAGTCCTTGACAATNTGACACCAGCTTTTNTAACCNTATAAGGCCC 

Sequence 2795 

ACTACTATAGGGCGTCGAGCGGCCGCGCGGGCAGGTACGCGGGCACACTGAGGAATTATG 

ACTACTATGCAAGCCGAGTTCCAGAATCTATTAAAAATGCACTTGTTCCTTAAGGGAAAG 

TTTCATTTGGCCGGGCGCGGTGGCTCATGCCTGTGGTCCCAGCACTTTGGGAGGCCGAGG 

CAGGTGGATCACTTGAGCTCAGGAGTTTGAGACCAGCCTGGGCAATATCGTGAGACCCCA 

TCTCTACAAAAATACAAATTAACTGGGCATCCTGTCATGCGCCTGTCGTCCCAGCTACTT 

GAGAGGCTGAGGCAGAGGAATCTCTTGGGCCCGGAAGGCGGAGGTTGCAGTGGGCTGGGA 

TCGTGCCACTGCACTCCAGCCTGAGTGACAGGAGTTAAGCCCTGTCTCAGAAAAAAAAGA 

CAAAAACCCAAAAAGTACCT 

Sequence 2796 

TACAGAAGCCGGGAGCATAAAGACGTATAAGCCTNGGGGGTTGCTCTAATGTAGGTGAGG 

NTAACATNACATTANATNTGNAGTTGNCGCCTNACTTGCCCCGCTTTTTCCAGTTCGNGG 

TAMCCTTGTCNTTGCTCAGCNTGCATTTTAATTGTAATTNGGTCCAACNGCGCTGGANG 

GAGNAGGCTGGTTTTTGCCGTTATTTGGNGCCGCTTC 

Sequence 2797 

ACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGATGATAAT 

MGA TGCM TTTGAA TCTTC ATCATACGAAGGATACTTTCTAGCTTGTGAAAAAGAGAGA 

GACC7TTTTAMCTCATTTTGAAAAAAGAGGATGMTTGGGGGATAGATCTATAATGTTC 

ACTGTTCAAAACGAAGACTAGCTATTAAAATTTCATGCCGGGCGCAGTGGCTCACGCCTG 

TAATCCCAGCCCTTTGGGAGGCTGAGGCGGGCAGATCACCAGAGGTCAGGTGTTCAAGAC 

CAGCCTGACCAACATGGTGAAACCTCATCTCTACCAAAAAAAAAAAAAAAAAAANGTACC 

TGCCCC 

Sequence 2798 

CCGCGGTGGCGGCCGAGGTACTGAGCCCTTCATTCCCCAAACTCAGACTCTTAGCTCTTT 
TGTCAACTCTGGGCAAAGGTCAGCATTGGAATCGAGCGGCCGCCGGGCAGGTACATTTTC 
TCTCATCCACTTCCGATTCTCTTCATTGGCTGCAATATCTTTTTCTTCAMTCCTATTTT

GTTGGCTTCTAGGAAACCMGCACATCTTGTTGTTTCTTCTTAATCGCTGTAGAGCCAGA 

GGMGATGCAATATATACACCGGATCACCATCCTGGGAACAGCGGCTGCGGTTGTTTGGG 

TCCTGAAAAGGGCTGTGGAGCAGCTGCAGCAATGGCTGGAATCX5AGCTAGGGGCTGAAAC 

AACGGTTGGCAGAGAAGGGTGGGGGAAGTGGGAAAAAGGAAGAACTCGCCAGAAGCCCCG 

CCTTCGNCTTTAGCAAGCTTCCACCGAGCTCTTCTAAGCGCTTGAGTTNCAGCCAAAAAC 

CCCCCCCCG 

Sequence 2799 

AGGTACTAACCTCTTACCTTCCAAGGTGGTAGAACATGCTTGAAAAGATAGTAAGTGAAA 

AGGGGTAGCCAGTGCTTTACTCACAAGACTGCTTGAACATGAGACTCAAGGAGGGACCTC 

AGCAGGCCTGGGGTGTTCAGCAACTATTCCTGGCCGGGGCATCTTGCAAAGGAGTTGCTG 

TGACAGTAAGCTCTTCCACTTTGAGACCGTCACCTCAGCCACGGCTCCCACCTGGGCTCA 

GGAATGGTCAGGCAACACGTGGGGCCCAGGATGGCGGTGGATGACTAAACTGCCGAAGAC 

CGCGCCGCTGCGACTACCGAAGTACCTGCCCG 

Sequence 2800 

CCGCGGTGGCGGCCGCCCGGGCAGGTACMGTATGCMGTTTTCTGTAAACAGATTACTT 

TCAAACAGTTGGGAGCCCCAGGGAGATAATAGAAGGTATCCTACTCAACAGCAGTATATA 

GAATGCTGTAGAAGAGGAATATTGTAAGAACAAGAACTATCTCTTGAAAACTAAAAATAT 

G ATMCTG AMTATAAATTTAAATAGGATTGGAAGCTAAAGCTAGGAAATTACCCTAGAA 

TGTTTTTTGAGAAATGGAAAATACCGAAGACCAAAAAAATAAAATAAAACTATGCTGTGA 

MGAGAMGATTAGCATTTGAGAAGGCGGGAAGTTGGTCCTGCTCAGATGCCGG7TTTCA 

GATGCCTTATTCTAAAAAATAGNGATAGAAAACAAATTTCTACAAAGCAAAATTACAATA 

AATTTA 

Sequence 2801 

CTACTTAGGGCNATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGACGGGTGCCTGTAG 

TCCCAGCTACTTGGGAGGCTGAGGCAGGAGAACTTGTTTGAACCCGGAGGTGAAAGTTGC 

AGGGAGCCGAGGTTGTGCCACTGCACTCCAGCCTGGGAGACAGAGCAAGACTCTGTCTCC 

AAMCAAACAMCAMCAAAAAMCCCTGTAGCTITGGGATCAGCCTTCTCTTCTGTTGTT 

TTTCTTTAAAAAATAAAAATTAAAAATAGGCTTCAAGTGATCCTCCCGCCATGACCTCCA 

AAACTGCTGGGATTGTAGGTGTGAGCACTGCACCCAGCCGTATGTTTTTTTCTACATAAA 

AAACAGCACAGGATTATCTTCCAAAGCTAACAAATATGTTCAAATAACCACAACCCCAAN 

TNNAAAAAAAAAAATNAAGTACCTGCCCGGGGCGGCCCCTCTAAAAACTAANTGGGATCC 

CCCGGGGC 

Sequence 2802 

NNANGNCCCNNNAGNGAGGGGAAAANGCGCGCCGGGCGNAANCAAGGACAGAGCNGNNNC 

CCGGGGGAAAAGGANANNCGCNCACAANACCACACAACANACGAGCCGGGAGCANAAAGN 

GGAAAGCCNGGGGGGCCNAANGAGGGAAGCCAACCCCACAAAAAANGCGGGGCGCNCACN 

GCCCGCNGNCCAGGCGGGAAACCCGGCGGGCCAGCNGCAANAAAGAAAAnnnnnnnnnnn 

nnnnnnnnnnnnnGNNNGCGAAAAGGGCGCNCNNCCGCANCCCCGGACACCNAAACGCNG 

NGCNCGGNCGGNCGGNNGNGGNGAGCGGGAANAGCNCACCCNAAGGCGGGGAA 

Sequence 2803 

NCCACCGCGGTGGCGGCCGTNCGGGCAGGTACTCCTTTCAGAGGGTCATCTCCTCCACAA

GTAi i i i i iGT TTCTT TGGCTGGTCTGGGTCCAATGCTGTTGCCATCCCCAGCTTCAGAC 

TGTTCTCCTTGTTTTGGAGMCTTTCTTTGGACTGTATCTTCAGAGACACTCCTGGTCAA 

GGGGCCTCAGAGGACCCAAACGCTCTGAAACAGCGTCTTAGCTCATCGCCGAGTGTCAGC 

TCTAGCTCTTCGGAGCGCTTTCTTCTCCCCCCGCGTACCT 

Sequence 2804 

CCGCGGTGGCGGCCGAGGTACAGAGAMATATTTTTTAAAAATCTCATCAGGCTAGGTGA 
GGTGGCTCGTGTCTGTAATCCCAGCACTTTGGGAGGCCACGCTGGGTAGGTTGCTTGAGT 
CCAGGAGTTCAAGACCAGCCTGGCCAACATGGCAAAACACCGTCTCTACAAAAATAATAC 
AAAAATTAGTGAGGCATGGTGGCACACACTTGTAGTCTCAGCTATATTACTTGAGAGGCT 
GAGGTGAGAGGATCACCGTAAGCATGGGAGGCAGAAGTTGCAGTGAGCCTAGAATGCGCC

ACTGCACTCCAGCCTGGGCGATAGAGCAAGACTGTNTCAAAAAAAAAANATTTGGGGAAT 

AAAGGAAAGTTCCTGCCCCGGGGCGGGCCCGNTTNTAGAAACTTAGNNGGATCCCCCCCG 

GGGCCTTGCAAGGAATTTNCGATATCNAAGCTTANTTCGANTCCCCGTCCACCCTNGAAG 

GGGGGGGGCCGCGGNTACCCCAAATTTT 

Sequence 2805 

TAGGGCGAT TTGGAGCTNCCCCCGGTGGCGGCCGCCCGGGCATGGTA CI 1 1 I 1 1 1 1 1 1 1 1 
I 1 1 1 1 1 1 1 I IGCCAGCTTTTCTAAGCAAATAGATTGTCTGAATTAGTCACAGAATAATTT 
TGTGAAMTTCATGTTTAAGTAGCAACTACCCTTTCTTTTTTTATATATTTTTAAGGNAT 
TAGTTTATCTTCTTCTAACTGGNGCAGTCACTTMTGTTTTCATTAATCTTCGACCTGGA 
GAGNGAAATACTGATATTTCTAGAAAAAAATTCTACTCCTCTGATTATTTGAAATGCTGA 
GGAAAATGTCCCTCCCATAGTAAAACTTGTAAATAAGGAACTATATCATATTCAGTAGCT 
GNGTTCTGTTCCAT C I ti l l 
Sequence 2806 

CCGGGCAGGTACATTCTCTGTTCATTACTTAGTTCTTAAGGATATGTGTTCATCCATTCT 

GTCGCTGGCTCAGAGTTTGCTTCACTCTCTAGACCAGAGTATAATTTCATTTGGCAGTCT 

CCTATACAAATATGCATTTMGTTTTTTGACACGTACGCGGGGGACTCAACAGAAATGGG 

TTTCCAGAAGAATAATGAAAAGTTGTGGGTAGGAAAATGAATCATTTGGACTCTTCAATG 

AAATGGAGTGAGCCCAGGAGAGCTCAGCCAACAGAGGCACTCTGGGAACCTGTTAGTAAA 

GCCAGGCTGGCCAAATGCCATTTGATTTTGAACCTCGTAGGTCCCCACTCACCCTCTGCC 

AGGAGCT 

Sequence 2807 

AGCCTCACCCGCGGTGGCGGCCGAGGTACCCCGGGTGTTCCTTTTTGTTCAAAGTCTATT
TTTATTCCTTGATATTTTTC 1 1 1 1 1 I 1 1 I 1 1 1 1 GTGGATGGGGACTTGTGAATTTTTCTA 
AG GTGCT ATTTAAACATGGGAGGGAGAGCGTGTGCGGGCTCCAGCCCCAGCCCCGCTGCT 
CACTTTTCCCACCCCTCTNCTCCCACCTGCCTCTTGGCTTCTCAGGNCCTCTGNCTCTTC 
CCGACCTTCTCTTCTTCTTGAAAACCCCTTCCTNCCACAGCTTGCAGCCCCATCCTCCCC 
GGGCTTCCCTNCCTAAGTCTGGTCCTGGCGGTCCTCTNGTCCCCCGGGNTTTCNAGAGGA 
CAACTTNCCCCAAAGGCCCAAAGCAGTTTTTTCCCCCCCTAGGGGGGGGGGA 
Sequence 2808 

CCGCGGTGGCGGCCGCTCGGGCAGGGTACATGCCACCACGCCTGGCTTATTATTATTGTT

TTGTTTTGGAGACAAGAGTCTCGTTCTGTCGCCCAGGCTGGAGTGCAGTGGCACAATCTC 

AGCTCACTGCAGCCTCCGCCTCCTGGATTCAAGCAATTCTCCTGCCAGAGCAGCTGGGAT 

TACGGGCACGTGCCACCATGCCCGGCTMTTTTTGTATTTTTAGTAGAGATAGGGTTTCA 

CCATGTTGGCCAGCCTGGTCTCGAACTCCTGACCTCAAGTGATCTGCCACCTCGGCCTCC 

CAAAGTGCCCGAGGTTACAGGCATGAGCCACTGCACCCGGCCTATTATTATTAACTTAGT 

GTTTGCTAAGTGCTTTATAGATACGGACTTGCTTAAATCTTATAATAAGTCCTGAAAGAT 

GGGGTGATAACCTCATTTAAGAAATT 

Sequence 2809 

AGGTACTGTAMTATTACCATTATTTAAAATGTTGACATTTCTGCATTAAGTAGAAACTT 

TCTAAATGCCTAAATACCACTCAAACATGACTTAAAAGAAATTGAATGACTCACCACTAT 

GACCTTCAAGAGTCTGATTCATAGAAAGGTTACTGGGGGCTGCAAGGCCCCTCAATTTTG 

CATCATCTGTCTGCGTCTCTAATTTCAAAACTTTCAGTAATCCATCTTCACCACCGCATG 

CTATGAACCCTTGTTCCTTGTTCCAGGATACACACTGCAGCTTCACGTTATTGGGAATGG

AAA 1 1 ii ui iGCTCAGGTAGAAGAACATCGTGGGATCCCCGAGAGGGTCACGGCGGCCGC 

T 

Sequence 2810 

AGGTACGCGGGGGGCTCTGAGAGGAGTCTACCTTGCCTTCTTATGGGAAGGGAGACCCTA 
AAAMCmCTCCTCTTTGTCCTCCTTTTTCTCCCCCACTCTGAGGTTTCCCCAAGAGAA 
CCAGATTGGCAGGGAGAAGCATTGCGGGGCAATTGTTCCTCCTTGACAATGTAGCAATAA 
ATAGATGCTGCCAAGGGCAGAAAATGGGGAGGTTAGCTCAGAGCAGAGTAGTCTCTAGAG 
AAAGGAAGMTCCTCMCGGCACCCTGGGGTGCTAGCTCCTTTTTAGAATGTCAGCAGAG
CTGAGATTMTATCTGGGCTTTTCCTGMCTATTCTGGTTATTGAGCCCTTCCTGTTAGA 
CCTACC 
Sequence 2811

GGCAGGTACI I I 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 GCAACATTGCCCTTTGATGTCCCCATGAGG 

GCCAGGCCCAGGCAGMCCCATCCCATTTTATCCTTAMCTCANMGGAMTTTGTCTAA 

ATATTAAAGGATTAATATGGGAATAAAAAATGAACCTTAAACCCTGCCACTGATACACAA 

GCTGTCTNTCTTAGAGTTCMTGAACACTTCAGGAGAGTATTTCCAACAATATTTAGATA 

TTGGMTATCTAMTATTGTTGATTTAGATAACCACCCTAGATTTCTCACCACCCTAGAA 

CATTTAGTGGGGAGACATTCTTTTCTCCTTTTTCTGATAACTTGGTCAGAAGTGATTGAC 

TGTGCAAATGGTATTT 

Sequence 2812 

NNCGGCGGCCGGGCGGGNACGCGGGNGAGCNNTACGAGGGNCAGGAANCCCAGANGCCGC 

NCGACCTGGAGNCAGCCTACAACGCNTTCAACCGCTGCCGGCANGCCCGAGCCCGGGGCA 

CTAGCCCNNGCACAGAAGGGCAGAGNCNGAGGCGANGGCNCCNGGNCCCCNGNCCGCCAC 

ACAGGCCTTTGGTrsfTTNTCACACAACNCACGGGGCGGCAGCCGCCNGAAAGNAGACNGNC 

CCCGGGGGCAGAACANNGGGGGCCGGGCCCCNCCCCACAANAAAGANGCNCNCCGACAAA 

AAAAAAAAAAAAAAAAAGNACCNCGGCCGCNCNAGAACNAGGGGANCCC 

Sequence 2813

CGGTACCCMGCTmGGTTCCTTTMGTGAGGGNTNMTTGNCGNCGNCTTTGGCCGTA 
ATTCMTGGGTCMTAGCTTGGTTTCCTGTGGTGNAAATTGGTAATCCCGCTTCAACAAA 
TTCCCACNCAACATACGNAGCCCG 
Sequence 2814 

CCGGGCAGGTACGTTCTTTTNGCTTTTCC 

ACACATTTAGAAAAMGTTAMCCCCAAGACNCTTTGGGGATAGGTTAATTTTMGANGC 
CCAACTTTGGACTNGGATTAAAGGMNTACCTTAAAAANCCCNGGNAAMCAATTATTTT 
TTGGG 
Sequence 2815 

GGCGGCGGCCGAGGTACAGAGAAGCCATCANTTTAGAGGGCAGCANAAAACCAGAAGCCN 
GNTTTGATCCCTNAACACCAAGANGCCTNTAACAACANGNCACCAGCACCCCCAGGAAGG 
CCAAGGAGTCCCACAGAAAAACCTAGGGNNAGACCAA 
Sequence 2816 

GCGGCGGCCGAGGNACMGNAANCNCCTTTTTTGGGGGGGGGAAAAACCCCCCCNCCCCN 

NNNNCNACCCAGANGAGGAGGGTTTCNGCCCCCAGGGGAACANCNCCNAAAANCAGCNCG 

G CCNNNGNC GGGACCAGNGCCAGGNGGACAGCCAGNGNCCNGGCCAGANGAAAGGCNGCC 

Gl I 1 1 I I I I INTTNGGGGGGGNNCGGGGGGGGGCCCCCCCAAAAAAAANCAGCCACCACC 

AGGNGGGGNGGGGAGGA 

Sequence 2817 

AGGTACCCTGAGGTGCTCCGCTGGGGACTCTGCTCATTCTGGGGGTGCAGTTGACGGCTG 

GTCGTGATCTTTCCCCGTAATCTGTCCCCTCTTACGGAACCTAGTCTCCCGTTCTGGCCA 

TGGCCTTTCTTCTTGGACACTGCTTAGGANCCCAGAMGMGTATTGTTATCAAATTCTT 

AAAGCCTTAGGAAGAAAGTCNAGGGAGTGGGAAAACCAGGCTTCTGANAAAGAATACCTG 

NTTGGCCCACCTGNATCTTCCNAGGNCANCCACGGAANTCCCGGGCCCCTTCCAATCAGG 

NAAGGTCGGNAATCTCTGATGGTCNATCGGTTCNATGGCCAACCTGGCCAACCAGTTTGA 

AAAAAAA 

Sequence 2818

CCGGGCAGGTACTGTTCCTGTTGGCCGAGTGGAGACTGGTGTTCTCAAACCCGGTATGGT 
GGTCACCTTTGCTCCAAGTCAACGTTACAACGGAAGTAAAATCTGTCGAAATGCCACCCA 
TTGAMGCTTTTGAAGTGAAAGCTTCTTTCCTTGGGGACCAATGGTGGGCT 
Sequence 2819 

AGGTACi i i m i i i i i i i 1 1 I I I I I I I GGGTTAGGATGGTTCTAACCTGATGGGTTGTGT 
TTACAGTGGGGTTTCCCCCAAAGGTTATCTTCCTGCTTTCCTCTGTAAATAGGGCAGTTG

CTGCAACAGATTAMTACACTCGGGCCACCTGTGGGTTAGTGGGTCMGMTTTTTGACA 

GAAAGGCTATAGGCTGCCGGTGGCCCCCGTGTGTTTGGGTGAGCACCCCTAGAGCTATCC 

CTTTATTCACATTGACGAAGAGGTGGAATGGCTGTTCTAGGGAAGGTAGAACTAAGGCAG 

TTATGAGCAGGTGTTTTMTTCCTCTACTTGTTGGACCTCCCGTGGT 

Sequence 2820 

AGCTTATCGATACCCGTCGACCTTCGAGGGGGGGCCCCGGTACCCAGCTTTTTGGTCCCT 

TTMNTGNGGGNTAMTTGGCGCCGCTTTGGGCGNAANTCANTGGNCANTAGCTTGGTTN 

CNTGGTGGTGNAANTTGGTAATCCCGNCTTCNACANTTCCACACANANNACGAAGNCCGG 

GGAANCNTAAAGTGGNAAANCCTGG 

Sequence 2821 

CGGCCGCCCGGGCAGGTACCATCTCTTGGGAAAACCATGCTACCTCTTCTCTCTGTTCTC

TATTTTGCCACTAGAGAAATAGAAAATAAGGCTGGGAGCAGTGGCTTATACCTGTAATCC 

CAGCACTTTGGGAGGCTGAGGCGGGAAGAATCACCTGAGGTCAAGAGTTTNAAGACCAGC 

CTTGACTAACANTGGTNGAAMCCCNCGTNTTTTn"NTTTAAAAAATACC 

Sequence 2822 

CCGCGGTGGCGGCCGAGGTACTCCAATCCGGGTGACAGAGGGAAGACTCTGTC7TAAAAA 

GAAAAMAAATCMTAGAMTCGGTTTTTATTTATTTATAGTATGTGCTTTGAGTGGTTC 

TCMTCAMTTGATTTTTCCCCCTTCCAGGGGATATTTC 

TTTGCCCCATCTTTGGGTATCATATTGGCATCTAGTAGGTAGAGGCCAGGGTTGTTGCTA 

AGCACCCTATAATCCACAGGACAGCACCCAAAAACAGTTACCCAGCCCAAAATGTCAGTA 

TTGTCAAGGTTGAGMGCCTTMTTTAGATGTAATGTTAAGAGTTCAGTAAATTGGCCAG 

GCACGGTGGCTCACGCCTGTAATCTCCTAGCACTTTGGGAGGCAAAGGCAGGCAGATTGC 

CTGAGCTCANGAGTTCGAGATCACCCCGGGCAACAAGGTGAAACGCTCTGTCTACTAAAA 

TACAAATCC 

Sequence 2823 

CCGCGGTGGCGGCCGCCCGGGCNNGTACTCCTCTTGCTACCACCTTTGTTGCAGAAGATG 

AAGGGGAGAGGGAGCTTCTCCCTATGGCCTCATGGCTTCTTGTGAGACAGATCAGTCCAG 

CCAGATACAGAGCAAAGCAGCTTTGCATCACCGCGGGCCAGTTGCTGATGCCCAGCTTTA

TGTCTAAAAAAAAAAAGTGGAAGCCTCAAGGGGGATGGAGGATAGCAAGAAGAATGGGTG 

CCTTGGCCCCAGAGGCATTGTAGGGAGAGGAAGACAATGTATCTCATCAGGGTTCTCAAC 

ATTATGAGATTATCACACATCMCTATCTTTGGAGGGGCTGAGTGATTGAGTTATGGCTC 

TGACTCCTCTCTGGGGGTGGAGAGTGAAAGATGACAAANAAGGCCATCTGTCCCCTAGGA 

GACACAGTTTGCAGTATAAGACAGGACANAAGAGAACAGAAAAACAAATNCAACTGGAAA 

AAAGGGGTGG 

Sequence 2824 

AGGTACCTTAGAGCAACATGCAAAGCTTCCCTNCTCAGCAATCCCAGGTTGGGGCCCCCT 

GTCTTCCTATCGTCCTACCCGTCACAACCACCACTGCAGGCTTCTGATGCTCTGTTTTCC 

TCCTCTGTCTCAGTTCAGTTGCTCTGAGTTAGAGAGGAGCTCTCTGGGACTGGAGCAAAT 

GACTGCACCGGCCCTCATGGGACTTCCATTTCAGTGGATTCAGGGGAAAGCAGCACGTGT

TTTTGAGAGACCTTGGCMCGCAGGCGACCTTGACGTCGACCAAAANMGAGAGATTTGG 

AATGNTTGTTTACAGCCTCCGGTCAAAAAAAAA 

Sequence 2825 

TTAGGGCGTTTGGAGCTCCCCGCGGTGGCGGCCGCCCNGTCAGGTACCANNNCTTAGCAh 

GGAANNTGGACAACANAAGCTNTAAATCCTCTTGCATCGNCACGNTNAATTTGCACTGAC 

CAATCTGTTGGCACAGTAACTGGTTATMGCTAMTTTCTACATTTTGGCTACAAGTATN 

CCAMTNCACCTTTTAAAAAATCCTATGTNAGATGCCATCTGGTGTTAATGATTTGCACA 

CCC CTTAM TTGAMNTATTNCAAATAAATCTNACGGATTTATATANNATNATTAATGNN 

TNTATTTTAAAMGACAATCTGANMTAACACTTCCCCTMTTGTTGTCTTAATAATGAC 

CAAGAGCTGNNGAAAAATNATTCACACTGNTACGTCGTTNTGTTGGTTTGCTCACGGGGG 

AAGGGGGGTTG 
Sequence 2826

GCG MTTT GGAGCNAAACCNCGGNGGCGGCCGAGGTACTTCGGGGAGAGNNTNTCCTNCC 

NTCATTTTGMCNCCAGCGGCCTCTTCCCCTTCCNGGGGCTGCTTGCCCTGGGAACNCTG 

GCACCTTGGGCTGNGGAAGGCTCTGGAAAGTCCTTCAAAGCTGGAGTCTGTCCTCCTAAG 

AAATCTGCCCAGTGCCTTAGANACAAGAAACCTGAGTGCCANAGTGACTGGCAGGGGCCA 

AGGGAAAAAAAAAAAAGTTGCCCTNANCCNTNGGGGGAAAAAAAAGNGCCCNGGAAACCC 

NGGGGGGNCCCCC 

Sequence 2827 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCGGAACACTTGAGGTCATGAGTCTGA 

GACCAGCCTGGCCAATACGGAAAAGCCCCGTCTCTAACAAAAATACAAAAATACAAAAAT 

TGGGTGTGGTGGTGTGCACCTGTAGTCCCAGCTACTCAGAAGGCTGAAGTAGGAGAATCG 

CTTGAACCTGGGAGGCAGAGATTGCACCACTGCACTCCAGCCTGGGTGACAGAGAGACAC 

TCCATCTCAAAAAAAGAAAAGAAAGGAAAAAAAGAAAAGAAAAGAAATGTTGAGGCAAAT

GMTATACMACACATTTTAGATTAGCATTTGAATTAGTAAACTGCATAAAAAAGATCCA 

ACATGAGCTGACATCATCCAATCCATTGAGGGCCCAAATAGAGCAAAAAGGCAGAGGAAG 
AGCAAATTC 

Sequence 2828 

CCGCGGTGGCGGC CGAGG TACAGAAAGAAGAGTATCX^ATTTCAACTACTAGGTTAACTGC 
CTTTGATGAGCTTGATTTTACCTGGGTCACTAATTCCACAGAACCAATGTAGGTGTCTGG 
GCGGAGCAAAATATGCTCCAATTGTGTTTTCTTTTGATAGATTCTTTCAACAGACAGTCT
TTTCTTAGCATCTTCATTTTTCTTTATTTTGTTGACTTGCATAT^ 

CAATGGTGACACTTCCATGGTGACGGTCGTGAAGGGGCTCAAGAACCCTGAAAGCGACTA 

AACAGGCAGGACCCCACGAGACCACCCCCGACCAAGGCGNTTCTCCACAGACGCGCGTCC 

CCCGCGTACCCGCCCG 

Sequence 2829 

CCGCGGTGGCGGCCGAGGTACTTTAAN 1 1 1 I 1 M M H I I 1 1 1 I I I I A l I 1 1 1 1 1 1 II I I 

m I M t I 1 1 I I I I 1 1 I I I I II I I I I I I 1 1 I I I II I I 1 1 I I I I I M I II 1 1 1 M I I 1 1 M I 

1 l M l l l l l l M I I 1 1 I I I I I NGGGNNGNNN ANNNTTNTTTTAAAAANGGNAAAAAAAAA 

ANNGGGGGGNGGNNNNNGNNNAAAANNANNNGNNNNGGGGNNAANCCCCCNNCCNNNNNG 

GGNMNCCCCNNNNNAANNNNNNNrrTTTNNGNGGNNNNNCCCNTTNTNAAANAANCGGGG 

TNAAAAAAAAAAAAAAATTTTNGGGGNNATTTTTTNNNGNNNCCCCCTNANCNNGGGGGG 

GNNTNTTTNCTNTTTTNANACCCCCGGGGGGG 

Sequence 2830 

CTACT TAGGGCGAATT GGAGCTCCCCGCGGTGGCGGCCGAGGTAC I " II I 1 1 1 rGTTTTGT 

TTTGGI M M M it l l iGGCTTGACTCAGGATTTAAAAACTGGAACGGTGAAGGTGACAG 

CAGTCG GTTGGAG CGAGCATCCCCCAAAGTTCACAATGTGGCCGAGGACTTTGATTGCAC 

ATTGTTGTTTTTTTMTAGTCATTCCAMTATGAGATGCATTGTTACAGGMGTCCCTTG 

CCATCCTAAMGCCACCCCACTTIsrTNTNTMGGAGMTGGCCCATTCNTTTCCAAGTTNC 

CNNANGGGGANAMA NANCNN TTNTTTTC^ 

CCCCCCCCCAAAMCCTTTTTNNm 

CCNCI I I I 1 1 I I II I I I II 1 1 CACNCCAAAAAAAAAAAAA 

Sequence 2831 

CCGCGGTGGCGGCCGATGT A C \ I 1 1 I II I I I M M M I I I I I 1 1 I I 1 1 I M I I I I II I I I 

n i I I I I 1 1 i i i i i i n I 1 1 1 l MCCCTTTGACT^^TTTTTAANGCCNCGGGNGGATNTC 

NTTGCCCANCTCCCANNAATGTTTGCCCTTNTMNTCTGTTCCACTTTTAGGNGGNAGCC 

ACCAGGCCTTANCCATCCCGGGTACCTNGGCCGTTTTAAAANNAGGGGNATCCCCCGGGN 

TNNANGMTTTNNAAANTNAAAGCTTNNNNGAAACCNNCCCNCNNTNGNGGGGGGGGCCC 

CGGGCNNNAAATTTTTTNTTT^ 

AAAAAAANGNAAAAAAN I I 1 1 I M I I NTNGGNAAAAAA A I 1 1 1 1 1 I 1 1 CCNNAAAAAAAT 

NNCAAAAAAAAAAAANAAANGGGNGGGAAAAAA 

Sequence 2832 
CCGGGCAGGTACAAAGATTCCTCACTGCGTGCTAAGAAAACAGATCCAGGCCGGGCACGG

GGGCTCACACCTATNANCCCAGCACTTTGGAAGGCTGAGGCGGGTGAATCACCTGAGATC 

AGGAGTGCGAGACCAGCCTGGCCAACATGGCAAAACCCTGTCTCTACTAAAAACACAAAA 

ATTTGCCGGGCATGGTGGCAGATGCCTGTAATCCCAGCTACTTGAGAGGCCAAGGCAAGG 

AGAAANTTGCTTTGAACCTGGGGAAGGCCGAANGNTTGCAANTGAGCTTGAAGATTCGGC 

MCNMCTTGCACTTTCCAANNCCNTNGGGNTGACNAGGANGATAAGGACCTTCCNTTCT 

NCAAAAAMAAAAAAAAMGGGANAAAAAAAAAAAMGGGNTCCCTTTTNGGCCCGGTTT 

TTT ANAMC TTMGTGGGMTTCCCCCCNGGNCCTTGNGNGGGMmCGNNTNTNNNAA 

AGCTTTTTTCGAATTCCCCGCCCCANCCNTNNGNGGGGGGGGG 

Sequence 2833 

CTAATTGGAGCTCCCCGCGGTGGCGGCCGGGCAAGGTACTTT C I I I U 1 1 1 1 1 1 1 I M 1G 

AGATGMGTTTTGCTCTTGTTGCCCAGGATGGAGTGCAATGGTGCAATCTCAGCTCACTG 

CMCCTCCG(X)TCCTGGGTTCAAGTGATTCTCCTGCCTCATCCTTCCTAGTAGCTGGGAT 

TACAGGTGCCCACCACCACACCCAGCTMTTTTTGTATTTTTAAGTAGAGAATGGGGTTC 

ACCATGTTNGGCCAGGCTGGTTTNAMCTCCTGACCTTAAGGNGAANCCCCCTTTGCCTT 

NGGCCNCCAAAAAGGGNTNGGNAATNANNAGGGGGGNNACNCNCCCCCTTNNNCNNANAA 

AAAGGGGGN 1 1 1 1 1 1 1 rNTNTGGGGGGGGGGAAANATTTTNNAAGGGGGGGNGCCCCCCC 

NNC CCCTNA NAAAAAAAAANAAANCCCCCCCCCCGNGGGGANNAAMNNNTTTNTATAAA 

ANANTTTTTCCCCCCCCCCCCCNGGGGGGGGGGGCCCCCCCCCANC I rTTTTTTTTTTTN 

TAGNAAGNGNNNCAANCCCCCCNAAAA 

Sequence 2834 

GGGGNAMCCCCGGNGGCGGCCGCCCGGTTTGGAACCNGGGTTNAAACCCCCGGNTTTNA 
ACCNCANAACCGCAAGANAACGGGNGNAAAAAAAGGGAAACANANCAGCNGTCCAAAGAA 
AACAAAANGNGGGCAAACC 
Sequence 2835 

GGTGGCGGCCGAGGTACTGATCATGGAACTCTCGGGGAGGAAATGATGTTTTCCTTCTAC 

CCATCTTATGTTCATTGGCTGGG GCTC CTGGAACAGAAGACAGATTTACAAAAGAGAAAG 

GCACACAAATTTATGTMTATMGTTTTACATGACATGGGAGCCTTTATAAGGAAATGAC 

CCAAGGMATGGTTAMCCTGAGTGGTTTTGNGTTAGGTTTGATGAGCAATGAAAAGCTA 

TGGAGAACTATGATAGGAGGAGTGTGAGCTAAACGCAATGAACTGGGGGAAACT 

Sequence 2836 

TACTTAGG GCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTA CI I I I I II I II 11 1 I 1 1 1 
I I I I i 1 1 iCTCAAAACGGCCTTCCTTAGTCCTGTAAAACCTGGAATGCACATAGTCCTGT 
AATGGCCATCCCAGAAGTGAATCTACATGATTATTCAAATTCAAATACCTAANAAAAAAA 
ATCACTTGAAGTTNTGCMCTTCCCMATGCAMTTCCTGANAAAGAAACTTGACTGAGC 
CAG7TCMCTTTCTGTATTGGATTANAAATCCTATCTTGCTGGCTGGTCTGTANATTGGG 
TTGCTCTTGAGTCAAMTGANCCTTTTTTTAMGTTMGCCNCTTTTGCCCNT^ 
GGGGCCCCNCT NGGGGGGG GNNAAAAAMTMTTTNNCCCGGGGTTTTNTTTTTTTNCCC 
CCNAAAAANCTN I I I I 1 1 I I ICNAANCCCCCCNNAAAAAAANNGGNCCCCCCCCCCCTNG 
NNGGGGGGGGGGGGGGGTNTTTTTNTCCNCCCCCCCCCNCAAAAA
Sequence 2837 

CCGGGCAGGTAC I I I I II M 1 1 1 I I I 1 1 I I II 1 1 1 GGGTTTTTAGTAGAGACAGGGTTTC 

ACCATGTTGGTCAGGCTGGTCTTGACCTCCTGACCTCATGATCCACCGGCTTCGGCCTCC 

TGAAGTGCTAGGATTACAGGTGTGAGCCACTGCGCCCAGCCCTGAGAAATAGTTCTTCTA 

ATTGTCATCCAGTTTTCATCTGAGTCCTGTTGTTCTTTGGATATGTGCCCTTCCAGAGCA 

CAGCAGGGGTTGTTCMGTCTTCCANAAMGCAGCTCTTGTTTCTCCTCATGTGGTGGGA 

GTGGAGTCAGAGCGTGGCTCAGGCCCCACATTCTCAGCTGTTTGGATCTGGGGACTCGAA 

GTTTCTGGTGGTTACITCTGAMGTCTTTTCCAGGATAATTATTCTTGCTCGGTT^ 

GCATCTCTGACAGGCTGGTTTNCCTGCTTCCCCGCTT 

Sequence 2838 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC I I 1 1 1 I I I I I 1 1 I I I I 
TTCi i i M 1 1 CTCTGTGGAAAMGAMGMTTTGACTTTATTTAG AAAGTCTACAAAATA

CA GAAGAC GA TAACTCGC TTGCTGTAAGTCAGGAAATAAATAAATTCTAGGAGCCGGGCA 

ATA 1 1 1 M lAACI I I I I 1 1 1GAGACAGGAGTTTGCTATGTTGCCCAGGCTGGAGTGCAGT 

GGTGCGATCTCAGCTCACTACAACCTCTGCCTTCTGGGCTTAAGTGATCCTCCTGCCTCA 

GCCTCCCMGTAGCTGGGACTACAGGCATGAGCCACCATGCATGGCTMTTTTTGTATTT 

TTTGTAGAGACGGGGTTTCACCATGTTGTCCAGGCTGGTCTCAA 

Sequence 2839 

AGGTACTTTTTTTC I I I I I 1 1 I I I II 1 1 I I I GGAG ACAGGGTGTTTATCTGTCACCCAGG 

CTAGAGTGCAGTGGCGGGATTACTGCTCACTGCAACCTCGACCTCCTGGGCTCAAGTGAT 

CCTCCCAGCTCAGCCTTCAAGAGTAGCTGGGACTGCAGACCTGCACCACCACGTCCAGCT 

GCCCGGTTAA I I I I I I I CTGTCGGTTTGAAGAGGGGAGAAGGTCTCACTATGTTGCCCAG 

GCTTGTCTCAAACTCCCGGGCTCAAGCAATCCTCCCACTGTTGGCGTCCCAAAGTGCTTG 

GGGTTACAAGGTGTGAGCCACCACCACACTGGGGCTCTGCTCTGCCTTTCTGAGTTTTGG 

GTTTTCTGCTTATGGNGGGGGAGCTrrGTTCCCGTTCTTCCCCACAAAGAACCCAGGGAT 

GTGGCACMGCTTCCCTGCCCGTTTTTCCTTTAACTTCAAGTTGGG 

Sequence 2840 

CCGGGCAGGTACAACTGGAAACAGCCACCGGAGAGAGAACTGTCTCGCCTTCGCCGGCTT 

TACCAGGGTCATCTCCAAGAAGAGAGTGGCCCCCCACCTGAGTCAATGCCCAAGATGCCC 

CCTAGAACACCAGCGGAAGCCTCCTCCACTGGGCAGACAGGCCCTCAGAGTGCTCTGTAG 

GAGCTGTAGACTGGGAAGAGAGGCCAGGCGTGGTGGCTCACTCCTGTAATCCCAGCACTT 

TGGGAAAGCCAAGGTGGGCCTTGATCACTTTGANTCCCAAGGAAGTTTTTGAGACCAGCC 

TTNGGCACCATGGTGAAMCCTTTGTCTTTACCAAAAAATACAAAAATTTAGCTGGGTGT 

GGTGGTGCACACCTGTAGTCTTAACTATTGGGGGAGGCTAAGGTAGGGATTCACTTTGAT 

TCCCAAGGAGGCGGAGGGTTTTGCANTTGAGTTTGGCANTTCM 

AGCCTTGGGGTGGACAAGCTTAAMCCCTTTNTTTTCAAAAAAAAAA 

Sequence 2841 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCATGTCAATGGAGTAAT 

GCTCCCAGGAGAGTTATGGCTGCTCTGCTATGTCATGCAGGTTGTTAGGGAAGTAGGGGA 

AAGCTAGAAGTTACAGGCCTTACCCAGCTCCCATGCAACCCAAAAGGCCAGTCTCACTCC 

CACCGTGCCCCACCCTGACAGCACCMGTTTGTTTCCAGGCAGTGAGTGAGCAGGGCTGA 

GMCTTGTCCCAGGCTACCAGCCTGCCAGCTGAGAAAGAMGCATGGCTTTTGCATCTTT 

TTG CCTG TTGAGTCTGCGCACTGGATTTATGCCCTCCCTCGAGTTTTGGCCGGGAGATTC 

ACGTTTTGGTTCAAGTGGTTACCAAAGTTTACTTGGGGAGGTTTCCnTTTCCTTTGGGTC 
TT 

Sequence 2842 

CNAATT GGAGC TCCCCGCGGTGGCGGCCGCCCGGGCAGGGTA Cl I I I I II I I H II I HI 

TTTTGTAi I I I I AGTAGAGACAGGGTTTCATCATGTTGGCCAGGCTGGTCTCGAATTCCT 

GACCTCAGGTGATCCGCCTGCCTTGACCTTCCAAAGTGCTGAAATTACAGGCATGAGCCA 

CGATGCCCAGCCTGAGGAACAGATTTCTATATGGCAAATAATAAAGGCAAATAAAAATTA 

ATGCTAAAATAGAATGAGGAMGTATTNTTTTNTTCACCAGAATGGTTGTNANCANAAAT 

GNTTTGCACCAGGGTGGNNTTNAAAAAMCCCCNCNCTTNNATTTTGGGTTNTNCCTGGG 

GGGNTNTNGGGNNCATNAMNACCNTTTTANNNTNTTTTTTTNANN 

Sequence 2843 

AGGTACTTTAACTTC I 1 1 I I I 1 1 I I I I 1 1 1 GAGATGAGTTTTGCTCTTGTTGCCCAGACT 

GGAGTGCAATGGCGCAATCTCGGCTCACAACAACCTCTGCCACCCGGGTTCAAGCGATTC 

TCCTG CCTCG GCCTCCTGAGTAGCTGGGATTACAGGCATGCATCACTATGCCCAGCTAAG 

TTTGTATTTTTAGTAGAGACGGGGTTTCTCCATGTTGGTCAGGTTGGTNTCGAACTCCCG 

ACCTCAGGCGATCTGCCCGCCTCGGCCTCCCAAAGNGCTGGGATTACAAGTATGAGCCAC 

CGCACCCAGCCTAAGATCCMGATTCTTATGTTTTCTCTCCTTGCCTTTTGGAACCTGCC 

CGGGGCGTGAGCGGCCCGCCGGGCCAGGTACCAACNAGAAACNCAAACACCTTGCNCAGT 

NTNTCNAGGCACCNTTTCCAAAMCCAAMTTTTGGANAAGGTGAAACNTTAACTTNATA 
TNGGCTTAAATTTTAA
Sequence 2844 

ACCGCGGTGGCGGCCGAGGTA C 1 1 1 1 1 1 1 NTTNTTT A I I 1 1 1 1 1 1 ATT A 1 1 1 1 1 1 NTGGG 

GACGGAGNGNCCCTCTTGNTGCCCAGGCTGGAGCGCNATGGCGTGATCTTGGCTCACTGC 

AGCCTTCGNCTNCCGGGTATCAAGTGATTCTCCTGCCTCAGCCTCCCNAGTAGCTGGGAT 

TACAGGCATGCNTNNACCATGCCCAGCTAATTrTGTATTTTTTAGTANCAAAACCGGGGG 

TTTCACCCATATTGGGTCAGGCCTGGTCTCGAACTCCCAGACCTCAAGGGTGGATCCCGC 

CCACCCTCGGCCTTTCNCNAAACCTGCTGGGGAATTACCAAGGCCGTTNAAGNCCAACCC 

GNCGCCCTNGGCCANGGGGGACCTNATACTTCTTTTrTTAAAAAAAAAGACATTTTTGTN 

GGGGGGCNTCACCACCCNTTATATTTNAAAAATTAGGTTNCCTTGCC 

Sequence 2845 

CCGGGCAGGTA C 1 1 I 1 1 1 I I 1 1 1 1 I I 1 1 1 1 1 1 1 1 1 1 1 I CCTCATCTTTCTTTGATAATGN 

GNGAAGCCAAGGATAGGCAGGAACCCCAAGTTCCNTNTNTCCTGGGATTGTCACAAAATT 

CCCCCTNTCX5CCATCGCATCTATTrTTTGTTTGTTTTI^nTTTNTTCCCAAAGAGGGAA 

ATACCATGGGCTTTANAGGAATCATCGCTCACTGNGGGAACCTTGCCCCCTCCCCAAGCC 

ACCCCACTGNGCCCATAAACGTGCTGCCTGTAACCAATTGTTCCTGTTGAATAACAATGC 

GAGCTGAGGGGCTTTTTNTGCCTGAGCTGCAMTANATTAGGCTGCTCCCTTTTATGTGN 

GCAAANACATTACCCAAAAGC 

Sequence 2846 

CCGCGGTGGCGGCCGCCCGAGCAGGTACCTCAGAATGTAACTGTATGTGGAGGTCTTTAA 

AGCAATGATTAAGTTAAAATAAGGCTATTAGAGTGGGGCCCTAACTCTAAATGACTGGAT 

TTATATGAAGGAGCTAGAAGAAGGACAGACACACCAGGGTCCTGTGTGCACAGAGGGACA 

ACCATGTGAAGAGGCAGCAAGAGGCCAACTGCAAGCCGAAGAGAGAGGTCTCAGGGGAGA 

CCAACCCTGCCAGCACTTTGATCTCAGGCTTCCAGCCCCCAGACCAGTGATAAAATAAAT 

TTCTGCTGTTTAAGCCACCCTGTCTGTGGCATTTTGTTCTAGTGGCCCTAGCAA 

Sequence 2847 

CCNGGGGCGGCCGAGGTACCATNANGCTTGCAGGGGCTGAAGCATGGTTTGTCCANAACC 

CCAACCACCAGGTCTATCGNNCTCTTTCTGNCACCTTTTTNCT CI 11)1 I CCTTCTNCCC 

TTGCACCTGAGGNCCTGGAAGGCCTTGATGAGGCCCAGCAAACAGGCATTCTCACAGCTG 

GGTTTATAGTCTTTGGGCCCCTTACTCAGTATCCTGGGAACCCTGGGCCAGGAAGTTAAC 

AGTGGTCAATCANAAANTNCTGAANAAAATCCCCCTCCCCCTG 

Sequence 2848 

CCCCCCGNG CNGGN NCANNTTITGGGCNNTTrTrGGGTTTTCAGNANGGTTNGTGGAGNA 

TCCCCNGNGGTTTTTNNNAAACCCCCCCNCNNAACANAGACCAAANGGGGGGNNGNGNAG 

GGGGGNGGGGNCNNTNAGAGAGGNGGGGCGGGGGCAGCGGGGGGGNAAGNNGGNNGNNNA 

GGGGANGGGGAGGGGCNNNCCCCCANACNNGGAGGNGGAAGGGGAAAGAACGGCNAGGGG 

NAAAGGCCGGGGGGCCACCNAGCNCNGGGGGNNCNACNGANGGGGGAAGGAACGGGGGAA 

ACCAAAGGCCCNNCCCCCACCCAANAANGGCCNGGGAAAAACCCACANNCNGGGNAAGGA 

NAAAGGGGCCNGANGGCGAAGGANGGCNCCCAAGCAGGAGGCGGGNNGGGGCCGCAACNG 

CCCCNAAAAAANGCCCCCGGGNNNACCCCAAGGGGGGANGGGGGNCANAAAAGNNGGGGG 

GGAAGGAACNGGGCCNNAGGGNGGCCCCCNGNNAGANGNGNNNNGGCGGAAANCCCANGC 

GCCNGGGGGGNNNANCCAAGGGGGNGGGA 

Sequence 2849 

GAGATGCAGTCGATTNCATACCTANTGGGTCCCANTCCTNNNTNNGGNCNGTTGNGAAGC 
CGGATAGTGACTGAGATCACTGGGTAGACCTTGTCCACCTTGGCATTCTTGTCTGCCAAG 
GTCCATGGCCCATGGGGATGGGGACAATTTGAGTGGG 
Sequence 2850 

GGCAGGAACTNTTCnTrNMTCTTTTTTThrrNMGTNAGNGGTAATTTAAAAATCTGAAAT 
ATAGGCTGGGCGTGGNGGCTTACGCCTGNAATCCCAGCACTTTGGGAGGCTGAAAGTTGG 
GGCNGGATTCATCTGAGCTCGGGAGTTCAGGGACCAGCCTGACCAACATAGAGAAACCCC 
GNCTCTACTAAAAATACAAAAATTAGCCANGGCGTGGTGNGCACCATGCCTGTAATCCCA 
GCTGTTTCAGGGAAGGCCCNANGCCANGANAAATTTGCNTGGAACCCCGGGGAGGGCCGG

GAANGTTTGTTGGATGGAGCCCCGMGATCANCCAATTTGNNNANCTNNCAMCTTTTGG 

GNTTGMCAMGMGCCNAAAAAmTCCCCMTTCTTTCATNAAAATACAANGAAAACNT 

TANANAAAMTTTTTTGAAAAATTAATGGGGACCCCNTGGGAAANGAGGCCCNNTAAAAA 

AAAAAAAAAA 

Sequence 2851 

CNANCGGAAGGGGCCCCNGGGGGGGGGGGANAAGGGCAAAACCCNGGNCTNAGCAACCAC 

ANGGGGGGGCCGGAGCCNNGGGGGGCCAGGACCAGGGGAGAGGGAAGCCCCAGCCCNGAG 

GCGNNCGCACCACNCNNAGCACCAGCGCACCAANACNGCAGACGAGGAAGGAAGCACAAG 

CNCCCCACNNNACAAAGGGGAAACCGAGGCGCGGGNAGCGCGGNCCCGGGGACGGCCGCA 

CAAGAACNAGGGGGACCCCCCCGGGCGGCAGGAGNGCCGGNNAGAAGCCAAAACGAANCC 

GGCGAACCNGGGAGGGGGGG 

Sequence 2852 

AGGTACGCGGGGATGCGCAGTCGTGAGTCCTCTTGTCCTTGAGCGTCAACCTTCTTTCCC 

TGMGTGGCTG GGGTTC CTGTTTCCTTCTTTGATTGACAACTTGTGTTAACCCTCGCACA 

TCTCTGGGCCAATTTTTGCTTGAAAATGGCAGCTCCCGAGCAGCCGCTTGCGATATCAAG 

GGGATGCACGAGCTCCTCCTCGCTTTCCCCGCCTCGGGGCGACCGAACCCTTCTGGTCAG 

GCACCTGCCGGCTGAGCTTACTGCTGAGGAGAAAGAGGACTTGCCCGAAGTACCTGCCCG 

GGCGGCCGCTCTAGAACTAG 

Sequence 2853 

CGGNGGCGGCCGCCCGGGNATGGTACCCTCTGTCACGGCTTCCTTTTNCTGGAAAAGGGA 

ATTTCCCAACCCCGGGTGAGGCAATGCCCCGCCCTGCTCCGTGGGCTGCACCTGCTGTCT 

GTCAAGCCCCAATGAGATGAACCCTGTACGCGGGGGCCTGGGATCTCAAAATGGCGGCCC 

CGTGCGGAAACAGCGTNTGGGAGCANNCATGTTGCCTNCTGAACAAAGCCGTTGAAGATG 

AAGAATGGGCAAAATCGCCCCATACGGAACAAGCGCANCCTNGGGAGCCCGATACCTGGC 

NNGCGAACACCAMCGGGAGAATTTCGCCAATATGGATGTGACAGCGGTTCX^CATTAAAG 

CGGTGATAGGGATTTTT 

Sequence 2854 

CCGGGCAGGTACGCGGGGTGGGCATTCTGGGTAACAGAGCTATTTACTTCCTGCGGGTGC 

ACAGGCTGTGGTCGTCTATCTCCCTGTTGTTCTTCCCATCGGACGAAGATGGCCCTGGAG 

ACGGTGCCGAAGGACCTGCNGGCATCTGCGGGCCTGTTTGCTTGTGTTCGCGTGGTCAAG 

ACTAGTACCACCAGNTTTAGAATATGATGGCTTGTNGACAACATGTTGATGTCATATGNT 

ATCAAAATCG AAAC GGNGTCANCTCCGMGMGGATGGGTTATTATTGACtTGCACCTTA 

GCCTNTTCGCTTTTGCATGGGGATA^^ 

TACAGGCCTTGGGGrn"CNTTTCAMGGTOT 

Sequence 2855 

CTTTCATGTGATCTTTGTGGCAGTGGGACAGGAAGTAGGCGCGGGCCCTCAGGTTCTCCC 

TATCGAAGCGGTCTATGGAGATAGTTGGATACTCGGCCATCTGCCCCTCGAAAGAACTCA 

TAGCGCCGTCGATCCCAGAGTCCGGGACCCCAAAACCGCAGCTGAAGCCAAGGCCAGCCC 

TGACNCGCCCCCGCGTACCTCGGCCGCTCTAGAACTAGTGGGATTCCCCCGGGCTGCAGG 

GAAT TNGATA TCAAGCTTATCTGATACCGACCGACCTTCNAGGGGGGGGCCCGGTTACCC 

AAGCi i M i i GTTCCCTTATAGTGGAGGGTTTAAATTTGCGCCGCTTGGGC 

Sequence 2856 

GGGCGATTTGGAGCAAACCCCGGGGGCGGCCGCCCGGNTTGGTACCAAAANTNCAAACNA 

CCANTTTNGAANCCGGCGNNGACGNNNGCGGNCCNAGCTACTCTGGAGGCTGAGGNGGGA 

GGANCGCTNGAGNCTGGGAGGCAGAAGTTGCAGNAAGCCGAGATCATGCCACTGCACAAG 

CTAGGTGACAGAATGAGACTCTGTCTCAAAAATAATTAAAAAGCCTCTGCCCCAAACTCG 

TTAAAAGA7TTTATAACCACAACTGCTGNTTCTGNGNAGATGCATCTGCATGCCCAGGAG 

CAGTAAATGCMTMMNCATTTGGNTATACTTTGAACACAAAATAAACGGGTGAGGCTT 

TTACTTTCGAAAAAAAAAAAAAAAANAAGGGNCCTCGGCCGCTCTAAAACTAGGGGGANC 

CCCCGGGCCGCANGGAAATCGATA 
Sequence 2857

ACATTAATTTGCGTTTGCGCTCACCTGCCCGCTTTTCCAGTCCGGGGAMCCTTGNCGTG 
CCAGCTTGCATTTAATTGNAATTCGNGCCCAACCGCTGCNGTNGGAGAGGCCGGCTNTTG 
CCGTATTTGGGGCGCCTCTTCNCGCTTTNCTCGGCTTCACTTGACTC 
Sequence 2858 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTGTCCTCAGCTTGGG 

CTTCTTCCTCCTNCATCACCTGAAACACTGGACCTGGGGGTAGCCCCGCCCCAGCCCTCA 

GTCACCCCCACTTCCCACTTGCAGTCTTGTAGCTAGAACTTCTCTAAGCCTATACGTTTC 

TGTGGAGTAAATATTGGGATTGGGGGGAAAGAGGGAGCAACGGCCCATAGCCTTGGGGTT 

GGACATCTCTAGTGTAGCTGCCACATTGATTTTTCTATAATCACTTGGGGTTTGTACCTG 

CCCG GACACATCCAGTAGGCTAAGGGGATGCnTTCCTrTTCTGGGGTTTTTCGGGGGGTT 

TTTTGGAGCGGGGAGAGGGATGMNGAGGTGCTCCCCTTMTTTCTTTATTGAGAATGAT 

GCCGTGGATACTTGAATTTAAGCANTTGTCACATGGGCAGTGTCCTACCTGGGG 

Sequence 2859 

AG GGCG ATTTGGAGCTCCCCGCGGTGGCGGCCGGGCANNTACTTTGCGGTTTTTGGGACT 

TGATTTTNGCAGAGGGATCGGGCACTGAAGGTGCAGTTCTCAAAATCACACCTGNAGGCT 

GGCTCCTCGCTGTGGGTATCCAGGTGCTTCTGGAGGTCAATAAGATTCTTGCAGCTGTAG 

TCACAACAGTCACATTTAAAGGGCCGGTCCTCACTGTGACGAAAGCGCATGTGGTTGCGG 

AGGGAGGAAGGCAGCGGGCAGGTCATGTCACACAGAGGGCACTTATAGTGATTCACATGG 

TTGCGCA 

Sequence 2860 

ATGCGTTGCNGCTCACTGCCCCGCTTTCCAGTCGTGGAAAACNCTGTTCGTGCCCAGCCT 
GCATTTAATGGAMTCGGCCAMCCGCCNCCGGGGGAGGAGGGCCGGTTTTGCCGTATTT 
GGGGNCGCTTC^CCCGCTTCCnTCGCTCMCTGGACTTCGCTTGCGGCTNCGGGGTTCG 
NTTCCGGGCTTGCTGGGCCGAGGCCGGGTATTTCANCCTTCAACTTCAAAAG 
Sequence 2861 

CCCTAGGGCGTTTTGGAGCTNCCCCCGGTGGCGGCCGAGGTAC i I I I I I 1 1 M I I I II I N 

CCCGGAGNTTTNTMNAAGATTTATTTAGCAAAAATACATATAGCCATTATTGCAAGACT 

TAAATGAGA TGNT AAATGTTCMCCCAATTTTCTTTCCTGGATMGTTTTTCTT^ 

CCCTGTCAGTTTTGAAAACATAATACCAGAAGAAGGGGGGCCCAATTCCACAGAGAGCTC 

CCAAGMTGAGTTTCTGGGAGTGAGTCTGAAGTTGAGATAAACCTTTGCTGATCTTGCTT 

ACGTTCAATGCATCTGGGCAGCGTCTTTGATGAGCCCTGGCGGTTAGGCTGGTGGCACTG 

MGCAGGCCTCCAGGGTCTCCTGTTTMGCAGGATTTTAAGGCCAACCCTGC 

Sequence 2862 

CATCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTGGGA 

GGCTGACGCAGGAGGACCGCTTGAGCTCAGGAGTTCAAGACCAGCCTGAGCACCATAGTG 

AGACCTCATCTCTACTAAAAAAAAAAATAAAATACCAGGCATGGTAGCATGTGCCTGTAG 

TCCCAGCTACTCTAGTCCCAGCTACTTGGGAGGCTGAGGTGAGAGGATCACTTGAGCCCA 

GGAGATCGAGGCTGCAGTGAGCCATTATCACGCCACTGCACTCCAGCCTGGGCAACTAAG 

CMGACCCTGTCTCAAAAAAATTTTAAAAAATTTAAAAAATAAGAAAATCCAAGCTAGGT 

TGAAATCTGAATGTTGAGCAGNTCAGTGAGGCACAAACTTAGCTTAAGAAAGTCAACCCT 

GCCCACTTGCCATTTTGAAGGTTATTACTAGCCAAAATTACN 

Sequence 2863 

TA TAGG GCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGAAAATTAGCAAGGAGA 

CATTT TCT GCATT GTG AGMA TCMCATAGACACCTTAMGACCCCTTTGAGAGTGTGGC 

TTTTTGMCTTTTCAGATTTTGCTCAGTGACCTGCTAACACTTACGTGAGAGGCTCCAGG 

TGTAAATAGAATCTAATGGCAGAATCTGTAAGTGTAAACAAGCATCTTAGGAGTGAGAGA 

TCAAGACCACAAA ATGTC CA GAGCTAT GACCACAGCTATACCTACCCATAAAATACGATA 

CTGGAGTAGGGTATTTTTGTCTTTTTTCTTACCTAAGAGCTAGCTAATCAGGACAGGTGA 

TGGCAGGTTCTGGAGCTCTACCAGGGCAGGTCTATTTTCTTTTTT 

Sequence 2864 
AGGTACAGCTCTTCTCTTGTCTTTCTGAGCTCTTACCAGMTTGCCTTCCGTGTCTTTTT

TTTTGGAGATGGAGTCTCACTCTGTCACCCAGGCTGGAGTGCAGTGGCGCAATCTCAGCT 

CACTGCAACCTNTGCCTCCCTGGTTCAAGCGATTCTCCTGCCTCAGCCTCCCGAGGGGCT 

GGGACTACGGGTGCACGCCACCATGCCTGGCTAATTTTTTGTATTTTAGTAGCGACAGGG 

TTTCACTGTGTTGCCCAGGCTGGCCTTGAACTCCTGAGCTCGGGCAATCTGCCCGCCTNA 

TTCTCCCAACATGCTAGGATTATAGGCATGAGCCACCACGCCCTGCGGTGTCCATATTTC 

TMCAGCAGTCACTTCAGAGCCATCTAGGTTTCTTCTAACATGCACCTAAAAACTCTTTG 

GCCTNTACTACCTAGNTNTGAAACCATTTT 

Sequence 2865 

CGGCCGCCGGGCAGGTACAGNTGCATCANCTGCTCGTAGGACATGTCCAGCAGCTGGTCG 
AGGTCCACGCCGCGGTAGGTGAACTTGCGGAAGGTCCGCTTCTTCTTCTGCTCTACTTCT 
GCCACCCGCGTACCACGGCTATCCTTATAGCTTTTTAAT 
Sequence 2866 

CGCCCGGGCAGGTACTTTCCCCTCCCCCAGAAAAGCAGGACTTGCCCGCTAAGGGTGAAG 

GACCAAGGCAGGCGTCCCTGAGTGGTCTGACACCTTTGAAACGTGAGTGAATAATCAGAG 

AGGTGTCCCTGCAATGATTAAACACCAAGGAAAGGCTGCCTTCCCAGTCTGTGACCAGCG 

CCAGAGTTTTGGGTCCACGGATAAAACGTGTCTCGTTTGTCTCTACCAGAAAATGAAAGG 

AATTGAAATTAAGAGAAGGGAGAGATTAAAAAAAAAAAAAGAAAAAAAAAAAGTACCT 

Sequence 2867 

CCGCGGTGGCGGCNCGCCCGGGCAGGTACTTTCCCCTTCCCCAGAAAAGCGGGACTTGCC 

GCTAAGGGTGAAGGACCAAGGCGGGCATCCCTGCATGGTCTGACACCCTTGAAACGTGGG 

TGAATAATCAGAGAGGTGTCCCTGCAATGATTAAACAGCAAGGGAAGGCTGCCTTCCCAG 

TCCGTGACCAGCGCTGGAGTTTTGGGTCCACAGATAAAACGTGTCTCCTTTGTCTCTACC 

AGAAAATGAAAGGAATTGAAACTAANAGAAGGGAGAGATTAAAAAAAAAAAAGAAAAAAA 

AAAAGTACCTCGGCCCGCTCTAGAACTAnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTGTTCCCTT 

TAAGTGAGGGTTAnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnAGCTGTTTCCTGTGTGA 

AAATTGTTATCCCGCTCACAATTCCACACAAACATACGAACCCGGGAGCATAAAGTGTAA 

AAGCCTGGGGGTG 

Sequence 2868 

CCGCGGTGGCGGC CGAGG TACAAATTGTCGTTTTTATTCCTCTTATTGGGATATCATTTT 

AAAMCTTTATTGGGTTTTTATTGTTGTTGTTTGATCCCTAACCCTACAAAGAGCCTTCC 

TATTCCCCTCGCTGTTGGAGCAAACCATTATACCTTACTTCCAGCAAGCAAAGTGCTTTG 

ACTTCWGCTTCAGTCATCAGCCAGCAAGAGGGAACAAMCTGTTCTTTTGCATTTTGCC 

GCTGAGATATGGCATTGCACTGCTTATATGCCAAGCTAATTTATAGCAAGATATTGATCA 

AATATAGAAAGTTGATATTCAACCTCACAAGGGCTCTCAAAGTATAATCTTTCTATAGCC 

AACTGCTAATGCAAATTAAMCATATTTCATTTTAACATGATTTCAAMTCAGTTT7TCA 

TACTACCCTTTGCTGGAAGAAACTAAAAATATAGCAAATGCAGAACCACAAACAATTCGA 

ATGGGGTAGAMCATTGTAMTATTTACTCTTTGCAMCCCTGGTGGTATTTTATTTTGG 

CTTCA7TTCAATCATTGAAGTATATTCTTATTGGAAATGTACCTGCCCGGGCGGGCGCTC 

TAGAACTAGTG 

Sequence 2869 

GCGAATTGNAGCTCCCCGCGGTGGCGGCCGAGGTCTAATTTGAATTTGTAATGAGTCTGA 

TGGTATATTTCAATTTTTTGCTTTGAGGGACTGGCTGCTACATTGCAGAATATCTTATAT 

CCCTGACTGCTTTCCACTAAATGTCAGTGGTGACCCCAATCCAATATTATGACAACTGAA 

CATGCTTATGCATCCCTCATGCCTTTAI 1 1 I 1 1 A TTTTGGGAAATCTTTCAGCTTCAGTT 

TTTGCTGATATTTATGTGATTCTTTGTTCTGCAATTCAAATTTCTGGGAGCCAAACAGTC 

TCCTTGGTTCAGATTACTGTTTTTTGACTAGAGCTTCTCGCTTCAGATTCTGTCATAAGA 

TTATGGCTTAACCTATGGTTGTCCTTTGATTTGGTGCCATATGAAATAAAACATTATTTT 

CTATGGCTATGTATTMGAATTTTGTGCMTTCTGTTTTTCTTAGAAGGCTGAGGGTGTG 

TTGTCAGACACCATGACTGATGTGACAGGTGTATTTTATTATGC 
Sequence 2870

ATAGGGCGMTTGGACTCCCCGCGGTGGCGGCCGAGTCCTACCCTTTACTTTTTCCCCAA 

GACCATCTCAGGGTGGAGCATTCTGTCTAAGAGAAGAAAGATAAGGAGGCTCCCACCCAC 

CTCTCCCMGAGCAGACATTAAACATCTTTGTGCTTTGAAGAGAGTGAATTTGGATAGTC 

TTGTGATTCTCMGACTMCTTCCAGMTTATACTTTAACCCCTTCCAGATATGGTCCGC 

CTTTGGCATTGTGTGTACCTGTGATGGGGCGTGTGGTTTCCGGTTGTCTCACCTTTAATT 

GTCMCCTCCAGTGTATGACTCTAGAAATATGAGGAAAAGCTTTTCAGTTTTTAAAATTG 

CCATTTAAATTTAGTCTATTAAAAACAAACCTAGAGGTCTTGGGTTGCAGTTGATTTCAG 

AGTATATTAATTTAGTGGGTCCCNAAAGTATTACATNTATTTATATTCTGGAATGAAAAG 

G 

Sequence 2871 

CCGCGGTGGGGGCCGAGGTACTCCTTCGTAAACCATGGAGAGCCAGCCCAATGCACAGCA 

GTGGATATCATCTTTCTCAGAGTCCAGTATCACAGAATCACGACTTTGTCCAGCTGCAGG 

TGCCTGCAGGTCACACTGGCTAACTACTTCTGTGATGGGCTCTTCTTTCTGAGGTTCTGC 

CMCTTGTCTACTACATAGGGTTGATCATCCTGTTCAGGAAATATTTCTTTCATTTGCTC 

TGAGCTTMTATTGTMTTTGTATTTGATCTGCTGGGTCTTTGGAGTCAGGACTGGTTTT 

ATCAGCAGTTTGATCTTCTGAGGTCTGGTATGTAGTTTGCTGGCCCACAGAACCTTCACG 

TGTATTCACAGCCTCAATGCCATAAGGAAACTCTTT 



